Hanyar ban sha'awa, wanda na gano a cikin DiarioLinux, don gano ɓatattun hanyoyin haɗin yanar gizo (cikakke) ta amfani da kayan aiki mai ƙarfi wget.
Wannan yana da sauƙin aiwatarwa akan Linux. Na bude tashar mota na rubuta:
wget --spider --no -parent -r -o log.txt http://tuweb.com
Anan aikin kowane sigogi:
–Spider: shiga cikin gidan yanar sadarwar da zaka fada musu, amma BATARE da zazzage komai ba. Kawai tafiya da shi.
- r: recursive, kamar dai shi ne Google robot 🙂
- o fayil: fitowar aiwatar da umarnin da ya bayyana akan allon
–Baba-iyaye: idan muka wuce sunan kundin adireshi a matsayin siga, ba za mu so hakan ya hau kan kundin adireshin iyayen ba.
Yana iya ɗaukar lokaci mai tsawo kafin a gama (kamar yadda ake bincika duk gidan yanar gizon ko kundin adireshin da kuka nuna). Jerin hanyoyin da aka fasa zasu kasance a log.txt (bangare na karshe)
Idan kuna son ƙarin koyo game da wget, na rubuta
mutum wget