What is wget?
Nothing explains what this tool is better than Wikipedia:
GNU Wget is a free software tool that lets you download content from web servers in a simple way. Its name comes from World Wide Web (w) and from "get", that is: get from the WWW.
It currently supports downloads over the HTTP, HTTPS and FTP protocols.
Among the most notable features wget offers are easy recursive downloading of complex mirrors, conversion of links so that HTML content can be viewed locally, proxy support ...
We have already talked quite a lot about wget here at DesdeLinux. In fact, we have already seen how to download a complete website with wget. The problem is that nowadays administrators do not always let just anyone download their whole website like that; it is not something they really like ... and, obviously, I understand. The site is on the internet to be consulted: the reader gets content they find interesting and the site admin benefits financially (through advertising), in visits, etc. If the reader downloads the site to their computer, they will not need to go online to look up an old post.
Downloading a site with wget is as simple as:
wget -r -k http://www.sitio.com
- -r : Downloads recursively: wget follows the links it finds from the start page (by default up to 5 levels deep), which is how the whole website gets downloaded.
- -k : Converts the links in the downloaded site so that the pages can be browsed on a computer without internet access.
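For readers who prefer long option names, the same idea can be sketched like this. The helper name mirror_site is made up for illustration, and --page-requisites and --no-parent are extras that the command above does not use; they are standard wget options, but whether you want them depends on the site. Setting WGET=echo prints the arguments instead of downloading anything.

```shell
#!/bin/sh
# Set WGET=echo to print the arguments instead of downloading (dry run).
WGET="${WGET:-wget}"

# mirror_site is a hypothetical helper name, not part of wget.
mirror_site() {
    # --recursive        same as -r: follow links and download what they point to
    # --convert-links    same as -k: rewrite links for offline viewing
    # --page-requisites  also fetch the images, CSS and scripts each page needs
    # --no-parent        never climb above the starting directory
    "$WGET" --recursive --convert-links --page-requisites --no-parent "$1"
}

# Example (placeholder URL from the article):
# mirror_site http://www.sitio.com
```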
Now, things get complicated when the site admin makes it hard for us ...
What restrictions might there be?
The most common one is that access to the site is only allowed if you have a recognized UserAgent. In other words, the site notices that the UserAgent downloading so many pages is not one of the "normal" ones and therefore blocks access.
Also, through the robots.txt file the admin can specify that wget (like a bunch of similar applications) may not download whatever the client wants, well ... because the site admin wants it that way, period 😀
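Before going any further it can help to look at what a site's robots.txt actually says. This is only a sketch: show_robots is a made-up helper name, the URL is a placeholder, and the sample rules in the comments are illustrative rather than taken from a real site.

```shell
#!/bin/sh
# Set WGET=echo to print the arguments instead of downloading (dry run).
WGET="${WGET:-wget}"

# show_robots is a hypothetical helper name, not part of wget.
show_robots() {
    # -q silences wget's progress output; -O - writes the file to standard output.
    # ${1%/} drops a trailing slash so the URL does not end up with "//".
    "$WGET" -q -O - "${1%/}/robots.txt"
}

# Example (placeholder URL):
# show_robots http://www.site.com
#
# A site that wants to keep every crawler out of /private/ might publish
# something like this (illustrative only):
#   User-agent: *
#   Disallow: /private/
```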
How do we get around these restrictions?
For the first case we will set a UserAgent for wget. We can do this with the --user-agent option; here is how:
wget --user-agent="Mozilla/5.0 (X11; Linux amd64; rv:32.0b4) Gecko/20140804164216 ArchLinux KDE Firefox/32.0b4" -r http://www.site.com -k
Now, to get around robots.txt, simply tell wget to ignore that file, that is, let it download the site without caring what robots.txt says:
wget --user-agent="Mozilla/5.0 (X11; Linux amd64; rv:32.0b4) Gecko/20140804164216 ArchLinux KDE Firefox/32.0b4" -r http://www.site.com -k -e robots=off
Now ... there are other options or parameters we can use to fool the site even more, for example by indicating that we arrived at the site from Google. Here is the final line with everything:
wget --header="Accept: text/html" --user-agent="Mozilla/5.0 (X11; Linux amd64; rv:32.0b4) Gecko/20140804164216 ArchLinux KDE Firefox/32.0b4" --referer=http://www.google.com -r http://www.site.com -e robots=off -k
Is it OK to do this?
That depends ... you always have to look at it from both points of view, the site admin's and the reader's.
On the one hand, as an admin, I would not like someone taking an HTML copy of my site just like that. It is not here on the internet for the fun of it, it is here for everyone's enjoyment ... our goal is to have interesting content available that you can learn from.
But, on the other hand ... there are users who do not have internet at home and who would like to have the whole Tutorials section we have put up here ... I put myself in their place (in fact I am in their place, because I have no internet at home), and it is not pleasant to be at the computer, have a problem or want to do something, and not be able to because you cannot access the network.
Whether it is right or wrong is up to each admin and each person's situation ... what would worry me most is the resource consumption wget causes on the server, but a good caching system should be enough to keep the server from suffering.
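If server load is the worry, wget itself can be told to go easy. A minimal sketch, assuming a placeholder URL; polite_mirror is a made-up helper name, and the 2-second wait and 200k rate cap are arbitrary starting points rather than recommended values.

```shell
#!/bin/sh
# Set WGET=echo to print the arguments instead of downloading (dry run).
WGET="${WGET:-wget}"

# polite_mirror is a hypothetical helper name, not part of wget.
polite_mirror() {
    # --wait=2           pause 2 seconds between requests
    # --random-wait      vary that pause so requests are not evenly spaced
    # --limit-rate=200k  cap the download speed at roughly 200 KB/s
    "$WGET" --recursive --convert-links --wait=2 --random-wait --limit-rate=200k "$1"
}

# Example (placeholder URL):
# polite_mirror http://www.site.com
```

The download takes longer this way, but the requests reach the server spread out over time instead of all at once.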
Conclusions
Now, I ask you not to all start downloading DesdeLinux, HAHAHA!! For example, my girlfriend asked me to download some Geometry Dash cheats (something like Geometry Dash Cheats). I will not download the entire website; I will simply open the page I want and save it as PDF or HTML or something like that, and that is what I would recommend to you.
If there is a DesdeLinux tutorial you want to keep, save it in your bookmarks, or as HTML or PDF … but for one or two tutorials there is no need to generate excessive traffic and load on the server 😉
Well, I hope this is useful ... Regards
Interesting tip. I did not know you could do that.
Apparently that is what happened to me twice, and it was surely because of this. Although it was for speed reasons (home vs. university) that I wanted to access the content that way. 😛
Thanks for the tips. Regards.
Great for those of us without internet. Really good tutorials.
Very interesting article.
Question: how can this be done for https sites?
And for sites that require authentication with a username and password, where a large part of the site is written in Java?
Regards and thanks
and where do the downloads get saved?
I will answer myself: in the home folder. But now the question is ... can you somehow specify where the content should be downloaded?
thanks
I think you first go into the folder where you want to save it and then run wget
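As far as I know, wget saves into whatever directory you run it from, so changing into the target folder first works. Another option is --directory-prefix (short form -P), which names the destination directly. A sketch only: download_to is a made-up helper name, and the path and URL in the example are placeholders.

```shell
#!/bin/sh
# Set WGET=echo to print the arguments instead of downloading (dry run).
WGET="${WGET:-wget}"

# download_to is a hypothetical helper: download_to DIRECTORY URL
download_to() {
    # --directory-prefix (short form -P) sets the directory where files are saved.
    "$WGET" --directory-prefix="$1" --recursive --convert-links "$2"
}

# Example (placeholder path and URL):
# download_to "$HOME/sites" http://www.site.com
```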
question ... is there something similar to this for "combining" a database?
I am curious, do you get paid for placing those links to niche websites?
Blessed wget ... that is how I downloaded a lot of dirty pictures back in my piggish days xD
good trick. thanks
Very good, I liked the part about getting around the restrictions.
Thanks for that gem:
wget --header="Accept: text/html" --user-agent="Mozilla/5.0 (X11; Linux i686; rv:31) Gecko/20100101 Firefox/31" --referer=http://www.google.com -r https://launchpad.net/~linux-libre/+archive/ubuntu/rt-ppa/+files/linux-image-3.6.11-gnu-3-generic_3.6.11-gnu-3.rt25.precise1_i386.deb -k -e robots=off
wget --header="Accept: text/html" --user-agent="Mozilla/5.0 (X11; Linux i686; rv:31) Gecko/20100101 Firefox/31" --referer=http://www.google.com -r https://launchpad.net/~linux-libre/+archive/ubuntu/rt-ppa/+files/linux-headers-3.6.11-gnu-3_3.6.11-gnu-3.rt25.precise1_all.deb -k -e robots=off
wget --header="Accept: text/html" --user-agent="Mozilla/5.0 (X11; Linux i686; rv:31) Gecko/20100101 Firefox/31" --referer=http://www.google.com -r https://launchpad.net/~linux-libre/+archive/ubuntu/rt-ppa/+files/linux-headers-3.6.11-gnu-3-generic_3.6.11-gnu-3.rt25.precise1_i386.deb -k -e robots=off
Very nice
wget is one of those really powerful tools. With a small terminal script you can build your own Google-style robot that downloads the content of pages and stores it in your own database, and then do whatever you want with that data later.
I find this tool very interesting; I had never paid attention to its parameters. I would like to know whether it is possible to download content from a page «X» that you have to log in to in order to access, and, if there is a video on this site «X», whether I could download it too even if it is hosted on a different CDN from the «X» site?
If this were possible, how does a site protect itself against such a tool?
Regards!
Good evening:
I am writing to ask for advice. Using the last command in this article I downloaded about 300MB of information .. .swf, .js and .html files, from the page http://www.netacad.com/es with my user account from a short course I took in Maracay, Venezuela.
My question is… Will it be possible to view the Flash animations?
I go into "Global Settings" and none of the options it shows let me configure it.
I would appreciate any reply.
Thanks in advance!
I have the same issue: the .swf files only download halfway. If you manage to get past it, please share the details. The last thing I tried was using a spider to collect all the netacad links, but even so the .swf files do not finish downloading as they should
Excellent!!! thanks.
Hi, thanks for your tutorial. I am trying to download a password-protected blog I am invited to, so I can read it at home without a connection. I am using this program, and obviously I have the password for the blog (WordPress), but I do not know how to proceed. Could you show me?
Thanks in advance and best regards!
great post!!!
great, it has been very useful to me
I am logged in to a website with embedded Vimeo videos and there is no way to download them .. it seems Vimeo has them protected. Any ideas??