Download an entire website with wget, even when restrictions are in place

What is wget?

Nothing beats Wikipedia for explaining what this tool is:

GNU Wget is a free software tool that makes it simple to download content from web servers. Its name derives from World Wide Web (w) and from "get", meaning: get from the WWW.

It currently supports downloads over the HTTP, HTTPS and FTP protocols.

Among the most notable features wget offers are easy recursive downloading of complex mirrors, conversion of links so that HTML content can be viewed locally, proxy support ...

We have already talked about wget quite a lot here at DesdeLinux. In fact, we have already seen how to download a complete website with wget. The problem is that nowadays administrators do not always allow anyone to download their whole site just like that. It is not something they like very much ... and, obviously, I understand. The site is on the Internet to be consulted: the reader gets access to content of interest and the administrator benefits through advertising, visits and so on. If the reader downloads the site to their computer, they no longer need to go online to look up an old post.

Downloading a site with wget is as simple as:

wget -r -k http://www.sitio.com

  • -r : Turns on recursive retrieval, so wget follows the links it finds and downloads the site (by default up to five levels deep).
  • -k : Converts the links in the downloaded pages so they can be browsed on the computer without an internet connection.
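As a rough sketch, the basic command can be combined with a few other standard wget options to keep the download bounded. The function name `mirror_bounded`, its argument order, and the values used in the usage note are assumptions made for illustration, not something taken from the original command.

```shell
# Sketch: a bounded recursive download built only from standard wget options.
# The function name and the argument order are assumptions for illustration.
mirror_bounded() {
  # $1 = start URL, $2 = maximum recursion depth, $3 = target directory
  # --level caps how deep -r follows links (wget's default depth is 5),
  # --no-parent keeps wget from climbing above the start directory,
  # --page-requisites also fetches the images and CSS each page needs.
  wget --recursive --level="$2" --no-parent --page-requisites \
       --convert-links --directory-prefix="$3" "$1"
}
```

With the placeholder address used above, it could be called as `mirror_bounded http://www.sitio.com 3 site-copy`, which would save the pages under a `site-copy` directory.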

Now, things get complicated when the site administrator makes it hard for us ...

What restrictions might we run into?

The most common one is that access is allowed only to a recognized UserAgent. In other words, the site will notice that the UserAgent downloading so many pages is not one of the "normal" ones and will therefore block access.

The robots.txt file can also state that wget (like many similar applications) may not download whatever the client wants. Well ... the site administrator wants it that way, period 😀
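To see what a given site actually asks of crawlers, its robots.txt can be printed before starting any download. This is only a minimal sketch: `show_robots` is a hypothetical helper name, and it assumes the file sits at the usual `/robots.txt` location under the site root.

```shell
# Sketch: print a site's robots.txt to the terminal before mirroring it.
# show_robots is a hypothetical helper name, not part of wget.
show_robots() {
  # $1 = site root, with or without a trailing slash
  # -q silences progress output; -O - writes the file to standard output.
  wget -q -O - "${1%/}/robots.txt"
}
```

For example, `show_robots http://www.site.com` would list the `User-agent` and `Disallow` rules the administrator has published, if any.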

How do we get around these restrictions?

First we set a UserAgent for wget. This is done with the --user-agent option, like so:

wget --user-agent="Mozilla/5.0 (X11; Linux amd64; rv:32.0b4) Gecko/20140804164216 ArchLinux KDE Firefox/32.0b4" -r http://www.site.com -k

Now, to get around robots.txt, simply ignore that file. That is, let wget download the site without caring what robots.txt says, which is what -e robots=off does:

wget --user-agent="Mozilla/5.0 (X11; Linux amd64; rv:32.0b4) Gecko/20140804164216 ArchLinux KDE Firefox/32.0b4" -r http://www.site.com -k -e robots=off

Now ... there are other options or parameters we can use to fool the site even further, for example by indicating that we arrived at the site from Google. Here is the final line with everything:

wget --header="Accept: text/html" --user-agent="Mozilla/5.0 (X11; Linux amd64; rv:32.0b4) Gecko/20140804164216 ArchLinux KDE Firefox/32.0b4" --referer=http://www.google.com -r http://www.site.com -e robots=off -k

The site address does not have to begin with http://www; it can begin with just http://, as with this Geometry Dash site, for example.

Is it OK to do this?

That depends ... always look at it from both points of view, the site administrator's and the reader's.

On the one hand, as an administrator, I would not like people taking an HTML copy of the site just like that. It is online not on a whim but for everyone's enjoyment ... our goal is to offer content that interests you and that you can learn from.

But on the other hand ... there are users who have no internet at home and who would like to have the whole Tutorials section we have published here ... I put myself in their place (in fact I am in their place, because I have no internet at home), and it is not pleasant to be at the computer with a problem, or wanting to do something, and be unable to because you have no network access.

Whether it is right or wrong is up to each administrator and each person's circumstances ... what would worry me most is the resource consumption wget causes on the server, but with a good caching system the server should not suffer.
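One way to reduce the load on the server from the client side is to throttle wget itself. The following is a sketch under assumed values: the two-second pause and the 200k rate cap are arbitrary numbers picked for illustration, and `mirror_politely` is a hypothetical name.

```shell
# Sketch: a recursive download that deliberately goes easy on the server.
# The wait time and rate limit are illustrative values, not recommendations
# from the article.
mirror_politely() {
  # $1 = start URL
  # --wait=2 pauses two seconds between requests,
  # --random-wait varies that pause so requests are less bursty,
  # --limit-rate=200k caps the download speed at about 200 KB/s.
  wget --recursive --convert-links --wait=2 --random-wait \
       --limit-rate=200k "$1"
}
```

Called as `mirror_politely http://www.site.com`, the download takes longer, but the requests are spread out instead of arriving all at once.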


Conclusions

Please do not start downloading DesdeLinux now, HA HA HA!! For example, my girlfriend asked me to download some Geometry Dash cheats. I will not download the whole website; I will just open the page I want and save it as PDF or HTML or something similar. That is what I would recommend to you.

If there is a DesdeLinux tutorial you want to keep, save it in your bookmarks, as HTML or PDF...

Well, that is all. I hope it is useful ... regards
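For that single-page case, wget can also fetch just one page together with the files it needs to display, without recursing through the rest of the site. This is a sketch only: `save_page` is a hypothetical helper name, and the `--adjust-extension` spelling assumes wget 1.12 or newer.

```shell
# Sketch: keep an offline copy of ONE page instead of the whole site.
# save_page is a hypothetical helper name.
save_page() {
  # $1 = URL of the single page to keep
  # --page-requisites fetches the images, CSS and scripts the page uses,
  # --adjust-extension adds .html to saved pages that lack it,
  # --span-hosts allows requisites hosted on other domains,
  # --convert-links rewrites links so the copy works offline.
  wget --page-requisites --adjust-extension --span-hosts \
       --convert-links "$1"
}
```

A call such as `save_page http://www.site.com/some-tutorial.html` (a placeholder address) would produce a local copy of that page alone, which generates far less traffic than a recursive download.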



  1.   lokacin3000

    Interesting tip. I did not know you could do that.

  2.   Emmanuel

    That is evidently what happened to me twice, and that was certainly the cause. In my case, though, it was for speed reasons (home versus university) that I wanted to access the content this way. 😛
    Thanks for the tip. Regards.

  3.   Gerardo

    Great for those of us without internet. Good tutorials indeed.

  4.   Quinotto

    Very interesting article.
    Question: how can this be done for https sites?
    What about sites that require authentication with a username and password, and where a large part of the site is written in java?
    Regards and thanks

  5.   Gelibium

    and where does the downloaded content get saved?

    1.    Gelibium

      I will answer myself: in the personal (home) folder. But now the question is ... can you somehow tell it where to download the content?

      thanks

      1.    Daniel

        I guess you first change into the folder where you want to save it and then run wget

  6.   Cristian

    a question ... is there something like this for "dumping" a database?

  7.   xphnx

    I am curious, do you get paid for placing those links to micro-niche sites?

  8.   Rupert

    Good old wget ... that is how I downloaded a lot of porn back in my piggish days xD

  9.   wata

    great tip. thanks

  10.   null

    Very good, I like the part about bypassing restrictions.

  11.   Franz

    Thanks for this gem:
    wget --header="Accept: text/html" --user-agent="Mozilla/5.0 (X11; Linux i686; rv:31) Gecko/20100101 Firefox/31" --referer=http://www.google.com -r https://launchpad.net/~linux-libre/+archive/ubuntu/rt-ppa/+files/linux-image-3.6.11-gnu-3-generic_3.6.11-gnu-3.rt25.precise1_i386.deb -k -e robots=off

    wget --header="Accept: text/html" --user-agent="Mozilla/5.0 (X11; Linux i686; rv:31) Gecko/20100101 Firefox/31" --referer=http://www.google.com -r https://launchpad.net/~linux-libre/+archive/ubuntu/rt-ppa/+files/linux-headers-3.6.11-gnu-3_3.6.11-gnu-3.rt25.precise1_all.deb -k -e robots=off

    wget --header="Accept: text/html" --user-agent="Mozilla/5.0 (X11; Linux i686; rv:31) Gecko/20100101 Firefox/31" --referer=http://www.google.com -r https://launchpad.net/~linux-libre/+archive/ubuntu/rt-ppa/+files/linux-headers-3.6.11-gnu-3-generic_3.6.11-gnu-3.rt25.precise1_i386.deb -k -e robots=off

  12.   Dovecote

    Very interesting.

  13.   oscar meza

    wget is one of those extremely powerful tools. With a little shell programming you can build your own Google-style robot that downloads the content of pages, stores it in your own database, and lets you do whatever you want with that data later.

  14.   Carlos G Ba

    I find this tool very interesting; I had never paid attention to its parameters. I would like to know whether it is possible to download content from a page «X» that requires you to log in, and, if somewhere on that site «X» there is a video, whether I could download it too even if it belongs to a different CDN than site «X».

    If that is possible, how can a site protect itself against this kind of tool?

    Thank you!

  15.   Erick zanardi

    Good morning:

    I am writing to ask for advice. Using the last command in this article I downloaded almost 300MB of data (.swf, .js and .html files) from the site http://www.netacad.com/es with my user account from a short course I took in Maracay, Venezuela.

    My question is… is it possible to view the flash animations?

    I went into "Global Configuration", but none of the options it shows lets me configure this.

    I appreciate any reply.

    Thanks in advance!

    1.    ADX

      I have the same issue: the .swf files only download halfway. If you manage to get past it, please share the details. What I did last year was use a spider to collect all the netacad links, but the .swf files still do not download properly

  16.   alexander.hernandez

    very good !!! thanks.

  17.   Ana

    Hi, thanks for your tutorials. I tried to download a blog I have been invited to, which needs a password, so that I can read it from home without a connection. I am using this program and, obviously, I have the password for the blog (wordpress), but I do not know how to proceed. Could you show me?
    Thanks in advance and best regards!

  18.   Fran

    what a great post !!!

  19.   Santiago

    excellent, it served me very well

  20.   Fran

    I went to a website with embedded vimeo videos and there is no way to download them .. it is as if vimeo protects them. Any ideas ??