How to OCR a PDF and enable text selection and search

Suppose you have a PDF that was created with a scanner, or one that was sent to you, whose content is stored as images. The process we need to run that PDF through is called OCR: a process that automatically identifies the symbols or characters of a given alphabet in an image and stores them as data that we can work with in a text editor or a similar program.


pdfocr is a simple tool that creates a new PDF with an embedded text layer, allowing the user to select text and search for words in it, without changing the final appearance of the PDF.

What pdfocr is not for:

This is only useful if the PDF stores its content as images; if you exported the PDF from OpenOffice, it already has an embedded text layer, so this process is unnecessary.
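
One way to check whether a PDF already carries a text layer, before spending time on OCR, is to try extracting text from it. The sketch below assumes the pdftotext tool from poppler-utils is installed; the function name has_text_layer is made up for illustration. It is a rough test only: a scanned PDF that contains even a few stray characters of text would count as having a layer.

```shell
# Succeed (exit status 0) if the PDF yields any non-whitespace text.
# Assumes pdftotext (poppler-utils) is on the PATH; "-" sends the text to stdout.
has_text_layer() {
    text=$(pdftotext "$1" - 2>/dev/null | tr -d '[:space:]')
    [ -n "$text" ]
}

# Hypothetical usage:
# if has_text_layer input.pdf; then echo "already searchable, no OCR needed"; fi
```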

How to install pdfocr:

sudo add-apt-repository ppa:gezakovacs/pdfocr
sudo apt-get update
sudo apt-get install pdfocr
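
If apt-get reports that the package cannot be found, it may be that the PPA has no build for your Ubuntu release. A rough way to check, assuming the standard apt-cache tool, is to ask apt which version it would install. The function name pdfocr_candidate is hypothetical.

```shell
# Print the version apt would install for pdfocr, or nothing if none is available.
# LC_ALL=C keeps the "Candidate:" label in English regardless of system locale.
pdfocr_candidate() {
    LC_ALL=C apt-cache policy pdfocr 2>/dev/null |
        awk '$1 == "Candidate:" && $2 != "(none)" { print $2 }'
}

# Hypothetical usage:
# if [ -z "$(pdfocr_candidate)" ]; then echo "no pdfocr build for this release"; fi
```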

How to use pdfocr:

Open a terminal, go to the directory that contains the PDF you want to convert, and enter the following (replace input.pdf with the PDF you want to convert and output.pdf with the name of the new file that will carry the embedded text layer):

pdfocr -i input.pdf -o output.pdf

Wait while each page of your PDF is OCRed and the final converted file is built. This should take a few seconds per page, depending on the resolution of your PDF.
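
To convert several files at once, the command above can be wrapped in a loop. This is a minimal sketch under assumptions: the function names ocr_output_name and ocr_directory and the -ocr.pdf suffix are invented here, and only the -i and -o options shown above are used.

```shell
# Derive an output name, e.g. scan.pdf -> scan-ocr.pdf (the suffix is an assumption).
ocr_output_name() {
    printf '%s-ocr.pdf\n' "${1%.pdf}"
}

# Run pdfocr on every PDF in a directory (default: the current one).
ocr_directory() {
    dir=${1:-.}
    for input in "$dir"/*.pdf; do
        [ -e "$input" ] || continue                  # the glob matched nothing
        case $input in *-ocr.pdf) continue ;; esac   # skip files we produced
        output=$(ocr_output_name "$input")
        [ -e "$output" ] && continue                 # already converted
        pdfocr -i "$input" -o "$output"
    done
}

# Hypothetical usage:
# ocr_directory ~/scans
```

Skipping files whose output already exists makes it safe to rerun the loop after an interruption.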



  1.   Rodolfo Lara said

    rodolfo@rodolfo-desktop:~$ sudo apt-get install pdfocr
    Reading package lists... Done
    Building dependency tree
    Reading state information... Done
    E: Unable to locate package pdfocr
    rodolfo@rodolfo-desktop:~$

  2.   Let's Use Linux said

    Did you make sure to add the corresponding PPA first?
    It is possible that this PPA only carries pdfocr builds for older Ubuntu releases. Keep in mind that this post is already a few months old. Still, the idea is the same: go to Launchpad and look for a PPA that contains pdfocr packages for Maverick.
    Cheers! Paul.

  3.   Javare said

    I'll have to try it out to see how it works.

  4.   Let's Use Linux said

    Go ahead! Let us know if it works for you! If it doesn't, we can also try to help. Cheers! Paul.

  5.   a01653 said

    Hello,
    I tried the program on a PDF and the result is not good at all. I am used to Acrobat 8 Professional and was looking for something equivalent. Acrobat runs filters over the files to clean up and straighten scanned PDFs, and so gets a better source for the OCR. Do you know whether there is a solution for this?

    Regards

  6.   Let's Use Linux said

    Hello! I have heard all over the place that Tesseract is the best open source OCR. I don't know whether it will be good enough. Also, you will have to get your hands dirty to make it work. Here are the instructions. If you succeed, please let me know, because if it works it will end up becoming a post.

    First install the packages "tesseract 2.03-4" and "imagemagick" using Synaptic, and "xsane2tess" from "http://download.tuxfamily.org/guadausers/guadaV4/".

    Then create a tmp folder at: /home/yourusername/tmp

    Next open Xsane to configure it: Preferences -> Setup -> OCR tab, and fill in the following:

    OCR command -> xsane2tess -l spa
    Input file option -> -i
    Output file option -> -o
    GUI output-fd option -> -x

    In the Xsane setup, on the "Save" tab, in the field for the temporary directory, make sure it points to the "tmp" folder you created in "/home/yourusername".

    I'll leave you a page with details on how to do OCR on Ubuntu: https://help.ubuntu.com/community/OCR

  7.   Let's Use Linux said

    Another method I found is the following:

    Assuming the scanner is already connected and recognized by the system:

    1. I open System > Administration > Synaptic Package Manager (in GNOME)

    2. I search for and mark for installation tesseract-ocr-spa (for scanning in Spanish) and gscan2pdf

    3. To scan, I open Applications > Graphics > gscan2pdf

    And that's it.

  8.   Troubadour said

    Hello friend, thank you very much. The truth is that Tesseract is a good tool, but it is quite limited when faced with books that have "problematic" scans. On the other hand, this software adapts easily ... 😀

  9.   juan anez said

    In the process of digitizing images, the files are converted to PDF/A and must be OCRed. How sensitive is the result to scanning in black and white versus grayscale? Which is recommended?