How to OCR a PDF and enable text selection and search

Suppose you have a PDF that was created with a scanner, or one that someone passed on to you in which the information is stored as images. The process we need to apply to that PDF is called OCR: a procedure that automatically identifies the symbols or characters of a given alphabet in an image and stores them as data that we can work with in a text editor or a similar program.


pdfocr is a simple tool that creates a new PDF with an embedded text layer, allowing the user to select text and search for words in it, without changing the final appearance of the PDF.

What pdfocr is NOT for:

This only works if the PDF contains its information in image form; if you exported the PDF from OpenOffice, it already has embedded text, so this procedure is pointless.
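
One rough way to tell whether a PDF already carries a text layer is to try extracting text from it. The sketch below assumes the pdftotext tool from poppler-utils is installed (it is not mentioned in this post), and the function names are purely illustrative. A scanned, image-only PDF should yield no extractable characters.

```shell
#!/bin/sh
# Count the non-whitespace characters arriving on standard input.
count_text_chars() {
    # $(( ... )) normalizes any padding that wc may add to its output.
    echo $(( $(tr -d '[:space:]' | wc -c) ))
}

# Hypothetical helper: succeed if the PDF yields any extractable text.
# Assumes pdftotext (poppler-utils) is on the PATH; "-" sends text to stdout.
has_text_layer() {
    chars=$(pdftotext "$1" - 2>/dev/null | count_text_chars)
    [ "$chars" -gt 0 ]
}
```

For example, running has_text_layer input.pdf && echo "already searchable" || echo "needs OCR" would print a hint about whether OCR is worth the effort.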

How to install pdfocr:

sudo add-apt-repository ppa:gezakovacs/pdfocr
sudo apt-get update
sudo apt-get install pdfocr
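
After installing, it can help to confirm that the command is actually available before trying to use it. This is a minimal sketch; the helper names have_cmd and check_pdfocr are made up for illustration.

```shell
#!/bin/sh
# Succeed if the named command can be found on the PATH.
have_cmd() {
    command -v "$1" >/dev/null 2>&1
}

# Report whether pdfocr is installed; returns nonzero when it is missing.
check_pdfocr() {
    if have_cmd pdfocr; then
        echo "pdfocr found at: $(command -v pdfocr)"
    else
        echo "pdfocr not found; check that the PPA has a package for your release" >&2
        return 1
    fi
}
```

Calling check_pdfocr right after the install commands gives a quick yes or no.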

How to use pdfocr:

Open a terminal, go to the directory containing the PDF you want to convert, and type the following (replacing input.pdf with the PDF you want to convert and output.pdf with the name of the new file that will contain the embedded text layer):

pdfocr -i input.pdf -o output.pdf

Wait for each page of your PDF to be OCRed and for the final file to be generated. This should take a few seconds per page, depending on the resolution of your PDF.
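
If you have several scanned PDFs in one directory, the same command can be wrapped in a loop. The following is only a sketch: it uses the -i and -o options shown above, while the function names and the "-ocr" suffix for output files are arbitrary choices made for this example.

```shell
#!/bin/sh
# Derive the output name: "scan.pdf" becomes "scan-ocr.pdf".
ocr_output_name() {
    echo "${1%.pdf}-ocr.pdf"
}

# Run pdfocr on every PDF in the current directory.
ocr_all_pdfs() {
    for f in *.pdf; do
        # The pattern stays literal when no PDF matches; skip that case.
        [ -e "$f" ] || continue
        # Skip files produced by an earlier run of this function.
        case "$f" in *-ocr.pdf) continue ;; esac
        out=$(ocr_output_name "$f")
        echo "OCRing $f -> $out"
        pdfocr -i "$f" -o "$out"
    done
}
```

Run ocr_all_pdfs from the directory that holds the scans; the original files are left untouched.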



  1.   Rudolf Lara said

    rodolfo@rodolfo-desktop:~$ sudo apt-get install pdfocr
    Reading package lists... Done
    Building dependency tree
    Reading state information... Done
    E: Unable to locate package pdfocr
    rodolfo@rodolfo-desktop:~$

  2.   Let's Use Linux said

    Did you make sure to add the corresponding PPA?
    This PPA may only contain versions of pdfocr for older Ubuntu releases. Bear in mind that this post is already several months old. Even so, the idea is the same: go to Launchpad and look for a PPA that has pdfocr packages for Maverick.
    Cheers! Paul.

  3.   Javare said

    Well, it will be a matter of trying it out to see how it works.

  4.   Let's Use Linux said

    Go ahead! Let us know if you succeed! If it doesn't work, we can try to help you again. Cheers! Paul.

  5.   a01653 said

    Hello,
    I tried this program on a PDF and the result was not very good. I am used to Acrobat 8 Professional and was looking for something similar. Acrobat applies filters to the files to clean up and straighten skewed PDFs, and so gets better OCR text. Do you know if there is a solution for this?

    Thanks!

  6.   Let's Use Linux said

    Hello! I've heard that Tesseract is the best open-source OCR. I don't know if it will be good enough. Also, you have to get your hands dirty to make it work. Here are some instructions. If you succeed, please let me know, because if it works it may end up becoming a post.

    First install the packages "tesseract 2.03-4" and "imagemagick" using Synaptic, and "xsane2tess" from "http://download.tuxfamily.org/guadausers/guadaV4/".

    Then create the tmp directory at: /home/yourusername/tmp

    Then open Xsane to configure it: go to Preferences -> Setup -> OCR tab and fill in the following:

    OCR command -> xsane2tess -l spa
    Input file option -> -i
    Output file option -> -o
    GUI output-fd option -> -x

    In the Xsane setup, in the "Saving" tab, in the section for the temporary directory, make sure it points to the "tmp" directory you created in "/home/yourusername".

    I'll also leave you a page with information on how to do OCR in Ubuntu: https://help.ubuntu.com/community/OCR

  7.   Let's Use Linux said

    Another method I came across is the following:

    Assuming the scanner is already connected and recognized by the system:

    1. Open System > Administration > Synaptic Package Manager (in GNOME).

    2. Search for tesseract-ocr-spa (for scanning in Spanish) and gscan2pdf, and mark them for installation.

    3. To scan, open Applications > Graphics > gscan2pdf.

    And you're done.

  8.   Troubadour said

    Hey friend, thank you very much. The truth is that tesseract is a good tool, but it is quite limited when it comes to books that are "difficult" to scan. On the other hand, this software is easy to get along with... 😀

  9.   Juan Anez said

    In the process of digitizing images, the files being converted to PDF-A must be OCRed. How sensitive is the result to scanning in black and white versus grayscale? Which is recommended?