LeoFCardoso/pdf2pdfocr
A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!
PythonApache-2.0
Issues
- 0
install script under linux is incomplete
#50 opened by nicoursi - 7
RuntimeError: can't start new thread
#49 opened by nguyenvulong - 2
Font issue on Macos Catalina Dark Appearance
#17 opened by ericmoret - 1
- 4
Do we have any parameter / flag for pdf compression here, to reduce pdf size after applying OCR?
#48 opened by yatrik-cloud - 4
- 3
- 2
file not found. Aborting...
#43 opened by yatrik-cloud - 4
A rectangular block is the only portion being selected from within a paragraph.
#42 opened by yatrik-cloud - 2
PIL.Image.DecompressionBombError: Image size (235978454 pixels) exceeds limit of 178956970 pixels, could be decompression bomb DOS attack.
#41 opened by yatrik-cloud - 1
- 3
Bad insertion text on PDF
#40 opened by FloLaco - 4
Zero OCR'ed files
#38 opened by PatrikHlebecStor - 2
PyPDF2 moved PdfReadError from utils to errors
#35 opened by Cragsand - 3
merging multiple files into one pdf-file
#32 opened by tfinke18119 - 1
pdf2pdfocr changing languages
#36 opened by Cragsand - 11
- 1
- 5
- 4
Multiple Files Together
#28 opened by mananchawla2005 - 5
Error Message by OCR via GUI
#27 opened by dempfma - 1
Poor performance in docker container
#26 opened by LeoFCardoso - 1
Blank file
#25 opened by LeoFCardoso - 3
Create language based Dockerimages
#24 opened by Brice187 - 3
Application icon
#18 opened by ericmoret - 6
- 8
Output file could not be created
#21 opened by kenyonit - 2
result pdf file is blank
#20 opened by LeoFCardoso - 3
Documentation update
#16 opened by ericmoret - 2
what does cmd_file implies
#19 opened by Harish202 - 0
- 6
Specify Output Folder using pdf2pdfocr.vbs
#14 opened by der-klabauter - 1
Tesseract 4 LSTM (--oem 1)
#13 opened by gabriel-v - 1
- 5
Integration with Google Vision API
#11 opened by bharat-patidar - 0
script hangs on windows and python 3.7.2
#10 opened by LeoFCardoso - 0
autorotation is broken with tesseract 4
#9 opened by LeoFCardoso - 0
"-g grayscale" fail
#8 opened by LeoFCardoso - 4
- 0
- 7
- 13
- 4
Missing space
#3 opened by ericmoret - 6