/ConvertPDFToOCRTxtFilesPerPage

Converts a pdf file to images, and then uses OCR (tesseract) to convert it to text and save it as files by page number.

Primary LanguagePythonMIT LicenseMIT

This repository is not active