/pdf-to-text

Converts a batch of PDF files to text, with optional keyword matching to move matches into a separate directory using the Tesseract OCR and pdf2image packages.

Primary LanguagePythonMIT LicenseMIT

Watchers