/doc2text

Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.

Primary language: Python | License: MIT
