/digitization-documents

Using Apache Tika and Tesseract to extract text from any document
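As a rough illustration of how the two tools might be combined, here is a minimal sketch; it is not taken from this repository. It assumes the `tika`, `pytesseract`, and `Pillow` Python packages are installed, along with a Java runtime for Tika and a Tesseract binary. The names `IMAGE_SUFFIXES`, `choose_backend`, and `extract_text` are hypothetical, and routing by file extension is an assumption made for the example.

```python
from pathlib import Path

# Hypothetical list of image suffixes that are sent to OCR (an assumption).
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff", ".bmp"}


def choose_backend(path):
    """Return 'tesseract' for image files and 'tika' for everything else."""
    suffix = Path(path).suffix.lower()
    return "tesseract" if suffix in IMAGE_SUFFIXES else "tika"


def extract_text(path):
    """Extract text from a document using the backend chosen above."""
    if choose_backend(path) == "tesseract":
        # Imported lazily so the routing logic works without OCR installed.
        import pytesseract
        from PIL import Image

        return pytesseract.image_to_string(Image.open(path))

    # Tika handles PDFs, office documents, and many other formats.
    from tika import parser

    parsed = parser.from_file(str(path))
    # 'content' can be None when Tika finds no text.
    return parsed.get("content") or ""
```

Calling `extract_text("report.pdf")` would go through Tika, while `extract_text("scan.png")` would go through Tesseract. Scanned PDFs that contain only images would need extra handling that this sketch does not cover.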

Primary Language: Python
License: Other (NOASSERTION)
