Script to generate OCR and hOCR from a directory of page images using Tesseract.
Primary LanguagePythonThe UnlicenseUnlicense