/document-segmentation

Browser-based app for segmenting & OCRing PDF pages based on whitespace rules. To assist researchers (especially in the humanities) with turning their materials into machine-actionable datasets.

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Stargazers