/doc2dataset

A tool to extract text (and images) from documents such as PDFs.

Primary Language: Python
License: MIT
