PDF parsing toolkit for preparing academic text corpus
Primary LanguagePythonApache License 2.0Apache-2.0