ckorzen/pdf-text-extraction-benchmark
A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF documents, especially from scientific articles.
TeXMIT
Issues
- 1
dataset 404 not found
#3 opened by XirenZhou - 4
Doc diff as a library
#1 opened by de-code