/MinerU

A one-stop, open-source, high-quality data extraction tool that supports extraction from PDFs, webpages, and multi-format e-books.

Primary language: Python; License: GNU Affero General Public License v3.0 (AGPL-3.0)

Pinned issues

Other (indirect) approaches for extracting tables from PDFs

#360 opened by beiluo

Closed (19)

Separate model loading from content parsing

#435 opened by 2257396011

Closed (15)

Model preloading

#517 opened by BronyaKaslana06

Closed (3)

Issues