pdf-extractor-pretrain

There are 1 repositories under pdf-extractor-pretrain topic.

  • opendatalab/MinerU

    A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

    Language:Python11.3k66375847