pdf-extractor-llm

There are 5 repositories under the pdf-extractor-llm topic.

  • opendatalab/MinerU

    A high-quality, one-stop open-source data extraction tool that converts PDF to Markdown and JSON.

    Language: Python
  • DocumindHQ/documind

    Open-source platform for extracting structured data from documents using AI.

    Language: JavaScript
  • aidayang/MinerU-OneClick

    An install-free, one-click-launch bundle for deploying MinerU.

  • Alapipapi/MinerU

    A high-quality, one-stop open-source data extraction tool that converts PDF to Markdown and JSON.

    Language: Python
  • RiccardoTOTI/LLM-PDF-Extractor

    LLM-PDF-Parser is a FastAPI-based application that extracts text from PDFs and images and uses NuExtract LLM to extract specific fields based on a given JSON template.

    Language: Python