document-understanding

There are 43 repositories under the document-understanding topic.

infiniflow/ragflow
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Language: TypeScript
64.3k 293 5.2k 6.7k
deepdoctection/deepdoctection
A Repo For Document AI
Language: Python
3k 20 193169
X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Language: Python
2.2k 33 131130
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
Language: C++
1.8k 43 205198
tstanislawek/awesome-document-understanding
A curated list of resources for the Document Understanding (DU) topic
1.5k 37 2164
OpenBMB/VisRAG
Parsing-free RAG supported by VLMs
Language: Python
786 12 5358
wenwenyu/PICK-pytorch
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)
Language: Python
568 22 114192
jpWang/LiLT
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
Language: Python
355 6 4741
GoogleCloudPlatform/document-ai-samples
Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud
Language: Jupyter Notebook
289 31 50114
MathamPollard/awesome-table-structure-recognition
A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.
210 9 39
SCUT-DLVCLab/Document-AI-Recommendations
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
200 10 19
huggingface/chug
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
Language: Python
158 10 311
Alpha-Innovator/DocGenome
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models
Language: Jupyter Notebook
144 5 117
andreagemelli/doc2graph
Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.
Language: Jupyter Notebook
130 6 1922
doc-analysis/ReadingBank
ReadingBank: A Benchmark Dataset for Reading Order Detection
109 1 93
LynnHaDo/Document-Layout-Analysis
Object Detection Model for Scanned Documents
Language: Jupyter Notebook
94 4 414
LynnHaDo/Checkbox-Detection
Checkbox Detection Model for Scanned Documents
Language: Jupyter Notebook
86 2 123
microsoft/CompHRDoc
Datasets and Evaluation Scripts for CompHRDoc
Language: Python
49 4 35
ZeningLin/PEneo
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
Language: Python
37 4 87
NExTplusplus/TAT-DQA
TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning
24 4 21
SCUT-DLVCLab/RFUND
[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction"
20 1 0 0
uakarsh/TiLT-Implementation
Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.
Language: Jupyter Notebook
17 3 40
docling-project/docling4j
Docling4j brings the functionalities of Docling in document understanding to Java® projects
Language: Java
16 2 0 1
jacobmarks/pytesseract-ocr-plugin
Run optical character recognition with PyTesseract from the FiftyOne App!
Language: Python
11 2 0
javier-marti-isasi/OCR-free-Document-Understanding-with-Donut-Transformer
This project tackles a real-world challenge of automating client document processing, with a focus on enhancing document classification, error detection, data extraction, and validation.
Language: Jupyter Notebook
6 1 0 2
bwnyasse/dart-documentai-samples
A hands-on CLI tool sample showcasing the integration of Dart with Google Cloud's DocumentAI.
Language: Dart
5 2 380
callbacked/smoldocling256M-webgpu
Document Understanding in the Browser!
Language: TypeScript
5
dhorvay/document-understanding-ebook
(WIP) ✨ A comprehensive resource for understanding the world of software used in the Document Understanding field. 🧙✨
Language: Markdown
5 1 0 0
irgroup/labelstudio-to-fonduer
This small module connects Label Studio with Fonduer by creating a fonduer labeling function for gold labels from a label studio export. Documentation: https://irgroup.github.io/labelstudio-to-fonduer/
Language: Python
5 2 10
Haruhiyuki/yuque-rag
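Connects a Yuque knowledge base to large language models to build an intelligent question-answering system based on RAG (Retrieval-Augmented Generation); supports FastAPI and is compatible with the OpenAI API and local Ollama models.
Language: Python
3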
ExtrieveTechnologies/QuickCapture_IOS
QuickCapture Mobile Scanning SDK, specially designed for native iOS
Language: Objective-C
2 2 0 0
marcel-lamott/SlimDoc
Official implementation for "SlimDoc: Lightweight Distillation of Document Transformer Models," published in the International Journal on Document Analysis and Recognition (IJDAR), 2025
Language: Python
2
PAIR-Systems-Inc/little-dorrit-editor
Multimodal benchmark for evaluating handwritten editorial correction in printed text.
Language: Python
2
kariiimadelll/CV-Extractor-UiPath-Automation-Project
A UiPath bot that reads all CVs (PDF files) from a folder, extracts key candidate information, and writes the results into an Excel file for easy review and analysis.
1
phong-lt/LiGT_VQA
This repository includes the ReceiptVQA dataset and the PyTorch implementation of the LiGT method and other evaluated baselines.
Language: Python
1 1 0 0
TomQuez/LLM_document_understanding
Language: HTML
1 1 30
