layout-analysis

There are 54 repositories under the layout-analysis topic.

opendatalab/MinerU
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Language:Python48.3k 199 1.9k4k
bytedance/Dolphin
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
Language:Python7.7k 59 132634
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
Language:Python5.6k 73 154515
breezedeus/Pix2Text
An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
Language:Jupyter Notebook2.6k 18 113239
UglyToad/PdfPig
Read and extract text and other content from PDFs in C# (port of PDFBox)
Language:C#2.3k 49 573290
kotaro-kinoshita/yomitoku
YomiToku is a Python package that provides an AI-powered document image analysis engine designed specifically for the Japanese language.
Language:Python1.1k 5 1241
mittagessen/kraken
OCR engine for all the languages
Language:Python905 30 566152
BobLd/DocumentLayoutAnalysis
Document Layout Analysis resources repos for development with PdfPig.
Language:C#625 32 168
mindspore-lab/mindocr
A toolbox of ocr models and algorithms based on MindSpore
Language:Python288 13 11960
RapidAI/RapidLayout
Analysis of Chinese and English layouts
Language:Python254 5 1817
RapidAI/RapidDocEx
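📝 Extracts content from document images and reproduces them one-to-one in Word or Txt for further use or processing. Planned: support for PDF/image input with output in JSON, Txt, Word, and Markdown formats.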
Language:Python207 7 58
ppaanngggg/yolo-doclaynet
YOLO models trained on DocLayNet - power your Document Intelligence with Layout Analysis
Language:Python140 2 519
andreagemelli/doc2graph
Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.
Language:Jupyter Notebook133 6 1824
xushengfeng/eSearch-OCR
A Node.js library based on PaddleOCR
Language:TypeScript98 5 1510
NormXU/Layout2Graph
An official implementation of the paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
Language:Python81 1 1312
CycloneBoy/pdf_table
A Unified Toolkit for Deep Learning-Based Table Extraction
Language:Python52 5 49
JPLeoRX/detectron2-publaynet
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
Language:Python50 3 37
MaitySubhajit/SelfDocSeg
[ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)
Language:Python42 4 52
empressabyss/nordrassil
A keyboard layout that provides an elegant and balanced typing experience by its use of a thumb-alpha, emphasis on middle fingers, deprioritisation of pinkies, and arcane keys.
35 6 01
dell-research-harvard/HJDataset
A Large Dataset of Historical Japanese Documents with Complex Layouts
Language:Jupyter Notebook34 2 24
BobLd/PdfPigMLNetBlockClassifier
Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
Language:C#28 3 06
CaseDrive/publaynet-models
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
Language:Python27 2 02
jiangnanboy/layout_analysis4j
Layout detection for Chinese document images, implemented with java-yolov8
Language:Java26 1 19
MBAigner/PDFSegmenter
This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified and returned. Tables are retrieved formatted as a CSV.
Language:Python22 1 03
aidayang/MinerU-OneClick
An all-in-one MinerU bundle with no-install deployment and one-click launch
15 1 12
VRI-UFPR/ocrd-gbn
OCR-D compliant toolset for optical layout recognition on historical German-language documents published in Brazil
Language:Python11 2 00
pleb631/pdfLayoutDet
pdfDet aims to simplify PDF layout detection tasks for users.
Language:Python9 1 01
qyhou/curated-document-layout-analysis
A curated list of resources on Document Layout Analysis
9 1 00
calfa-co/rasam-dataset
Open Dataset for the Recognition and Analysis of Scripts in Arabic Maghrebi (ICDAR 2021, CHR 2024)
7 1 52
yoshihikoueno/pdfminer-layout-scanner
A more complete example of programming with PDFMiner, which continues where the default documentation stops
Language:Python7 1 04
privateai-com/docviz
Advanced document contents extraction with multiple output formats
Language:Python6
VRI-UFPR/page-xml-draw
A powerful CLI tool for visualization and encoding of PAGE-XML files
Language:Python6 4 122
os-climate/crrf-det
A web application for PDF content and table extraction, featuring image-based visual layout analysis, indexed document search, batch processing and extraction result annotation.
Language:C++5 5 03
engkimo/bullseye
BullsEye is a Japanese Document AI system for production‑grade OCR, layout analysis, table structure recognition, reading order estimation, and LLM‑powered understanding. It exposes Unified Doc JSON with CLI/REST APIs and integrates bullseye‑compatible providers (Apache‑2.0).
Language:Python4
Magnet-AI/Quanta
Advanced PDF layout analysis engine for extracting figures, tables, and structured content from complex engineering documents using computer vision and machine learning.
Language:Python2
rithulkamesh/docproc
Opinionated and Sophisticated Document Region Analyzer.
Language:Python2 1 90
