ocr

There are 6558 repositories under ocr topic.

tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
Language:C++70.8k 1.7k 2.7k10.4k
PaddlePaddle/PaddleOCR
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Language:Python63.1k 496 10.2k9.3k
opendatalab/MinerU
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Language:Python48.3k 199 1.9k4k
hiroi-sora/Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。
Language:Python39.6k 203 8113.9k
siyuan-note/siyuan
A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.
Language:TypeScript38.9k 159 15.5k2.4k
naptha/tesseract.js
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
Language:JavaScript37.5k 482 7282.3k
ShareX/ShareX
ShareX is a free and open-source application that enables users to capture or record any area of their screen with a single keystroke. It also supports uploading images, text, and various file types to a wide range of destinations.
Language:C#34.3k 533 7k3.5k
paperless-ngx/paperless-ngx
A community-supported supercharged document management system: scan, index and archive all your documents
Language:Python34.1k 140 2.2k2.1k
ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Language:Python31.7k 187 1.3k2.2k
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Language:Python28.3k 323 1.1k3.5k
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Language:Python15.9k 85 2951.3k
pot-app/pot-desktop
🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.
Language:JavaScript15.7k 55 902741
Unstructured-IO/unstructured
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
Language:HTML13.1k 68 1.2k1.1k
sml2h3/ddddocr
带带弟弟通用验证码识别OCR pypi版
Language:Python13k 97 2542.1k
DayBreak-u/chineseocr_lite
超轻量级中文ocr，支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
Language:C++12.2k 240 3772.3k
getomni-ai/zerox
OCR & Document Extraction using vision models
Language:TypeScript11.9k 61 100814
tisfeng/Easydict
一个简洁优雅的词典翻译 macOS App。开箱即用，支持离线 OCR 识别，支持有道词典，🍎 苹果系统词典，🍎 苹果系统翻译，OpenAI，Gemini，DeepL，Google，Bing，腾讯，百度，阿里，小牛，彩云和火山翻译。A concise and elegant Dictionary and Translator macOS App for looking up words and translating text.
Language:Swift10.9k 37 676557
dataelement/bisheng
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
Language:TypeScript9.9k 610 3211.6k
ripperhe/Bob
Bob 是一款 macOS 平台的翻译和 OCR 软件。
9.4k 82 527526
HIllya51/LunaTranslator
视觉小说翻译器 / Visual Novel Translator
Language:C++9.4k 26 1.7k978
zyddnys/manga-image-translator
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)
Language:Python8.8k 57 795863
pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Language:Python8.4k 67 2.3k661
YaoFANGUK/video-subtitle-extractor
视频硬字幕提取，生成srt文件。无需申请第三方API，本地实现文本识别。基于深度学习的视频字幕提取框架，包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
Language:Python8k 52 332831
the-paperless-project/paperless
Scan, index, and archive all of your paper documents
Language:Python7.9k 181 452502
microsoft/ailab
Experience, Learn and Code the latest breakthrough innovations with Microsoft AI
Language:C#7.8k 423 541.4k
bytedance/Dolphin
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
Language:Python7.7k 59 132634
tesseract-ocr/tessdata
Trained models with fast variant of the "best" LSTM models + legacy models
7.2k 225 1682.4k
CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
Language:Python6.9k 41 1k769
adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Language:Python6.7k 42 90530
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Language:Python6.6k 52 315543
chineseocr/chineseocr
yolo3+ocr
Language:Python6.1k 188 5441.7k
Swift-AI/Swift-AI
The Swift machine learning library.
Language:Swift6.1k 325 52553
xushengfeng/eSearch
截屏离线OCR 搜索翻译以图搜图贴图录屏万向滚动截屏屏幕翻译 Screenshot Offline OCR Search Translate Search for picture Paste the picture on the screen Screen recorder Omnidirectional scrolling screenshot Screen translator 支持Windows Linux macOS
Language:TypeScript6k 35 378449
axa-group/Parsr
Transforms PDF, Documents and Images into Enriched Structured Data
Language:JavaScript6k 81 163317
PaddlePaddle/PaddleX
All-in-One Development Tool based on PaddlePaddle
Language:Python5.9k 95 1.7k1.1k
mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Language:Python5.6k 48 435591

ocr

tesseract-ocr/tesseract

PaddlePaddle/PaddleOCR

opendatalab/MinerU

hiroi-sora/Umi-OCR

siyuan-note/siyuan

naptha/tesseract.js

ShareX/ShareX

paperless-ngx/paperless-ngx

ocrmypdf/OCRmyPDF

JaidedAI/EasyOCR

lukas-blecher/LaTeX-OCR

pot-app/pot-desktop

Unstructured-IO/unstructured

sml2h3/ddddocr

DayBreak-u/chineseocr_lite

getomni-ai/zerox

tisfeng/Easydict

dataelement/bisheng

ripperhe/Bob

HIllya51/LunaTranslator

zyddnys/manga-image-translator

pymupdf/PyMuPDF

YaoFANGUK/video-subtitle-extractor

the-paperless-project/paperless

microsoft/ailab

bytedance/Dolphin

tesseract-ocr/tessdata

CVHub520/X-AnyLabeling

adithya-s-k/omniparse

clovaai/donut

chineseocr/chineseocr

Swift-AI/Swift-AI

xushengfeng/eSearch

axa-group/Parsr

PaddlePaddle/PaddleX

mindee/doctr