ocr

There are 5156 repositories under ocr topic.

  • tesseract-ocr/tesseract

    Tesseract Open Source OCR Engine (main repository)

    Language:C++63.3k1.7k2.7k9.6k
  • PaddlePaddle/PaddleOCR

    Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

    Language:Python45.2k4449.4k7.9k
  • tesseract.js

    naptha/tesseract.js

    Pure Javascript OCR for more than 100 Languages 📖🎉🖥

    Language:JavaScript35.6k4787052.2k
  • ShareX

    ShareX/ShareX

    ShareX is a free and open source program that lets you capture or record any area of your screen and share it with a single press of a key. It also allows uploading images, text or other types of files to many supported destinations you can choose from.

    Language:C#30.2k5376.5k3.2k
  • Umi-OCR

    hiroi-sora/Umi-OCR

    OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。

    Language:Python28.1k1496172.8k
  • JaidedAI/EasyOCR

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

    Language:Python24.9k3181k3.2k
  • siyuan

    siyuan-note/siyuan

    A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.

    Language:TypeScript23.6k13413.1k1.7k
  • paperless-ngx/paperless-ngx

    A community-supported supercharged version of paperless: scan, index and archive all your physical documents

    Language:Python23.1k1121.7k1.3k
  • opendatalab/MinerU

    A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

    Language:Python21.7k1147531.6k
  • OCRmyPDF

    ocrmypdf/OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

    Language:Python14.4k1371.2k1k
  • LaTeX-OCR

    lukas-blecher/LaTeX-OCR

    pix2tex: Using a ViT to convert images of equations into LaTeX code.

    Language:Python13.1k732751k
  • DayBreak-u/chineseocr_lite

    超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M

    Language:C++11.9k2423692.3k
  • pot-desktop

    pot-app/pot-desktop

    🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

    Language:JavaScript10.8k44744486
  • sml2h3/ddddocr

    带带弟弟 通用验证码识别OCR pypi版

    Language:Python10.6k952131.8k
  • Unstructured-IO/unstructured

    Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

    Language:HTML9.5k631.1k797
  • ripperhe/Bob

    Bob 是一款 macOS 平台的翻译和 OCR 软件。

  • dataelement/bisheng

    BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.

    Language:Python9k9081661.6k
  • the-paperless-project/paperless

    Scan, index, and archive all of your paper documents

    Language:Python7.9k185452501
  • microsoft/ailab

    Experience, Learn and Code the latest breakthrough innovations with Microsoft AI

    Language:C#7.7k423531.4k
  • Easydict

    tisfeng/Easydict

    一个简洁优雅的词典翻译 macOS App。开箱即用,支持离线 OCR 识别,支持有道词典,🍎 苹果系统词典,🍎 苹果系统翻译,OpenAI,Gemini,DeepL,Google,Bing,腾讯,百度,阿里,小牛,彩云和火山翻译。A concise and elegant Dictionary and Translator macOS App for looking up words and translating text.

    Language:Objective-C7.7k31496382
  • getomni-ai/zerox

    PDF to Markdown with vision models

    Language:Python7.1k3065399
  • tesseract-ocr/tessdata

    Trained models with fast variant of the "best" LSTM models + legacy models

  • YaoFANGUK/video-subtitle-extractor

    视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

    Language:Python6.3k44286689
  • Swift-AI/Swift-AI

    The Swift machine learning library.

    Language:Swift6k33259554
  • PyMuPDF

    pymupdf/PyMuPDF

    PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

    Language:Python6k642.1k546
  • chineseocr/chineseocr

    yolo3+ocr

    Language:Python6k1895431.7k
  • clovaai/donut

    Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

    Language:Python5.9k50306477
  • omniparse

    adithya-s-k/omniparse

    Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

    Language:Python5.9k3681473
  • Parsr

    axa-group/Parsr

    Transforms PDF, Documents and Images into Enriched Structured Data

    Language:JavaScript5.9k82163310
  • zyddnys/manga-image-translator

    Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/

    Language:Python5.5k50601580
  • jonaswinkler/paperless-ng

    A supercharged version of paperless: scan, index and archive all your physical documents

    Language:Python5.4k53671353
  • eSearch

    xushengfeng/eSearch

    截屏 离线OCR 搜索翻译 以图搜图 贴图 录屏 万向滚动截屏 屏幕翻译 Screenshot Offline OCR Search Translate Search for picture Paste the picture on the screen Screen recorder Omnidirectional scrolling screenshot Screen translator

    Language:TypeScript5.1k31283392
  • PaddlePaddle/PaddleX

    All-in-One Development Tool based on PaddlePaddle(飞桨低代码开发工具)

    Language:Python5k911.2k974
  • Layout-Parser/layout-parser

    A Unified Toolkit for Deep Learning Based Document Image Analysis

    Language:Python5k74150474
  • NMAC427/SwiftOCR

    Fast and simple OCR library written in Swift

    Language:Swift4.6k154147482
  • Tencent/TNN

    TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the advantages of good extensibility and high performance from existed open source efforts. TNN has been deployed in multiple Apps from Tencent, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome to work in collaborative with us and make TNN a better framework.

    Language:C++4.4k91954769