ocr-python

There are 398 repositories under ocr-python topic.

hiroi-sora/Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。
Language:Python31.4k 170 6763.2k
breezedeus/CnOCR
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTorch/MXNet 的中文/英文 OCR Python 包。】
Language:Python3.5k 67 255519
CatchTheTornado/text-extract-api
Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
Language:Python2.5k 12 74200
hiroi-sora/Umi-OCR_v2
结束和新的开始
Language:QML935 13 6476
Psarpei/Multi-Type-TD-TSR
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
Language:Jupyter Notebook270 9 1553
maxent-ai/ocrpy
OCR, Archive, Index and Search: Implementation agnostic OCR framework.
Language:Jupyter Notebook221 5 211
MrZilinXiao/Hyper-Table-OCR
A carefully-designed OCR pipeline for universal boarded table recognition and reconstruction.
Language:C++175 1 1445
nathanaday/RealTime-OCR
Perform text detection in a variety of languages with your computer webcam using Google Tesseract OCR and OpenCV. This script achieves a real-time OCR effect via multi-threading.
Language:Python160 4 341
ankandrew/fast-plate-ocr
Lightweight & fast OCR models for license plate text recognition.
Language:Python129 4 3623
ilic5000/pabkvizgenerator
Anansi is a computer vision (cv2 and FFmpeg) + OCR (EasyOCR and tesseract) python-based crawler for finding and extracting questions and correct answers from video files of popular TV game shows in the Balkan region.
Language:Python124 5 16
blueaxis/Cloe
Manga OCR snipping application for desktop
Language:Python112 2 239
prp-e/persian_ocr_project
A FLOSS software for Persian Optical Character Recognition
Language:Jupyter Notebook89 10 111
nainiayoub/pdf-text-data-extractor
PDF text data extraction web app with OCR for scanned documents
Language:Python87 4 449
kartikgill/Easter2
Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION
Language:Jupyter Notebook79 2 1822
shibing624/imgocr
Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理，可实现 CPU 上毫秒级的 OCR 精准预测，通用场景中英文OCR达到开源SOTA。
Language:Python65 1 59
gnana70/tamil_ocr
OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes
Language:Python61 4 610
bentoml/BentoOCR
Turn any OCR models into online inference API endpoint 🚀 🌖
Language:Python54 5 34
ksasso1028/EasyOCR-cpp
Custom C++ implementation of deep learning based OCR
Language:C++54 2 1313
X-T-E-R/my-little-ocr
MyLittleOCR 是一个统一的 OCR 库包装器，提供一致的 API，便于集成和切换多个 OCR 引擎。 MyLittleOCR is a unified OCR wrapper providing a consistent API for seamless integration and switching between multiple OCR engines.
Language:Python51 2 03
algertc/ALPR-Database
Fully-Featured Automated License Plate Recognition Database Platform for Blue Iris + CodeProject AI Server 🚘
Language:JavaScript49 7 126
MauryaRitesh/OCR-Python
Optical Character Recognition in Python.
Language:Jupyter Notebook41 3 221
oidlabs-com/Lexoid
Multimodal document parser
Language:Python41 3 496
sepehrraisi/Persian-OCR
A project to bring high accuracy OCR to Persian language.
Language:Shell35 1 16
xtekky/zefoy-captcha-solver
Zefoy OCR captcha solver | 99% accurate
Language:Python33 2 08
sergiocorreia/quipucamayoc
dev repo for article
Language:Python28 6 75
ASACHIT/OCR-django-app
A django webapp to scan text from image , faster, easy & efficient
Language:CSS27 2 110
Unstructured-IO/community
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
27 26 298
Baskar-forever/TableExtractor-Advanced-PDF-Table-Extraction
PDF Table Extractor is an innovative Python project designed to tackle the challenge of extracting tables from scanned PDF documents. Leveraging advanced optical character recognition (OCR) and image processing techniques.
Language:Jupyter Notebook26 1 06
pgplarosa/Employee-Monitoring-Using-Object-Detection
Deep Learning Individual Project - March 03, 2022.
Language:HTML25 1 03
Jan-9C/deathcounter_ocr
A python script which detects death messages by using OCR and displays a corrosponding deathcounter. Preconfigured for Elden Ring
Language:Python22 1 115
FtmsdtHosseini/IDPL-PFOD
An Image Dataset of Printed Farsi Text for OCR Research
21 1 02
ayseceyda/analog-meter-reading-openCV
AMR (automatic meter reading) project for analog meters, built with openCV+Python using basic OCR and image processing knowledge.
Language:Jupyter Notebook20 1 011
butlerlabs/docai
DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning models for a wide range of applications
Language:Python20 2 81
hungtooc/transaction_ocr
The open source extract transaction infomation by using OCR.
Language:Python20 3 06
yunwoong7/korean_ocr_using_paddleOCR
This is a Korean OCR Python code using the paddleOCR library
Language:Jupyter Notebook19 1 11
Hermann-web/python-OCR
Converting invoice pdf to image, image to text and then get, from the text, invoice informations like invoice number or vendor name
Language:Jupyter Notebook18 1 02

ocr-python

hiroi-sora/Umi-OCR

breezedeus/CnOCR

CatchTheTornado/text-extract-api

hiroi-sora/Umi-OCR_v2

Psarpei/Multi-Type-TD-TSR

maxent-ai/ocrpy

MrZilinXiao/Hyper-Table-OCR

nathanaday/RealTime-OCR

ankandrew/fast-plate-ocr

ilic5000/pabkvizgenerator

blueaxis/Cloe

prp-e/persian_ocr_project

nainiayoub/pdf-text-data-extractor

kartikgill/Easter2

shibing624/imgocr

gnana70/tamil_ocr

bentoml/BentoOCR

ksasso1028/EasyOCR-cpp

X-T-E-R/my-little-ocr

algertc/ALPR-Database

MauryaRitesh/OCR-Python

oidlabs-com/Lexoid

sepehrraisi/Persian-OCR

xtekky/zefoy-captcha-solver

sergiocorreia/quipucamayoc

ASACHIT/OCR-django-app

Unstructured-IO/community

Baskar-forever/TableExtractor-Advanced-PDF-Table-Extraction

pgplarosa/Employee-Monitoring-Using-Object-Detection

Jan-9C/deathcounter_ocr

FtmsdtHosseini/IDPL-PFOD

ayseceyda/analog-meter-reading-openCV

butlerlabs/docai

hungtooc/transaction_ocr

yunwoong7/korean_ocr_using_paddleOCR

Hermann-web/python-OCR