ocr-python

There are 479 repositories under ocr-python topic.

  • ocr-to-docx

    Language:Python10
  • IDPL-PFOD

    An Image Dataset of Printed Farsi Text for OCR Research

  • docai

    DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning models for a wide range of applications

    Language:Python20
  • transaction_ocr

    The open source extract transaction infomation by using OCR.

    Language:Python20
  • pdftotext

    A simple pdftotext conversion tool for Windows 8.1/10/11 and FEDORA/UBUNTU/DEBIAN/ARCH based linux distros using poppler-utils and Google's tesseract-ocr.

    Language:Python20
  • python-OCR

    Converting invoice pdf to image, image to text and then get, from the text, invoice informations like invoice number or vendor name

    Language:Jupyter Notebook19
  • OpenCV-OCR

    OpenCV OCR (Optical Character Recognition)

    Language:Python18
  • Menu_Reader

    This is a web application that converts restaurant menus into text using OCR. That text is then sent through a Machine Learning model to output a list of menu items using classification and NLP.

    Language:Jupyter Notebook17
  • PDF-Converter

    PDF-Converter

    Convert your PDF files into word documents or different image formats locally without uploading some servers unknown.

    Language:Python17
  • Markdownify

    Convert documents, images to high-quality Markdown using Vision LLMs. Built for RAG ingestion pipelines.

    Language:Python16
  • ScaleDP

    ScaleDP

    ScaleDP is an Open-Source extension of Apache Spark for Document Processing

    Language:Python16
  • taco-box

    An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR

    Language:Jupyter Notebook15
  • Multimodal-OCR

    Vision Language Model : tailored for tasks that involve [messy] optical character recognition (ocr), image-to-text conversion, and math problem solving with latex formatting.

    Language:Python14
  • MTG-OCR-Imagehashing

    A self contained jupyter notebook demo showing how Tesseract OCR & Imagehashing can be used to recognize Magic Cards. This demo is meant to show how slow & inefficient these methods can be.

    Language:Jupyter Notebook14
  • EasyOCR-based-Automatic-Bangla-License-Plate-Recognition

    EasyOCR is basically Optical Character Reading package that belongs from PyTorch. Using this texts from the images can be extracted easily, documents, texts can be scanned. For License Plate's Number Recognition, it can be applicable easily as it can extract the texts. About License Plate's Number, there are several language's character plates are in the world, Bangla is one of them. Here EasyOCR is applied for Bangla Character Based License Plate Recognition.

    Language:Jupyter Notebook14
  • Image-table-to-text-

    Extracting tabular data from the image and storing it in CSV.

    Language:Python14
  • ISBN-Book-OCR

    Image to text recognition for ISBN numbers from books.

    Language:Python14
  • CaptchaSolver

    A tiny program to solve the thousand captcha image for testing the quality of the OCR. (Optical character recognition)

    Language:PHP14
  • OCR-Wizard

    A powerful and user-friendly tool based on OCRmyPDF, offering a seamless GUI for conversion of image-based PDFs into searchable text.

    Language:Python13
  • Persian-OCR-Streamlit

    Persian OCR allows users to scan documents and extract text from scanned image.

    Language:Python13
  • docling_ocr

    A powerful Python package for extracting text from images and documents using the SmolDocling-256M-preview advanced LLM-based models.

    Language:Python12
  • AdvAITelegramBot

    Telegram Advance AI ChatBot: GPT-4.1, Qwen-3, DeepSeek-R1, Dall-E-3, Flux, Flux-Pro, Dall-E Model, OCR and Google Voice2Text.

    Language:Python12
  • EasyPaddleOCR

    A simple package for PaddleOCR on CPU and GPU using PyTorch

    Language:Python12
  • VideoSubOCR

    OCR automation for VideoSubFinder

    Language:Python12
  • mathpx

    OCR for Mathematical equations

    Language:Python12
  • Tools_DeepSeekOCR

    A Windows-based screenshot OCR utility powered by DeepSeek-OCR. This tool allows users to quickly capture screen regions and perform high-accuracy Optical Character Recognition (OCR) directly on the captured image, leveraging the powerful DeepSeek-OCR model. It supports local model deployment and features real-time model output streaming.

    Language:Python11
  • queueit-captcha-handler

    Queue-it Captchas (BotDetect) Handler API

    Language:Python11
  • Discord-OCR-Bot

    This is an OCR Bot for Discord made using OpenCV and Pytesseract

    Language:Python11
  • pdf-ocr

    Converts scanned PDF documents to multiple formats using Optical Character Recognition

    Language:HTML10
  • trOCR

    Handwritten Text Recognition

    Language:Python10
  • screenshot-OCR

    Desktop application that lets the user extract text from images by just marking a section of the screen, instead of having to load an image file. Serves as a front-end for the Tesseract OCR Engine.

    Language:Python10
  • HandWritenSignatureDetection

    Deep Learning based Signature Detection (YOLOv5x)

    Language:Python10
  • Repo-2020

    Machine Learning, Google Cloud and Quantitative Algorithms for Stocks Trading

    Language:Jupyter Notebook10
  • pisahkan-ktp

    Python Package for Information Extraction and Segmentation - Segmentasi KTP Indonesia - Indonesian ID Card - Information Segmentation

    Language:Python9
  • Proyecto-Deteccion_de_Matriculas

    Se usa YOLOv10 para detectar vehículos en la vía, para luego detectar sus matriculas y usar tesseract-OCR para leer las matrículas

    Language:Jupyter Notebook9
  • Akshara-Jaana

    A OCR Project for Reading New and Old Kannada Texts

    Language:Python9