tesseract-ocr

There are 1229 repositories under tesseract-ocr topic.

  • Aadhaar-Card-OCR

    Extract text information from Aadhaar Card using tesseract-ocr :sunglasses:

    Language:Python132
  • receipt-ocr

    An efficient OCR engine for receipt image processing.

    Language:Python127
  • tessdata_chi

    Retrained Tesseract OCR model for Chinese

    Language:Python123
  • ChartReader

    Fully automated end-to-end framework to extract data from bar plots and other figures in scientific research papers using modules such as OpenCV, AWS-Rekognition.

    Language:Jupyter Notebook120
  • Manga-Translator-TesseractOCR

    Automatically translates manga pages with Tesseract-OCR and Google Translate API for Python

    Language:Python112
  • OCR_FontsSearchEngine

    A OCR Search Engine With Tesseract Nutch Solr And PHP

    Language:JavaScript111
  • ocr

    Nextcloud OCR (optical character recoginition) processing for images with tesseract-js

    Language:JavaScript108
  • tesseract-ocr-re

    Tesseract 4 OCR Runtime Environment - Docker Container

    Language:Shell101
  • fastmrz

    fastmrz

    ⚡Extracting the Machine Readable Zone (MRZ) from passport or any document images

    Language:Python96
  • Opencv-ImageBase

    对任何文字图片来源进行预处理结合tesseract-ocr进行识别,主要模块有纸张边缘查找,四角定位,仿射变换,二值化,模糊处理,摩尔纹处理,噪点过滤,图片exif,jfif信息处理,表格线删除,图片阴影处理,傅里叶图片矫正处理等等。。本程序依赖于与图片exif,jfif信息进行分类处理,传入时需带有信息

    Language:C++92
  • Indian-Number-Plate-Recognition-System

    Indian Number Plate Recognition System built using OpenCV

    Language:Python89
  • openCV_Tesseract_test

    Test program to read characters on labels using openCV and Tesseract-OCR

    Language:C++88
  • Automatic-License-Plate-Detection

    In this project we utilize OpenCV t in order to identify the license number plates and the python pytesseract for the characters and digits extraction from the plate. As well this project will presents a robust and efficient ALPR system based on the state-of-the-art YOLO object detector. We build Web App with a Python program that automatically recognizes the License Number Plate by the end of this journi. The results have shown that the trained neural network is able to perform with high accuracy of nearly 90-95 percent in recognizing license plates in low resolution images using this system.

    Language:Jupyter Notebook83
  • tess5train-fonts

    Files and Scripts to run Tesseract 5 LSTM Training using fonts

    Language:HTML79
  • ocrTranslator

    Convert captured images to text using BaiduOCR, GoogleOCR, WindowsOCR, tesseractOCR, RapidOCR or Capture2Text, and translate the resulting text using Google, Chatgpt, Edgegpt, DeepL or many more. Desktop application with a nice GUI provided by customtkinter.

    Language:Python78
  • breach-protocol-autosolver

    breach-protocol-autosolver

    Solve breach protocol minigame in second(s). Windows/Linux/GeForce Now/Google Stadia. Every language.

    Language:TypeScript76
  • spark-pdf

    spark-pdf

    PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it

    Language:Scala75
  • obs-ocr

    OCR Plugin for OBS based on Tesseract

    Language:CMake74
  • Audio-book-generator

    Convert your ebooks to audiobooks. 📖->🎧

    Language:Python74
  • Business-Card-Reader-BCR-

    Android app to extract name, email and phone from business card using OCR library tess-two (Fork of Tesseract Tools for Android) and phone's camera.

    Language:Java74
  • HighlightTranslator

    Highlight Translator can help you to translate the words quickly and accurately. By only highlighting, copying, or screenshoting the content you want to translate anywhere on your computer (ex. PDF, PPT, WORD etc.), the translated results will then be automatically displayed before you.

    Language:Python71
  • PDFtoTXT

    Python code to read text from a PDF file (OCR).

    Language:Python70
  • Automatic-Number-plate-detection-for-Indian-vehicles

    EC351: Introduction to Algorithms

    Language:Python67
  • Aristocrat

    Aristocrat is a menu bar utility that allows you to decode barcodes and OCR text on your screen.

    Language:Objective-C++67
  • handReacting

    Text to Handwriting converter made using React.

    Language:JavaScript65
  • rust-paddle-ocr

    高性能OCR库,有PaddleOCR v4/v5模型。支持文本检测/识别、多语言(中文/英文/日文)。 提供Rust库 + CAPI动态库 + CLI工具,轻松集成 调用简单 开箱即用。 High-performance OCR library, supports multiple languages ​​(Chinese/English/Japanese), provides Rust crate + C API + CLI tools.

    Language:Rust63
  • Custom-OCR-YOLO

    Custom-OCR-YOLO

    YOLO for custom object detection and passing the detected objects to Tesseract

    Language:Python62
  • Lens

    🔍 Lens is an opt-in search engine and data collection tool to aid content discovery of the distributed web

    Language:Go61
  • How-to-use-tesseract-ocr-4.0-with-csharp

    How to use Tesseract OCR 4.0 with C#

  • TableExtraction

    A line-based framework to detect and extract tabular data in JSON format from raster images using computer vision and Tesseract OCR.

    Language:Python59
  • TesseractStudio.Net

    A free Windows graphical interface to the Tesseract 4.0 OCR engine.

  • tesseract-ocr-elixir

    This package is a wrapper of Tesseract OCR. Helping to read characters on an image.

    Language:Elixir59
  • SceneTextRecognitioniOS

    A scene text recognition demo app using Vision framework and tesseract

    Language:Objective-C58
  • FarsiOCR

    An OCR application for Farsi/ Persian documents.

    Language:Python57
  • ocreval

    Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support

    Language:C57
  • spectacle-ocr-screenshot

    A simple utility to automatically extract text from spectacle on plasma desktops

    Language:Makefile55