0AlphaZero0

French Data Scientist loving machine learning

Aquila Data Enabler, Paris, France

0AlphaZero0's Stars

tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
Language: C++ 65.5k 1.7k 2.7k 9.8k
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Language: Python 47.5k 449 9.5k 8.1k
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Language: Python 26k 318 1k 3.3k
andrewyng/aisuite
Simple, unified interface to multiple Generative AI providers
Language: Python 11.8k 142 701.1k
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Language: Python 9.2k 57 5571.5k
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Language: Python 6.1k 51 310491
NMAC427/SwiftOCR
Fast and simple OCR library written in Swift
Language: Swift 4.6k 154 147480
open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Language: Python 4.5k 58 901755
mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Language: Python 4.4k 42 397485
clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Language: Jupyter Notebook 3.8k 85 3961.1k
RapidAI/RapidOCR
📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.
Language: Python 3.7k 48 158412
eragonruan/text-detection-ctpn
text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network
Language: Python 3.4k 131 4601.3k
aim-uofa/AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Language: Python 3.4k 83 546651
clovaai/CRAFT-pytorch
Official implementation of Character Region Awareness for Text Detection (CRAFT)
Language: Python 3.2k 70 184919
deepdoctection/deepdoctection
A Repo For Document AI
Language: Python 2.8k 18 192150
pikepdf/pikepdf
A Python library for reading and writing PDF, powered by QPDF
Language: Python 2.3k 36 439194
chezou/tabula-py
Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame
Language: Python 2.2k 45 283297
Belval/pdf2image
A python module that wraps the pdftoppm utility to convert PDF to PIL Image object
Language: Python 1.7k 17 201200
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
Language: C++ 1.7k 40 194190
JonathanLink/PDFLayoutTextStripper
Converts a pdf file into a text file while keeping the layout of the original pdf. Useful to extract the content from a table in a pdf file for instance. This is a subclass of PDFTextStripper class (from the Apache PDFBox library).
Language: Java 1.6k 53 34214
faustomorales/keras-ocr
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
Language: Python 1.4k 47 217366
spatie/pdf-to-image
Convert a pdf to an image
Language: PHP 1.4k 19 132227
mlco2/codecarbon
Track emissions from Compute and recommend ways to reduce their impact on the environment.
Language: Python 1.3k 21 332195
paulocoutinhox/pdfium-lib
PDFium - Project to compile PDFium library to multiple platforms.
Language: Python 963 13 9196
VILA-Lab/ATLAS
A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171
Language: Python 950 23 996
jalan/pdftotext
Simple PDF text extraction
Language: Python 911 17 113103
spatie/pdf-to-text
Extract text from a pdf
Language: PHP 903 18 37125
Unstructured-IO/unstructured-api
Language: Python 683 27 135152
ml-energy/zeus
Deep Learning Energy Measurement and Optimization
Language: Python 242 9 5231
ja-mcm/OCRfixr
A context-based spellchecker for correcting OCR output.
Language: Python 18 2 04
