eafaizal's Stars
cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
lmcinnes/umap
Uniform Manifold Approximation and Projection
bentoml/BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
tesseract-ocr/tessdata
Trained models with fast variant of the "best" LSTM models + legacy models
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
zenml-io/zenml
ZenML 🙏: The bridge between ML and Ops. https://zenml.io.
deanmalmgren/textract
extract text from any document. no muss. no fuss.
jalammar/ecco
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).
ELS-RD/transformer-deploy
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
NVlabs/noise2noise
Noise2Noise: Learning Image Restoration without Clean Data - Official TensorFlow implementation of the ICML 2018 paper
zacharywhitley/awesome-ocr
cdfoundation/sig-mlops
CDF SIG MLOps
rrenaud/Gibberish-Detector
A small program to detect gibberish using a Markov Chain
google-research/byt5
Psarpei/Multi-Type-TD-TSR
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
shabie/docformer
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)
bethgelab/robust-detection-benchmark
Code, data and benchmark from the paper "Benchmarking Robustness in Object Detection: Autonomous Driving when Winter is Coming" (NeurIPS 2019 ML4AD)
goodmami/penman
PENMAN notation (e.g. AMR) in Python
phamquiluan/jdeskew
ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation
HazyResearch/fonduer-tutorials
A collection of simple tutorials for using Fonduer
juglab/FourierImageTransformer
Fourier Image Transformer (FIT) can solve relevant image analysis tasks in Fourier space.
massanishi/document_similarity_algorithms_experiments
Document similarity algorithms experiment - Jaccard, TF-IDF, Doc2vec, USE, and BERT.
jstray/deepform
Using ML to extract campaign finance data from messy forms for journalism
juglab/DivNoising
DivNoising is an unsupervised denoising method to generate diverse denoised samples for any noisy input image. This repository contains the code to reproduce the results reported in the paper https://openreview.net/pdf?id=agHLCOBM5jP
KGQA/QALD_9_plus
QALD-9-Plus Dataset for Knowledge Graph Question Answering
andrewk1/correctandsmooth
Simple correct&smooth implementation in PyTorch.
BordiaS/python-machine-learning-book
The "Python Machine Learning" book code repository and info resource
KoryakovDmitry/deep-image-orientation-angle-detection
slbinilkumar/tessdata_engrupee
traineddata for English including the rupee symbol