SanKumSan
Computer Vision | Deep Learning | Machine Learning Engineer. Software development with DevOps. Azure and Networking
Germany
SanKumSan's Stars
tudarmstadt-lt/GermaNER
GermaNER: Free Open German Named Entity Recognition Tool
anisha2102/docvqa
Document Visual Question Answering
jalammar/jalammar.github.io
Build a Jekyll blog in minutes, without touching the command line.
SamLynnEvans/Transformer
Transformer seq2seq model, program that can build a language translator from parallel corpus
GokuMohandas/Made-With-ML
Learn how to design, develop, deploy and iterate on production-grade ML applications.
abhishekkrthakur/bert-entity-extraction
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
doc-analysis/DocBank
DocBank: A Benchmark Dataset for Document Layout Analysis
google-research/bert
TensorFlow code and pre-trained models for BERT
AndriyMulyar/bert_document_classification
architectures and pre-trained models for long document classification.
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
microsoft/opensource.microsoft.com
This is the source code to the Microsoft Open Source site featuring projects, program information, and "get involved" pages. This site is published at opensource.microsoft.com and managed by the Microsoft Open Source Programs Office (OSPO).
suvoooo/Learn-TensorFlow
Learning Tensorflow Step by Step:: Concepts, Examples & Applications
marcopeix/Deep_Learning_AI
krishnaik06/Advanced-CNN-Architectures
kuangliu/torchcv
TorchCV: a PyTorch vision library mimics ChainerCV
groverpr/deep-learning
deepakrox/FastObjectLocalization
pavitrakumar78/Street-View-House-Numbers-SVHN-Detection-and-Classification-using-CNN
A 2-CNN pipeline to do both detection (using bounding box regression) and classification of numbers on SVHN dataset.
nalepae/bounding-box
Bounding Box is a library to plot pretty bounding boxes with a simple Python API.
dhlab-epfl/dhSegment
Generic framework for historical document processing
tblock/10kGNAD
Ten Thousand German News Articles Dataset for Topic Classification
ultralytics/yolov3
YOLOv3 in PyTorch > ONNX > CoreML > TFLite
ShairozS/Scan2Topic
A system for reading scanned documents and grouping them into high level topics
qq456cvb/multi-stage-detection
Tensorflow implementation of "MULTI-STAGE REINFORCEMENT LEARNING FOR OBJECT DETECTION"
gitanat/simple-ocr-opencv
A simple python OCR engine using opencv
jasmcaus/opencv-course
Learn OpenCV in 4 Hours - Code used in my Python and OpenCV course on freeCodeCamp.
javedsha/text-classification
Machine Learning and NLP: Text Classification using python, scikit-learn and NLTK
sambalshikhar/Document-Image-Classification-with-Intra-Domain-Transfer-Learning-and-Stacked-Generalization-of-Deep
RVL-CDIP could be looked at as the equivalent of ImageNet for the document image community. It’s certainly the largest we’ve seen in the literature. There are 400,000 total document images in the dataset. The dataset contains much noise and variance in composition of each document class. Uncompressed, the dataset size is ~100GB, and comprises 16 classes of document types, with 25,000 samples per classes. Example classes include email, resume, and invoice. Achieved an Accuracy of over 93% which beat the benchmark score of 92% based on https://paperswithcode.com/sota/document-image-classification-on-rvl-cdip