Pinned Repositories
3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
coursera-deep-learning-specialization
Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai: (i) Neural Networks and Deep Learning; (ii) Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization; (iii) Structuring Machine Learning Projects; (iv) Convolutional Neural Networks; (v) Sequence Models
donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
huycq1712
Config files for my GitHub profile.
Minitorch-module0
Module 0 - completed
Qwen-Agent
Agent framework and applications built upon Qwen2.x, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
SPTSv2
The official implementation of SPTS v2: Single-Point Text Spotting
TESTR
(CVPR 2022) Text Spotting Transformers
huycq1712's Repositories
huycq1712/huycq1712
Config files for my GitHub profile.
huycq1712/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
huycq1712/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
huycq1712/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
huycq1712/Qwen-Agent
Agent framework and applications built upon Qwen2.x, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
huycq1712/SPTSv2
The official implementation of SPTS v2: Single-Point Text Spotting
huycq1712/TESTR
(CVPR 2022) Text Spotting Transformers
huycq1712/Awesome-Document-Image-Rectification
A comprehensive list of awesome document image rectification papers.
huycq1712/awesome-Face_Recognition
papers about Face Detection; Face Alignment; Face Recognition && Face Identification && Face Verification && Face Representation; Face Reconstruction; Face Tracking; Face Super-Resolution && Face Deblurring; Face Generation && Face Synthesis; Face Transfer; Face Anti-Spoofing; Face Retrieval;
huycq1712/awesome-scalability
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
huycq1712/Awesome-Table-Recognition
A curated list of resources dedicated to table recognition
huycq1712/doc3D-dataset
A hybrid dataset for document unwarping (Paper: https://www3.cs.stonybrook.edu/~cvl/projects/dewarpnet/storage/paper.pdf)
huycq1712/docdewarp
huycq1712/docile
DocILE: Document Information Localization and Extraction Benchmark
huycq1712/dogcat
huycq1712/ERNIE-Layout-Pytorch
An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.
huycq1712/FAST
Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
huycq1712/legal-chatbot
huycq1712/MATRN
Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features (MATRN) in ECCV 2022.
huycq1712/MC-OCR
The task aims at extracting required fields in receipts captured by mobile devices :smile:
huycq1712/NL-Car
Vehicle search model based on natural language description sentences
huycq1712/rag
huycq1712/streamlit-to-heroku
huycq1712/system-design
Learn how to design systems at scale and prepare for system design interviews
huycq1712/textdetect
huycq1712/u2net
huycq1712/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
huycq1712/vietocr
Transformer OCR
huycq1712/Violence-detection-project
Computer vision project
huycq1712/yolov10
YOLOv10: Real-Time End-to-End Object Detection