Pinned Repositories
128D-Facenet-LFW-Embedding-Visualisation
128D Facenet Embedding Visualisation
CRNN_CTC_English_Handwriting_Recognition
English Handwriting Recognition with CRNN and CTC Loss
Data-Science-Resources
A curated list of resources to help you get started with Data Science.
DB_text_minimal
[WIP] A Pytorch implementation of DB-Text - Real-time Scene Text Detection with Differentiable Binarization
framler
[DEPRECATED] AutoCrawler - automates extracting the main information from websites
KIE_invoice_minimal
Key information extraction from invoice documents with a Graph Convolution Network
kuzushiji_recognition
[Late Submission] Solution for Kuzushiji recognition (Kaggle competition)
LDA_Viblo_Recommender_System
Simple Recommender System for Viblo Website using LDA (Latent Dirichlet Allocation)
Semantic_Search
[DEPRECATED] Baseline Project for Semantic Searching
Vietnamese_Handwriting_Recognition
[DEPRECATED] Vietnamese Handwriting Recognition with CRNN and CTC Loss
huyhoang17's Repositories
huyhoang17/KIE_invoice_minimal
Key information extraction from invoice documents with a Graph Convolution Network
huyhoang17/MiniGemini
Official implementation for Mini-Gemini
huyhoang17/MLOps-Basic-Example
huyhoang17/ChatIE
Official repository for the ChatIE paper and a tool for IE using ChatGPT. Note: we set a default OpenAI key. See the issues for the solution to the gpt3.5-turbo request limit. The response speed depends on OpenAI (sometimes the official service is too crowded and the speed/model will be slow/overloaded).
huyhoang17/colpali
huyhoang17/DocRes
[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
huyhoang17/EdgeFormer
Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"
huyhoang17/ERNIE-Layout-Pytorch
An unofficial PyTorch implementation of ERNIE-Layout, which was originally released through PaddleNLP.
huyhoang17/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
huyhoang17/GTR
Scene text recognition
huyhoang17/Hermes-Function-Calling
huyhoang17/huyhoang17
huyhoang17/lightning-pose
Accelerated pose estimation and tracking using semi-supervised convolutional networks.
huyhoang17/LLaMA-Adapter
Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
huyhoang17/MRN
MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition (ICCV 2023)
huyhoang17/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
huyhoang17/nougat-latex-ocr
Codes for fine-tuning / evaluating nougat-based image2latex generation models
huyhoang17/ObjectBox
huyhoang17/python-mastery
Advanced Python Mastery (course by @dabeaz)
huyhoang17/segment-anything-fast
A batched offline inference oriented version of segment-anything
huyhoang17/SEMv2
huyhoang17/SEMv3
The official PyTorch implementation of SEMv3.
huyhoang17/StructEqTable-Deploy
A High-efficiency Open-source Toolkit for Table-to-Latex Task
huyhoang17/UniMERNet
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
huyhoang17/Union14M
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
huyhoang17/unitable
UniTable: Towards a Unified Table Foundation Model
huyhoang17/UVDoc
Code for the paper "UVDoc: Neural Grid-based Document Unwarping"
huyhoang17/VLM-R1
Solve Visual Understanding with Reinforced VLMs
huyhoang17/yolo_tracking
A collection of SOTA real-time, multi-object tracking algorithms for object detectors
huyhoang17/yolov5_obb
yolov5 + csl_label (Oriented Object Detection, Rotation Detection, Rotated BBox). Rotated object detection based on yolov5.