hung2003oke's Stars
thinhlpg/vixtts-demo
A Vietnamese Voice Cloning Text-to-Speech Model ✨
konrad-gajdus/miniMNIST-c
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
hoang-quoc-trung/remote-ssh-kaggle-vscode
Instructions for connecting SSH between Kaggle and Visual Studio Code
Boubker10/SafeDriveVision
SafeDriveVision is a computer vision project aimed at enhancing road safety. This project leverages deep learning models to detect and alert in real-time the dangerous behaviors of drivers, such as using a phone while driving or showing signs of drowsiness.
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
ajcr/100-pandas-puzzles
100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
zou-group/textgrad
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
mindsdb/mindsdb
The platform for building AI from enterprise data
andrewyng/translation-agent
Belval/TextRecognitionDataGenerator
A synthetic data generator for text recognition
VamosC/CLIP4STR
An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
noorkhokhar99/car-parking-finder
Yousef-Nasr/Football-Analysis
This project utilizes YOLOv8 for player detection, means to cluster players into two teams, and ByteTracker for player tracking in football matches. The combination of these technologies enables comprehensive analysis of player movements and calculate team ball control during game.
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
leduckhai/MultiMed
Multilingual Multitask Multipurpose Medical Speech Recognition
phatjkk/nttu-chatbot
NTTU Chatbot - A student support chatbot using LLM + Document Retriever (RAG) in Vietnamese
mauvilsa/imgtxtenh
Tool for enhancing noisy scanned text images
gopig99/AR-filters-MediPipe
KindXiaoming/pykan
Kolmogorov Arnold Networks
dinhquy-nguyen-1704/ZaloAI2023-Elementary-Math-Solving
Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving
st--/annotate-equations
LaTeX package and annotated examples for annotating equations using TikZ.
eufouria/toxic-text-classification
API for toxic text classification, utilized pre-trained Distilbert and trained on Kaggle datasets. It helps identify and handle toxic content.
phuctrang/Intrusion-Detection-System-IDS-
dusty-nv/jetson-inference
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
torphix/infini-attention
Pytorch implementation of https://arxiv.org/html/2404.07143v1
Sana-0511/resnet
milesial/Pytorch-UNet
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
foivospar/Arc2Face
[ECCV 2024 Oral🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces