NhiNguyen34's Stars
amazon-science/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
FSoft-AI4Code/HyperAgent
Generalist Software Agents to Solve Soware Engineering Tasks
DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
NhiNguyen34/vitextcaps-captioning
This is an open-source repository for constructing and researching fusion-style deep learning methods combined with pretrained vision models and pretrained language models for visual question answering (VQA) task in Vietnamese.
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
wzk1015/CNMT
[AAAI 2021] Confidence-aware Non-repetitive Multimodal Transformers for TextCaps
NhiNguyen34/DS304.N21.KHDL
NhiNguyen34/DS-BioMed-at-ImageCLEFmedical-Caption-2024
DS@BioMed-at-ImageCLEFmedical-Caption-2024
clu0/unet.cu
UNet diffusion model in pure CUDA
ashleve/lightning-hydra-template
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
ronghanghu/mmf
A modular framework for Visual Question Answering research by the FAIR A-STAR team
NhiNguyen34/uav-detection
This repository provides checkpoints for YOLOv8, RT-DETR, and YOLOv10 models, each fine-tuned on the VisDrone dataset. These models are optimized specifically for detecting UAV (Unmanned Aerial Vehicle) images.
karpathy/LLM101n
LLM101n: Let's build a Storyteller
tranhungnghiep/AI-Conference-Info
Extensive acceptance rates and information of main AI conferences
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
bowang/mathml2latex
Convert MathML to Latex for OneNote to Markdown
NhiNguyen34/yolov10
YOLOv10: Real-Time End-to-End Object Detection
LiuShuaiyr/UAVMOT
multi-object tracking meets moving UAV
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
adityachintala/UAV-tracking-tank
MILITARY TANK OBJECT DETECTION AND TRACKING USING UAV
tannd-ds/realtime-reid
Realtime Person Re-Identification using Kafka, Spark and Deep Learning
tannd-ds/fe-football-champ
Front-end Repo for Football Championship Management using Nuxt 3
igorbarinov/awesome-data-engineering
A curated list of data engineering tools for software developers
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Kaggle/kaggle-api
Official Kaggle API
facebookresearch/fastMRI
A large-scale dataset of both raw MRI measurements and clinical MRI images.
ChenyuGAO-CS/SMA
The imdb files with SBD-Trans OCR for TextVQA dataset.
NhiNguyen34/DS105.O11.KHDL
google-research/multilingual-t5
NhiNguyen34/ViTextCaps