toanhvu2

toanhvu2's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python72.1k 587 08.6k
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
Language:Jupyter Notebook65.4k 560 12933.5k
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Language:Jupyter Notebook34k 367 1074.2k
Mintplex-Labs/anything-llm
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Language:JavaScript27.8k 217 1.8k2.8k
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda24.6k 247 1412.8k
Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
Language:Python13k 57 1141.3k
google-deepmind/alphafold
Open source code for AlphaFold 2.
Language:Python12.9k 230 8632.3k
kyutai-labs/moshi
Language:Python6.9k 77 85537
bklieger-groq/g1
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
Language:Python3.9k 52 14355
baaivision/Emu3
Next-Token Prediction is All You Need
Language:Python1.9k 33 4775
feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
Language:Python1.6k 20 23123
EurekaLabsAI/ngram
The n-gram Language Model
Language:C1.4k 51 092
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Language:Python1.3k 17 113106
illuin-tech/colpali
The code used to train and run inference with the ColPali architecture.
Language:Python1.2k 15 74106
lamm-mit/PDF2Audio
Language:Jupyter Notebook1.1k 18 14138
bklieger-groq/stockbot-on-groq
StockBot powered by Groq: Lightning Fast AI Chatbot that Responds With Live Interactive Stock Charts, Financials, News, Screeners, and More. Powered by Llama3-70b on Groq, Vercel AI SDK, and TradingView Widgets.
Language:TypeScript1k 8 12175
budzianowski/multiwoz
Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)
Language:Python867 17 62199
mit-han-lab/TinyChatEngine
TinyChatEngine: On-Device LLM Inference Library
Language:C++757 15 4373
facebookresearch/audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
Language:Python466 15 2356
microsoft/BitBLAS
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
Language:Python429 16 7134
peggy1502/Amazing-Resources
List of references and online resources related to data science, machine learning and deep learning.
339 13 098
revdotcom/reverb
Open source inference code for Rev's model
Language:Python336 11 1522
Pints-AI/1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
Language:Python267 5 620
iamarunbrahma/finetuned-qlora-falcon7b-medical
Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset
Language:Jupyter Notebook242 4 227
nvidia-riva/riva-asrlib-decoder
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
Language:Python82 7 2423
syntithenai/opensnips
Open source projects related to Snips https://snips.ai/.
Language:JavaScript54 8 321
joonaskalda/PixIT
Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings" published at Odyssey 2024
Language:Python47 6 33
chentuochao/Target-Conversation-Extraction
This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamics"
Language:Python38 2 14
joshua-decoder/fisher-callhome-corpus
The Fisher and CALLHOME Spanish–English Speech Translation Corpus
Language:JavaScript38 6 112
deepgram/genesys-voicebot-example
A minimal proof-of-concept voicebot built with Deepgram's Genesys AudioHook integration
Language:Python1 3 01