toanhvu2's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Mintplex-Labs/anything-llm
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
karpathy/llm.c
LLM training in simple, raw C/CUDA
Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
google-deepmind/alphafold
Open source code for AlphaFold 2.
kyutai-labs/moshi
bklieger-groq/g1
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
baaivision/Emu3
Next-Token Prediction is All You Need
feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
EurekaLabsAI/ngram
The n-gram Language Model
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
illuin-tech/colpali
The code used to train and run inference with the ColPali architecture.
lamm-mit/PDF2Audio
bklieger-groq/stockbot-on-groq
StockBot powered by Groq: Lightning Fast AI Chatbot that Responds With Live Interactive Stock Charts, Financials, News, Screeners, and More. Powered by Llama3-70b on Groq, Vercel AI SDK, and TradingView Widgets.
budzianowski/multiwoz
Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)
mit-han-lab/TinyChatEngine
TinyChatEngine: On-Device LLM Inference Library
facebookresearch/audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
microsoft/BitBLAS
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
peggy1502/Amazing-Resources
List of references and online resources related to data science, machine learning and deep learning.
revdotcom/reverb
Open source inference code for Rev's model
Pints-AI/1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
iamarunbrahma/finetuned-qlora-falcon7b-medical
Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset
nvidia-riva/riva-asrlib-decoder
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
syntithenai/opensnips
Open source projects related to Snips https://snips.ai/.
joonaskalda/PixIT
Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings" published at Odyssey 2024
chentuochao/Target-Conversation-Extraction
This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamics"
joshua-decoder/fisher-callhome-corpus
The Fisher and CALLHOME Spanish–English Speech Translation Corpus
deepgram/genesys-voicebot-example
A minimal proof-of-concept voicebot built with Deepgram's Genesys AudioHook integration