HKAB's Stars
2noise/ChatTTS
A generative speech model for daily dialogue.
karpathy/LLM101n
LLM101n: Let's build a Storyteller
roboflow/supervision
We write your reusable computer vision tools. 💜
aristocratos/btop
A monitor of resources
overleaf/overleaf
A web-based collaborative LaTeX editor
kyutai-labs/moshi
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
sjpiper145/MakerSkillTree
A repository of Maker Skill Trees and templates to make your own.
ChrisBuilds/terminaltexteffects
TerminalTextEffects (TTE) is a terminal visual effects engine, application, and Python library.
libAudioFlux/audioFlux
A library for audio and music analysis and feature extraction.
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
zhengkyl/qrframe
Code-based QR code designer
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
QwenLM/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
pemistahl/lingua-py
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
k2-fsa/icefall
olcf/cuda-training-series
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
clu0/unet.cu
UNet diffusion model in pure CUDA
nyrahealth/CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
google-ai-edge/ai-edge-torch
Supporting PyTorch models with the Google AI Edge TFLite runtime.
facebookresearch/muavic
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation
microsoft/onnxruntime-training-examples
Examples for using ONNX Runtime for model training.
te42kyfo/gpu-benches
collection of benchmarks to measure basic GPU capabilities
leduckhai/MultiMed
Multilingual Multitask Multipurpose Medical Speech Recognition
CisMine/Guide-NVIDIA-Tools
NVIDIA tools guide
echocatzh/conv-stft
An STFT/iSTFT written in PyTorch using 1D convolutions
leimao/Nsight-Compute-Docker-Image
Nsight Compute in Docker
tuyen-tran1/VN-SLU
Vietnamese Spoken Language Understanding