TeaPoly's Stars
ZuodaoTech/everyone-can-use-english
Everyone Can Use English
fishaudio/fish-speech
SOTA Open Source TTS
FunAudioLLM/CosyVoice
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
kyutai-labs/moshi
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
pytorch/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
gpt-omni/mini-omni
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
Tele-AI/TeleSpeech-ASR
microsoft/BitBLAS
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
xingchensong/S3Tokenizer
Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice
wenet-e2e/wesep
Target Speaker Extraction Toolkit