TeaPoly

Audio and Speech Processing

TeaPoly's Stars

ZuodaoTech/everyone-can-use-english
人人都能用英语
Language:TypeScript25.5k 283 4063.8k
fishaudio/fish-speech
SOTA Open Source TTS
Language:Python17.8k 110 4721.3k
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language:Python8.9k 85 642856
kyutai-labs/moshi
Language:Python7.1k 80 92550
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
Language:Python6.5k 52 1.1k641
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language:Python3.8k 41 160341
pytorch/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
Language:Python3.4k 38 349228
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Language:Python3.2k 101 123289
Tele-AI/TeleSpeech-ASR
Language:Python582 15 5752
microsoft/BitBLAS
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
Language:Python464 17 8234
xingchensong/S3Tokenizer
Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice
Language:Python209 10 1326
wenet-e2e/wesep
Target Speaker Extraction Toolkit
Language:Python137 6 916