Pinned Repositories
fish-speech
Brand new TTS solution
CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
milvus
A cloud-native vector database, storage for next generation AI applications
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
ddd_arxiv
wjddd.github.io