wjddd

Pinned Repositories

fish-speech
Brand new TTS solution
Language: Python 14.4k 97 4071.1k
CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language: Python 6.2k 59 503669
milvus
A cloud-native vector database, storage for next generation AI applications
Language: Go 30.4k 283 12.1k 2.9k
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
Language: Python 1.3k 36 716248
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python 7.4k 62 153631
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language: Jupyter Notebook 7.6k 78 189562
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language: Python 35.6k 210 1.3k 4k
ddd_arxiv
Language: CSS 0 1 0 0
wjddd.github.io
0 1 0 0

wjddd's Repositories

wjddd/ddd_arxiv
Language: CSS 0 1 0 0
wjddd/wjddd.github.io
0 1 0 0