Pinned Repositories
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Bert-VITS2
VITS2 backbone with multilingual BERT
CosyVoice
LLM-based TTS model providing full-stack inference, training, and deployment capabilities.
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
LAC
RoPE
speech-synthesis-papers
TTS-Evaluation
Evaluation metrics for TTS models.
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit
Shengqiang-Li's Repositories
Shengqiang-Li/TTS-Evaluation
Evaluation metrics for TTS models.
Shengqiang-Li/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Shengqiang-Li/LAC
Shengqiang-Li/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Shengqiang-Li/Bert-VITS2
VITS2 backbone with multilingual BERT
Shengqiang-Li/CosyVoice
LLM-based TTS model providing full-stack inference, training, and deployment capabilities.
Shengqiang-Li/RoPE
Shengqiang-Li/speech-synthesis-papers
Shengqiang-Li/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Shengqiang-Li/Wenet-avsr