Pinned Repositories
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Automatic-Prosody-Annotator-with-SSWP-CLAP
An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).
coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
jzmzhong.github.io
marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Non-Autoregressive-TTS
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
speechbrain
A PyTorch-based Speech Toolkit
TTS_Front_End
jzmzhong's Repositories
jzmzhong/Automatic-Prosody-Annotator-with-SSWP-CLAP
An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).
jzmzhong/TTS_Front_End
jzmzhong/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
jzmzhong/coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
jzmzhong/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
jzmzhong/jzmzhong.github.io
jzmzhong/marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
jzmzhong/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
jzmzhong/Non-Autoregressive-TTS
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
jzmzhong/speechbrain
A PyTorch-based Speech Toolkit
jzmzhong/YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
jzmzhong/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
jzmzhong/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io