Pinned Repositories
accent-evaluation
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Automatic-Prosody-Annotator-with-SSWP-CLAP
An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).
autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
fish-speech
Brand new TTS solution
jinzuomuzhong.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
jzmzhong.github.io
TTS_Front_End
jzmzhong's Repositories
jzmzhong/Automatic-Prosody-Annotator-with-SSWP-CLAP
An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).
jzmzhong/TTS_Front_End
jzmzhong/accent-evaluation
jzmzhong/coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
jzmzhong/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
jzmzhong/autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
jzmzhong/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
jzmzhong/fish-speech
Brand new TTS solution
jzmzhong/jinzuomuzhong.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
jzmzhong/jzmzhong.github.io
jzmzhong/marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
jzmzhong/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
jzmzhong/Non-Autoregressive-TTS
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
jzmzhong/speechbrain
A PyTorch-based Speech Toolkit
jzmzhong/YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
jzmzhong/ppgs
High-Fidelity Neural Phonetic Posteriorgrams
jzmzhong/qualtreats
Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.
jzmzhong/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
jzmzhong/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io