Pinned Repositories
KazEmoTTS
An open-source Kazakh Emotional Text-to-Speech Dataset
edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
KAN-TTS
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
KazEmoTTS
An open-source Kazakh Emotional Text-to-Speech Dataset
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
LipVoicer
Official Code implementation for the ICLR paper "LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading"
MyBeautiful-Fantasy's Repositories
MyBeautiful-Fantasy/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
MyBeautiful-Fantasy/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
MyBeautiful-Fantasy/KAN-TTS
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
MyBeautiful-Fantasy/KazEmoTTS
An open-source Kazakh Emotional Text-to-Speech Dataset
MyBeautiful-Fantasy/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
MyBeautiful-Fantasy/WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
MyBeautiful-Fantasy/whisper
Robust Speech Recognition via Large-Scale Weak Supervision