wownaoh9's Stars
ga642381/FastSpeech2
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:
keonlee9420/Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
revsic/torch-nansypp
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
elvisyjlin/RelGAN-PyTorch
RelGAN: Multi-Domain Image-to-Image Translation via Relative Attributes
chaitanya100100/Relative-Attributes-Zero-Shot-Learning
Python Implementation of Visual Relative Attributes for Image Classification and Zero Shot Learning
WShijun1991/ICASSP2023_DEMO
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
lucidrains/voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Executedone/Chinese-FastSpeech2
基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏
Jackson-Kang/MFARunner
A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.
zjwang21/mix-phoneme-bert
An unofficial PyTorch implementation of Mix-Phoneme-Bert
microsoft/NeuralSpeech
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch