Pinned Repositories
AdaIN-VC
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".
adaptive_voice_conversion
AttentionBasedProsodyPrediction
Encoder and Decoder and Attention Based Prosody Prediction
autovc-official
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
autovc-unofficial_tw
An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".
Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based TTS, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS.
CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
dvector
Speaker embedding (d-vector) trained with GE2E loss
Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS (text to speech, speech synthesis) based on FastSpeech2, supporting English and Korean
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
mnfutao's Repositories
mnfutao/AdaIN-VC
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".
mnfutao/adaptive_voice_conversion
mnfutao/AttentionBasedProsodyPrediction
Encoder and Decoder and Attention Based Prosody Prediction
mnfutao/autovc-official
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
mnfutao/autovc-unofficial_tw
An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".
mnfutao/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based TTS, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS.
mnfutao/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
mnfutao/dvector
Speaker embedding (d-vector) trained with GE2E loss
mnfutao/Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS (text to speech, speech synthesis) based on FastSpeech2, supporting English and Korean
mnfutao/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
mnfutao/FragmentVC
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
mnfutao/fullstop-deep-punctuation-prediction
A model that predicts the punctuation of English, Italian, French and German texts.
mnfutao/g2p
g2p: English Grapheme To Phoneme Conversion
mnfutao/GitHub-Chinese-Top-Charts
:cn: GitHub中文排行榜,各语言分设「软件 | 资料」榜单,精准定位中文好项目。各取所需,高效学习。
mnfutao/Model_Fusion_Based_Prosody_Prediction
Model Fusion Based Prosody Prediction
mnfutao/open-tts-tracker
mnfutao/Prosody_Prediction
Predict prosody labels for Chinese sentences.
mnfutao/punctuation_prediction
chinese sentence punctuation prediction,中文句子标点符号预测。
mnfutao/pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
mnfutao/PyTSMod
An open-source Python library for audio time-scale modification.
mnfutao/voicefixer
General Speech Restoration
mnfutao/VQMIVC
Official implementation of VQMIVC: One-shot Voice Conversion @ Interspeech 2021
mnfutao/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
mnfutao/ZMM-TTS
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations