mnfutao

Pinned Repositories

AdaIN-VC
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".
Language:Python00
adaptive_voice_conversion
Language:Python00
AttentionBasedProsodyPrediction
Encoder and Decoder and Attention Based Prosody Prediction
Language:Python0 0 00
autovc-official
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Language:Python0 0 00
autovc-unofficial_tw
An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".
Language:Python0 0 00
Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based TTS, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS.
Language:Python0 0 00
CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language:Python0 0 00
dvector
Speaker embedding (d-vector) trained with GE2E loss
Language:Python00
Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS (text to speech, speech synthesis) based on FastSpeech2, supporting English and Korean
Language:Python0 0 00
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language:Python0 0 00

mnfutao's Repositories

mnfutao/AdaIN-VC
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".
Language:Python00
mnfutao/adaptive_voice_conversion
Language:Python00
mnfutao/AttentionBasedProsodyPrediction
Encoder and Decoder and Attention Based Prosody Prediction
Language:Python0 0 00
mnfutao/autovc-official
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Language:Python0 0 00
mnfutao/autovc-unofficial_tw
An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".
Language:Python0 0 00
mnfutao/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based TTS, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS.
Language:Python0 0 00
mnfutao/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language:Python0 0 00
mnfutao/dvector
Speaker embedding (d-vector) trained with GE2E loss
Language:Python00
mnfutao/Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS (text to speech, speech synthesis) based on FastSpeech2, supporting English and Korean
Language:Python0 0 00
mnfutao/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language:Python0 0 00
mnfutao/FragmentVC
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
Language:Python0 0 00
mnfutao/fullstop-deep-punctuation-prediction
A model that predicts the punctuation of English, Italian, French and German texts.
mnfutao/g2p
g2p: English Grapheme To Phoneme Conversion
Language:Python0 0
mnfutao/GitHub-Chinese-Top-Charts
:cn: GitHub中文排行榜，各语言分设「软件 | 资料」榜单，精准定位中文好项目。各取所需，高效学习。
Language:Java0 0
mnfutao/Model_Fusion_Based_Prosody_Prediction
Model Fusion Based Prosody Prediction
Language:Python0 0
mnfutao/open-tts-tracker
0 0
mnfutao/Prosody_Prediction
Predict prosody labels for Chinese sentences.
mnfutao/punctuation_prediction
chinese sentence punctuation prediction，中文句子标点符号预测。
mnfutao/pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
Language:Python0 0
mnfutao/PyTSMod
An open-source Python library for audio time-scale modification.
Language:Python0 0
mnfutao/voicefixer
General Speech Restoration
mnfutao/VQMIVC
Official implementation of VQMIVC: One-shot Voice Conversion @ Interspeech 2021
mnfutao/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python0 0
mnfutao/ZMM-TTS
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations