heixiaoniu's Stars
espnet/espnet
End-to-End Speech Processing Toolkit
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Wendison/VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
cyhuang-tw/AdaIN-VC
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".
xunhuang1995/AdaIN-style
Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization
yistLin/FragmentVC
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
auspicious3000/autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
descriptinc/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
hhguo/EA-SVC
An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"
jiawei-ren/BalancedMetaSoftmax
[NeurIPS 2020] Code for the paper "Balanced Meta-Softmax for Long-Tailed Visual Recognition"
yl4579/StarGANv2-VC
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
Dr-Kevin-Zhang/Big-Data-Analysis-and-Application-Course
jazzsaxmafia/dcgan_tensorflow
Tensorflow implementation of "UNSUPERVISED REPRESENTATION LEARNING WITH DEEP CONVOLUTIONAL GENERATIVE ADVERSARIAL NETWORKS"
seaniezhao/torch_npss
pytorch implementation of Neural Parametric Singing Synthesizer 歌声合成
ranchlai/mandarin-tts
Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets
aidreamwin/TTS-Clone-Chinese
基于Real-Time-Voice-Cloning语音克隆中文普通话实现
YannickJadoul/Parselmouth
Praat in Python, the Pythonic way
xcmyz/FastSpeech
The Implementation of FastSpeech based on pytorch.
ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
NVIDIA/mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
mathigatti/midi2voice
Singing synthesis from MIDI file
MTG/WGANSing
Multi-voice singing voice synthesis
ibab/tensorflow-wavenet
A TensorFlow implementation of DeepMind's WaveNet paper
jackaduma/CycleGAN-VC2
Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
gerdm/prml
Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop
jiangyiqun233/PRML_learning
learning fomula
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
ShomyLiu/pytorch-relation-extraction
distant supervised relation extraction models: PCNN MIL (Zeng 2015), PCNN+ATT(Lin 2016). 关系抽取
NLP-LOVE/ML-NLP
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
AimeeLee77/keyword_extraction
利用Python实现中文文本关键词抽取,分别采用TF-IDF、TextRank、Word2Vec词聚类三种方法。