ZhengRachel
Final-year master's student at NERC-SLIP, USTC.
University of Science and Technology of China, Hefei, China
Pinned Repositories
annotated-transformer
http://nlp.seas.harvard.edu/2018/04/03/attention.html
audio-visual-speech-enhancement
Diff-A2A
DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
DiffSinger
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
DiffVC_and_GradTTS
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Dive-into-DL-PyTorch
This project converts the MXNet implementations in the original book Dive into Deep Learning (《动手学深度学习》) to PyTorch.
End-to-End-VAD
Audio-visual voice activity detection using deep learning
UTIforAVSE-demo
Demo for "Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement Through Knowledge Distillation"
ZhengRachel's Repositories
ZhengRachel/UTIforAVSE-demo
Demo for "Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement Through Knowledge Distillation"
ZhengRachel/annotated-transformer
http://nlp.seas.harvard.edu/2018/04/03/attention.html
ZhengRachel/audio-visual-speech-enhancement
ZhengRachel/Diff-A2A
ZhengRachel/DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
ZhengRachel/DiffSinger
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
ZhengRachel/DiffVC_and_GradTTS
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
ZhengRachel/diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
ZhengRachel/Dive-into-DL-PyTorch
This project converts the MXNet implementations in the original book Dive into Deep Learning (《动手学深度学习》) to PyTorch.
ZhengRachel/End-to-End-VAD
Audio-visual voice activity detection using deep learning
ZhengRachel/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
ZhengRachel/ERVQ
Demo for paper "ERVQ: Enhancing Residual Vector Quantization in Audio Codecs through Intra- and Inter-Codebook Optimization".
ZhengRachel/ImprovedTaLNet-demo
Demo for Improved methods based on pseudo target generation and domain adversarial training for voice reconstruction from silent tongue and lip articulation.
ZhengRachel/IUTIforAVSE-demo
Demo for paper "Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement".
ZhengRachel/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
ZhengRachel/ProDiff
PyTorch implementation of ProDiff (ACM-MM'22) with an extremely fast diffusion speech synthesis pipeline
ZhengRachel/py_class_homework
Homework for the Python class at USTC.
ZhengRachel/py_class_homework2
Homework 2 for the Python class at USTC, due April 29th.
ZhengRachel/SpeakerRecognition_tutorial
Simple d-vector-based speaker recognition (verification and identification) using PyTorch
ZhengRachel/VisualVoice
Audio-Visual Speech Separation with Cross-Modal Consistency
ZhengRachel/VQ-VAE-Speech
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
ZhengRachel/wavegrad2
Unofficial PyTorch implementation of WaveGrad2
ZhengRachel/ZeroSpeech
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
ZhengRachel/zerospeech2020
Python package for the Zero Speech Challenge 2020
ZhengRachel/zhengrachel.github.io
ZhengRachel's HomePage (Forked from AcadHomepage: A Modern and Responsive Academic Personal Homepage)