gongchenghhu

Tianjin University

Pinned Repositories

asr_guided_tacotron
Use las to enhance the performance of tacotron, especially at the lack of the speaker labels.
Language:Python0 0 00
auorange
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
Language:Python1 0 00
Bert-VITS2
vits2 backbone with bert
Language:Python0 0 00
BigCiDian
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
Language:Python0 0 00
bit-rnn
Quantize weights and activations in Recurrent Neural Networks.
Language:Python0 0 00
ICASSP2021_demo
Language:HTML1 1 01
ICASSP2022_demo
Language:HTML1 1 01
TacoLPCNet-demo
Language:HTML2 1 12
TASLP
Language:HTML4 2 10
zhrtvc
Chinese real time voice cloning (VC) and Chinese text to speech (TTS). 好用的中文语音克隆兼中文语音合成系统，包含语音编码器、语音合成器、声码器和可视化模块。
Language:Python1 0 00

gongchenghhu's Repositories

gongchenghhu/TASLP
Language:HTML4 2 10
gongchenghhu/ICASSP2022_demo
Language:HTML1 1 01
gongchenghhu/Bert-VITS2
vits2 backbone with bert
Language:Python0 0 00
gongchenghhu/Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
Language:Python0 0
gongchenghhu/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based TTS, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS.
Language:Python0 0
gongchenghhu/emotion2vec
Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language:Python0 0
gongchenghhu/emotional-vits
无需情感标注的情感可控语音合成模型，基于VITS
Language:Jupyter Notebook0 0
gongchenghhu/espnet
End-to-End Speech Processing Toolkit
Language:Python0 0
gongchenghhu/few-shot-transformer-tts
Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.
Language:Python0 0
gongchenghhu/FG-transformer-TTS
Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.
Language:Python0 0
gongchenghhu/gongchenghhu.github.io
Portuguese audio samples
gongchenghhu/house
有完整版的PDF下载。
Language:Java0 0
gongchenghhu/IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
gongchenghhu/iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
Language:Python0 0
gongchenghhu/Italian-demo
Low resources results for Italian
Language:HTML1 0
gongchenghhu/MSMC-TTS
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS
Language:Python0 0
gongchenghhu/NANSY
Language:Python0 0
gongchenghhu/Polish-demo
Language:HTML1 0
gongchenghhu/Portuguese.github.io
Portuguese audio samples
Language:HTML1 0
gongchenghhu/PortugueseAudios
Language:HTML1 0
gongchenghhu/PortugueseAudios.github.io
Language:HTML
gongchenghhu/radtts
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over of Low Dimensional (F0 and Energy) Speech Attributes.
Language:Roff0 0
gongchenghhu/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Language:Python0 0
gongchenghhu/supplementary-results
Language:HTML1 0
gongchenghhu/TASLP-demo
Multi-lingual and multi-speaker audios
Language:HTML1 0
gongchenghhu/test
1 0
gongchenghhu/w2v2-how-to
How to use our public wav2vec2 dimensional emotion model
Language:Jupyter Notebook0 0
gongchenghhu/wav2vec2-codebook-indices
Language:Python0 0
gongchenghhu/ZeroSpeech
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
Language:Python0 0
gongchenghhu/ZMM-TTS
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
Language:C0 0