Labmem-Zhouyx

Focus on TTS/Speech/NLP. El Psy Congroo

Tsinghua UniversityShenzhen, Guangdong

Labmem-Zhouyx's Stars

CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Jupyter Notebook11.6k1.5k
divymurli/VAEs
Variational autoencoders: VAE, gaussian mixture VAE (GMVAE), and a basic ladder VAE (LVAE)
Language:Jupyter Notebook4717
ShannonAI/ChineseBert
Code for ACL 2021 paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information"
Language:Python54092
Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
Language:Python6215
CoinCheung/pytorch-loss
label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful
Language:Python2.2k374
NeuroWave-ai/CUCVAE-TTS
Language:Python256
NATSpeech/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Language:Python96599
yerfor/SyntaSpeech
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code
Language:Python19330
yl4579/StarGANv2-VC
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
Language:Python478107
thuhcsi/tacotron
PyTorch implementation of Tacotron and Tacotron2
Language:Python3212
Labmem-Zhouyx/CDFSE_FastSpeech2
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis”
Language:Python8112
MLNLP-World/Paper-Writing-Tips
MLNLP社区用来帮助大家避免论文投稿小错误的整理仓库。 Paper Writing Tips
3.5k454
resemble-ai/Resemblyzer
A python package to analyze and compare voices with deep learning
Language:Python2.7k424
thuhcsi/VAENAR-TTS
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
Language:Python14520
thuhcsi/SpanPSP
Language:Python7419
keonlee9420/STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021
Language:Python15831
Labmem-Zhouyx/FastSpeech2
Language:Python32
RookieJunChen/FullSubNet-plus
The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
Language:Python23755
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python19.6k2.5k
MontrealCorpusTools/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
Language:Python1.3k243
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language:Python1.9k504
SungFeng-Huang/Meta-TTS
Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.
Language:Python18433
pettarin/forced-alignment-tools
A collection of links and notes on forced alignment tools
Language:Python86686
ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language:Python1.8k527
Labmem-Zhouyx/GNN_SemanticTaco2
The code of "Dependency Parsing based Semantic Representation Learning with Graph Neural Network for Enhancing Expressiveness of Text-to-Speech"
Language:Python24
Labmem-Zhouyx/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Language:Jupyter Notebook1
Labmem-Zhouyx/audio2mel_preprocessor
A tool for speech dataset to mel-spectrogram.
Language:Python1
Labmem-Zhouyx/PyTorch_character_CN_Taco2
A PyTorch inplementation of character-based Tacotron2 for Chinese/Mandarin
Language:Python11
Labmem-Zhouyx/PyTorch_phoneme_CN_Taco2
A PyTorch inplementation of phoneme-based Tacotron2 for Chinese/Mandarin
Language:Python2
Labmem-Zhouyx/bert_phoneme_CN_Taco2
Language:Python52

Labmem-Zhouyx

Labmem-Zhouyx's Stars

CompVis/latent-diffusion

divymurli/VAEs

ShannonAI/ChineseBert

Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022

CoinCheung/pytorch-loss

NeuroWave-ai/CUCVAE-TTS

NATSpeech/NATSpeech

yerfor/SyntaSpeech

yl4579/StarGANv2-VC

thuhcsi/tacotron

Labmem-Zhouyx/CDFSE_FastSpeech2

MLNLP-World/Paper-Writing-Tips

resemble-ai/Resemblyzer

thuhcsi/VAENAR-TTS

thuhcsi/SpanPSP

keonlee9420/STYLER

Labmem-Zhouyx/FastSpeech2

RookieJunChen/FullSubNet-plus

microsoft/unilm

MontrealCorpusTools/Montreal-Forced-Aligner

jik876/hifi-gan

SungFeng-Huang/Meta-TTS

pettarin/forced-alignment-tools

ming024/FastSpeech2

Labmem-Zhouyx/GNN_SemanticTaco2

Labmem-Zhouyx/ParallelWaveGAN

Labmem-Zhouyx/audio2mel_preprocessor

Labmem-Zhouyx/PyTorch_character_CN_Taco2

Labmem-Zhouyx/PyTorch_phoneme_CN_Taco2

Labmem-Zhouyx/bert_phoneme_CN_Taco2