Pinned Repositories
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
AttentionBasedProsodyPrediction
Encoder-Decoder and Attention-Based Prosody Prediction
crepe
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)
css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
dctts-pytorch
A PyTorch implementation of DC-TTS
gst-tacotron
A TensorFlow implementation of "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
shangqwe123's Repositories
shangqwe123/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
shangqwe123/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
shangqwe123/crepe
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)
shangqwe123/css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
shangqwe123/ddsp-pytorch
Implementation of DDSP (PyTorch), Differentiable Digital Signal Processing (ICLR 2020)
shangqwe123/DeepLearningExamples
Deep Learning Examples
shangqwe123/diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
shangqwe123/End-to-end-ASR-Pytorch
This is an open-source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with PyTorch, the well-known deep learning toolkit.
shangqwe123/espnet
End-to-End Speech Processing Toolkit
shangqwe123/FastSpeech
An implementation of FastSpeech based on PyTorch.
shangqwe123/ForwardTacotron
⏩ Generating speech in a single forward pass without any attention!
shangqwe123/g2pC
g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese
shangqwe123/GAN-TTS
A PyTorch implementation of GAN-TTS: High Fidelity Speech Synthesis with Adversarial Networks
shangqwe123/glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
shangqwe123/gmvae_tacotron
Gaussian Mixture VAE Tacotron
shangqwe123/google-research
Google Research
shangqwe123/LPCNet
Efficient neural speech synthesis
shangqwe123/LPCTron
Tacotron2 + LPCNET for complete End-to-End TTS System
shangqwe123/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
shangqwe123/merlin
This is now the official location of the Merlin project.
shangqwe123/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with PyTorch
shangqwe123/pkuseg-python
pkuseg: a toolkit for multi-domain Chinese word segmentation
shangqwe123/shangqwe123.github.io
shangqwe123/Speaker_Embedding_Torch
PyTorch based speaker embedding model
shangqwe123/tacotron2
Forked from https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/SpeechSynthesis/Tacotron2 and merged with https://github.com/Rayhane-mamah/Tacotron-2
shangqwe123/torchcrepe
PyTorch implementation of the CREPE pitch tracker
shangqwe123/UniversalVocoding
A PyTorch implementation of "Robust Universal Neural Vocoding"
shangqwe123/VAEX
code f
shangqwe123/voice_conversion
shangqwe123/WGANSing
Multi-voice singing voice synthesis