liusongxiang
Works on spoken language processing: general audio synthesis, TTS, VC, SVS, SVC, etc.
miHoYo, Shenzhen, China
Pinned Repositories
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
BNE-Seq2SeqMoL-VC
Demo for "Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling"
diffsvc
DiffSVC demo page
efficient_tts
PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"
end2endAC
Audio samples for the paper "End-to-end Accent Conversion"
Large-Audio-Models
Keeps track of large models in the audio domain, including speech, singing, music, etc.
ppg-vc
PPG-Based Voice Conversion
speaker-verification-d-vector
Implementation of the state-of-the-art d-vector approach for speaker verification
StarGAN-Voice-Conversion
PyTorch implementation of the paper "StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks"
StyleTransferVC
Audio samples for the paper "Transferring Source Style in Non-Parallel Voice Conversion"
liusongxiang's Repositories
liusongxiang/Large-Audio-Models
Keeps track of large models in the audio domain, including speech, singing, music, etc.
liusongxiang/ppg-vc
PPG-Based Voice Conversion
liusongxiang/efficient_tts
PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"
liusongxiang/diffsvc
DiffSVC demo page
liusongxiang/BNE-Seq2SeqMoL-VC
Demo for "Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling"
liusongxiang/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
liusongxiang/bigvsan
PyTorch implementation of BigVSAN
liusongxiang/liusongxiang
liusongxiang/WaveGrad
Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
liusongxiang/liusongxiang.github.io
Personal homepage:
liusongxiang/.tmux
🇫🇷 Oh my tmux! My self-contained, pretty & versatile tmux configuration made with ❤️
liusongxiang/aishell-3-baseline-fc
The code for aishell-3 baseline acoustic model
liusongxiang/audiolm-pytorch
Implementation of AudioLM, a SOTA language modeling approach to audio generation from Google Research, in PyTorch
liusongxiang/cceyda
Short profile with some stats and keywords
liusongxiang/CPC_audio
An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
liusongxiang/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
liusongxiang/ForwardTacotron
⏩ Generating speech in a single forward pass without any attention!
liusongxiang/glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
liusongxiang/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
liusongxiang/HN-UnifiedSourceFilterGAN
liusongxiang/Mos-Render-Test
liusongxiang/Parselmouth
Praat in Python, the Pythonic way
liusongxiang/phonemizer
Simple text to phones converter for multiple languages
liusongxiang/rayeren.github.io
My personal homepage
liusongxiang/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
liusongxiang/syang1993.github.io
liusongxiang/timbre_painting
liusongxiang/VQMIVC
Official implementation of VQMIVC: One-shot Voice Conversion @ Interspeech 2021
liusongxiang/WavAugment
A library for speech data augmentation in time-domain
liusongxiang/xcmyz