Pinned Repositories
Awesome-Deep-Neural-Network-Compression
Summary, Code for Deep Neural Network Quantization
awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
CFLOW_VC_demo
Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
data
Deep-Compression-PyTorch
PyTorch implementation of 'Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding' by Song Han, Huizi Mao, William J. Dally
demo-test
emotional-vits
无需情感标注的情感可控语音合成模型,基于VITS
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
bigdan12's Repositories
bigdan12/Awesome-Deep-Neural-Network-Compression
Summary, Code for Deep Neural Network Quantization
bigdan12/awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
bigdan12/CFLOW_VC_demo
bigdan12/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
bigdan12/css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
bigdan12/data
bigdan12/Deep-Compression-PyTorch
PyTorch implementation of 'Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding' by Song Han, Huizi Mao, William J. Dally
bigdan12/demo-test
bigdan12/emotional-vits
无需情感标注的情感可控语音合成模型,基于VITS
bigdan12/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
bigdan12/glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
bigdan12/LQ-Nets
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks
bigdan12/MSMC-TTS
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS
bigdan12/NonAttentiveTacotron
bigdan12/parrots
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高
bigdan12/product-quantization
🙃Implementation of vector quantization algorithms, codes for Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner Product Search..
bigdan12/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
bigdan12/SFBgan-VC-demo
bigdan12/SpanPSP
bigdan12/test
demo
bigdan12/test-d
bigdan12/transferlearning
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
bigdan12/tts-gan
TTS-GAN: A Transformer-based Time-Series Generative Adversarial Network
bigdan12/VAE-CVAE-MNIST
Variational Autoencoder and Conditional Variational Autoencoder on MNIST in PyTorch
bigdan12/vits_chinese
vits chinese, tts chinese, tts mandarin 史上训练最简单,音质最好的语音合成系统
bigdan12/VoiceConversionLab
Collect Voice Conversion researches
bigdan12/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.