vryhor's Stars
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
IDRnD/VoxTube
The VoxTube dataset official repository
corl-team/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
ldong1111/GraphemeBERT
This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models
lingjzhu/CharsiuG2P
Multilingual G2P in 100 languages
Den4ikAI/Anfice-chatbot
Диалоговая система на базе FRED-T5
JunityZhan/Understanding-VITS
In this repository, you will learn how code works in VITS(Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech) in Jupyter Notebooks, including normalizing data, training process, inference process, and model's details.
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
microsoft/NeuralSpeech
bootphon/phonemizer
Simple text to phones converter for multiple languages
Tomiinek/Blizzard2013_Segmentation
Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.
facebookresearch/CPC_audio
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
YatingMusic/ddsp-singing-vocoders
Official implementation of SawSing (ISMIR'22)
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Rongjiehuang/ProDiff
PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline