vryhor

vryhor's Stars

s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language: Python, 2.3k stars, 485 forks
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Language:Jupyter Notebook72288
IDRnD/VoxTube
The VoxTube dataset official repository
Language:HTML601
corl-team/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Language:Python47720
ldong1111/GraphemeBERT
This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models".
Language:Python457
lingjzhu/CharsiuG2P
Multilingual G2P in 100 languages
Language:Jupyter Notebook28525
Den4ikAI/Anfice-chatbot
Dialogue system based on FRED-T5
Language:Python343
JunityZhan/Understanding-VITS
In this repository, you will learn how the code works in VITS (Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech) through Jupyter Notebooks, covering data normalization, the training process, the inference process, and the model's details.
Language:Jupyter Notebook15924
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language: Python, 25.8k stars, 4.8k forks
microsoft/NeuralSpeech
Language: Python, 1.4k stars, 183 forks
bootphon/phonemizer
Simple text to phones converter for multiple languages
Language: Python, 1.2k stars, 172 forks
Tomiinek/Blizzard2013_Segmentation
Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.
Language:Shell449
facebookresearch/CPC_audio
An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
Language:Python34861
YatingMusic/ddsp-singing-vocoders
Official implementation of SawSing (ISMIR'22)
Language:Python25437
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Language: Python, 37k stars, 3.2k forks
Rongjiehuang/ProDiff
PyTorch implementation of ProDiff (ACM-MM'22) with an extremely fast diffusion speech synthesis pipeline
Language:Python43255