dariadiatlova

voice dl researcher

@deepvkSaint-Petersburg

dariadiatlova's Stars

Textualize/rich
Rich is a Python library for rich text and beautiful formatting in the terminal.
Language:Python48.8k 536 1.3k1.7k
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language:Python7.2k 63 149614
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
Language:Python3.7k 77 123644
Stability-AI/stable-audio-tools
Generative models for conditional audio generation
Language:Python2.5k 43 82231
lucidrains/voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
Language:Python587 51 2549
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Language:Jupyter Notebook586 16 6174
TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Language:Python579 4 80111
facebookresearch/textlesslib
Library for Textless Spoken Language Processing
Language:Python518 16 2450
audeering/w2v2-how-to
How to use our public wav2vec2 dimensional emotion model
Language:Jupyter Notebook432 9 1647
X-LANCE/VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
Language:Python278 16 1320
dongzhuoyao/awesome-flow-matching
A summary of related works about flow matching, stochastic interpolants
257 10 19
fschmid56/EfficientAT
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.
Language:Python215 5 2740
p0p4k/pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
Language:Python205 14 4230
jishengpeng/Languagecodec
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models
Language:Python202 8 716
sony/bigvsan
Pytorch implementation of BigVSAN
Language:Python196 29 616
keonlee9420/DailyTalk
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
Language:Python194 7 313
corl-team/rebased
Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"
Language:Python151 5 44
X-LANCE/UniCATS-CTX-vec2wav
[AAAI 2024] Code for CTX-vec2wav in UniCATS
Language:Python115 10 916
nii-yamagishilab/ZMM-TTS
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
Language:C107 5 68
theodorblackbird/lina-speech
lina-speech : linear attention based text-to-speech
Language:Jupyter Notebook106 12 79
seastar105/pflow-encodec
Implementation of TTS model based on NVIDIA P-Flow TTS Paper
Language:Python64 6 65
DanielLin94144/StyleTalk
Official release of StyleTalk dataset.
53 7 12
shang0712/HierTTS
Language:Python44 7 310
ECNU-Cross-Innovation-Lab/ShiftSER
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
Language:Python33 2 22
deepvk/NISQA-s
Language:Python31 2 20
Lallapallooza/fast-audiomentations
⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
Language:Python31 3 01
nivibilla/efficient-vits-finetuning
Finetuning VITS Efficiently
Language:Python31 4 36
EMOsuperb/EMO-SUPERB-submission
EMO-SUPERB submission
Language:Python25 4 01
MSP-UTD/MSP-Podcast_Challenge
MSP-Podcast Challenge Baseline Code
Language:Python115
deepvk/muse
🎵 muse: Music Separation
Language:Python10 3 01

dariadiatlova

dariadiatlova's Stars

Textualize/rich

netease-youdao/EmotiVoice

metavoiceio/metavoice-src

Stability-AI/stable-audio-tools

lucidrains/voicebox-pytorch

shivammehta25/Matcha-TTS

TaoRuijie/ECAPA-TDNN

facebookresearch/textlesslib

audeering/w2v2-how-to

X-LANCE/VoiceFlow-TTS

dongzhuoyao/awesome-flow-matching

fschmid56/EfficientAT

p0p4k/pflowtts_pytorch

jishengpeng/Languagecodec

sony/bigvsan

keonlee9420/DailyTalk

corl-team/rebased

X-LANCE/UniCATS-CTX-vec2wav

nii-yamagishilab/ZMM-TTS

theodorblackbird/lina-speech

seastar105/pflow-encodec

DanielLin94144/StyleTalk

shang0712/HierTTS

ECNU-Cross-Innovation-Lab/ShiftSER

deepvk/NISQA-s

Lallapallooza/fast-audiomentations

nivibilla/efficient-vits-finetuning

EMOsuperb/EMO-SUPERB-submission

MSP-UTD/MSP-Podcast_Challenge

deepvk/muse