dariadiatlova

voice dl researcher

@deepvkSaint-Petersburg

dariadiatlova's Stars

yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
Language:Python86.7k 508 8k6.8k
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook35.9k 331 4414.2k
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language:Python25.8k 178 1304.8k
Plachtaa/VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Language:Python4.7k 41 566711
state-spaces/s4
Structured state space sequence models
Language:Jupyter Notebook2.4k 52 135293
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Language:Python2.4k 42 107222
lucidrains/lion-pytorch
🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch
Language:Python2k 15 2349
csteinmetz1/ai-audio-startups
Community list of startups working with AI in audio and music technology
1.5k 67 5136
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Language:Python1.2k 26 76111
xiph/LPCNet
Efficient neural speech synthesis
Language:C1.1k 71 198295
DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
639 88 442
OlaWod/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Language:Python599 19 88110
heatz123/naturalspeech
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
Language:Python470 30 3468
rishikksh20/VocGAN
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Language:Python319 11 1861
chomeyama/SiFiGAN
Official implementation of the source-filter HiFiGAN vocoder
Language:Python233 9 1234
adobe-research/MetaAF
Control adaptive filters with neural networks.
Language:Python225 8 1638
keonlee9420/StyleSpeech
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
Language:Python190 6 1723
keonlee9420/Cross-Speaker-Emotion-Transfer
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Language:Python187 6 1727
MelissaChen15/control-vc
This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"
Language:Python127 9 1216
ubisoft/ubisoft-laforge-daft-exprt
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Language:Python123 7 1723
ttslr/StrengthNet
[INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
Language:Python79 6 311
microsoft/PLC-Challenge
This repo contains required files for the INTERSPEECH 2022 Audio Deep Packet Loss Concealment (PLC) Challenge.
Language:Python73 8 612
seahore/PPG-GradVC
A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis
Language:Python43 8 36
hetpandya/youtube_tts_data_generator
A python library to generate speech dataset from Youtube videos
Language:Python35 3 59
fmu2/NICE
PyTorch implementation of NICE
Language:Python33 4 28
ZiangLong/LPCNet_pytorch
A Pytorch version of LPCNet, including dump weight
Language:Python31 1 414
Guanyuansheng/TFGAN-PLC
A Temporal-Spectral Generative Adversarial Network based End-to-end Packet Loss Concealment for Wideband Speech Transmission
Language:Python30 1 1411
Crystalsound/FRN
Language:Python26 1 46
elephantmipt/annotated-s4
LRU
Language:Python1 0 00
SpirinEgor/llm_inference_bot
Simple LLM inference for VK & Telegram bots
Language:Python1 3 01

dariadiatlova

dariadiatlova's Stars

yt-dlp/yt-dlp

suno-ai/bark

svc-develop-team/so-vits-svc

Plachtaa/VITS-fast-fine-tuning

state-spaces/s4

haoheliu/AudioLDM

lucidrains/lion-pytorch

csteinmetz1/ai-audio-startups

descriptinc/descript-audio-codec

xiph/LPCNet

DmitryRyumin/INTERSPEECH-2023-24-Papers

OlaWod/FreeVC

heatz123/naturalspeech

rishikksh20/VocGAN

chomeyama/SiFiGAN

adobe-research/MetaAF

keonlee9420/StyleSpeech

keonlee9420/Cross-Speaker-Emotion-Transfer

MelissaChen15/control-vc

ubisoft/ubisoft-laforge-daft-exprt

ttslr/StrengthNet

microsoft/PLC-Challenge

seahore/PPG-GradVC

hetpandya/youtube_tts_data_generator

fmu2/NICE

ZiangLong/LPCNet_pytorch

Guanyuansheng/TFGAN-PLC

Crystalsound/FRN

elephantmipt/annotated-s4

SpirinEgor/llm_inference_bot