dariadiatlova's Stars
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Plachtaa/VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
state-spaces/s4
Structured state space sequence models
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
lucidrains/lion-pytorch
🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch
csteinmetz1/ai-audio-startups
Community list of startups working with AI in audio and music technology
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
xiph/LPCNet
Efficient neural speech synthesis
DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
OlaWod/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
heatz123/naturalspeech
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
rishikksh20/VocGAN
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
chomeyama/SiFiGAN
Official implementation of the source-filter HiFiGAN vocoder
adobe-research/MetaAF
Control adaptive filters with neural networks.
keonlee9420/StyleSpeech
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
keonlee9420/Cross-Speaker-Emotion-Transfer
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
MelissaChen15/control-vc
This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"
ubisoft/ubisoft-laforge-daft-exprt
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
ttslr/StrengthNet
[INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
microsoft/PLC-Challenge
This repo contains required files for the INTERSPEECH 2022 Audio Deep Packet Loss Concealment (PLC) Challenge.
seahore/PPG-GradVC
A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis
hetpandya/youtube_tts_data_generator
A python library to generate speech dataset from Youtube videos
fmu2/NICE
PyTorch implementation of NICE
ZiangLong/LPCNet_pytorch
A Pytorch version of LPCNet, including dump weight
Guanyuansheng/TFGAN-PLC
A Temporal-Spectral Generative Adversarial Network based End-to-end Packet Loss Concealment for Wideband Speech Transmission
Crystalsound/FRN
elephantmipt/annotated-s4
LRU
SpirinEgor/llm_inference_bot
Simple LLM inference for VK & Telegram bots