gabelev's Stars
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
deezer/spleeter
Deezer source separation library including pretrained models.
jiaaro/pydub
Manipulate audio with a simple and easy high level interface
facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
skorch-dev/skorch
A scikit-learn compatible neural network library that wraps PyTorch
riffusion/riffusion-hobby
Stable diffusion for real-time music generation
lululxvi/deepxde
A library for scientific machine learning and physics-informed learning
Stability-AI/stable-audio-tools
Generative models for conditional audio generation
alessiodm/drl-zh
Deep Reinforcement Learning: Zero to Hero!
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
Mangio621/Mangio-RVC-Fork
*CREPE+HYBRID TRAINING* A very experimental fork of the Retrieval-based-Voice-Conversion-WebUI repo that incorporates a variety of other f0 methods, along with a hybrid f0 nanmedian method.
teticio/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
Natooz/MidiTok
MIDI / symbolic music tokenizers for Deep Learning models 🎶
maum-ai/faceshifter
Unofficial PyTorch Implementation for FaceShifter (https://arxiv.org/abs/1912.13457)
wesbz/SoundStream
This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf
symphonynet/SymphonyNet
Symphony Generation with Permutation Invariant Language Model
musikalkemist/pytorchforaudio
Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel.
areski/django-audiofield
Django-Audiofield is a simple app that allows audio file upload, management, and conversion to different audio formats (mp3, wav & ogg), and makes it easy to play audio files in your Django application.
TuneNN/TuneNN
A transformer-based network model for pitch detection
POZAlabs/ComMU-code
[NeurIPS'22] Official code of "ComMU: Dataset for Combinatorial Music Generation"
Oncorporation/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Sanoojan/REFace
This repository gives the official implementation of Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models (WACV 2025)
sakemin/cog-musicgen-fine-tuner
This is a cog implementation of the fine-tuner for Meta's MusicGen
jech2/YM2413-MDB
80s FM video game music dataset
DeepSpectrum/DeepSpectrumLite
Light-weight transfer learning framework for on-device speech and audio recognition using pre-trained image convolutional neural networks.
RevoSpeechTech/audio-generation-papers
Recent audio generation papers (including speech, music, and general audio)
Vio-Chung/Rap-Speech-Classification
My music tech thesis prototype, as well as a recent class project
clarenceluo78/singer-adaptive-svc
This repository contains the implementation of the project "Converting to Realistic Professional Singing Voices with Singer-Adaptive Representations" and some baselines for the Singing Voice Conversion (SVC) task.
unis-ing/lorenz-parameter-learning
taylorbn/linear_nudging