boriq's Stars
vinta/awesome-python
An opinionated list of awesome Python frameworks, libraries, software and resources.
Lightning-AI/pytorch-lightning
Pretrain and finetune ANY AI model of ANY size on multiple GPUs or TPUs with zero code changes.
iterative/dvc
🦉 Data Versioning and ML Experiments
kkroening/ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
spotify/pedalboard
🎛 🔊 A Python library for audio.
Lyken17/pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
sovrasov/flops-counter.pytorch
FLOPs counter for convolutional networks in the PyTorch framework
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
facebookresearch/svoice
We provide a PyTorch implementation of the paper "Voice Separation with an Unknown Number of Multiple Speakers", in which we present a new method for separating a mixed audio sequence where multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps while keeping the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is used to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
JusperLee/Speech-Separation-Paper-Tutorial
A must-read paper for speech separation based on neural networks
NVIDIA/nv-wavenet
Reference implementation of real-time autoregressive wavenet inference
kaituoxu/Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
nussl/nussl
A flexible source separation library in Python
Audio-WestlakeU/FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
facebookresearch/music-translation
A UNIVERSAL MUSIC TRANSLATION NETWORK - a method for translating music across musical instruments and styles.
etzinis/sudo_rm_rf
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.
tky823/DNN-based_source_separation
A PyTorch implementation of DNN-based source separation.
AvivBick/awesome-ssm-ml
Reading list for research topics in state-space models
danmic/av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
xcmyz/FastVocoder
Includes Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN; NHV may be added in the future.
mbinkowski/DeepSpeechDistances
Authors' implementation of DeepSpeech Distances.
afourast/avobjects
Implementation for the ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
SRPOL-AUI/storir
anton-jeran/IR-GAN
Augmenting Room Impulse Response
fgnt/graph_pit
JarCme/MIRaGe_Utils