Pinned Repositories
asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
beamforming
Matlab files for various types of beamforming
Conv-TasNet
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement
crepe
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)
ddsp-pytorch
Implementation of DDSP (PyTorch), Differentiable Digital Signal Processing (ICLR 2020)
DeepComplexUNetPyTorch
Implementation of Deep Complex UNet Using PyTorch
dereverberate
An implementation of a sound dereverberation algorithm by Gilbert Soulodre
DNS-Challenge
This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of this challenge will use the scripts from this repo to create data to train their noise suppressors. They will compare their method with our baseline noise suppressor and report the results.
wav-file
wav file read/write lib and tools
webrtc
Forked from https://webrtc.googlesource.com/src
spxen's Repositories
spxen/wav-file
wav file read/write lib and tools
spxen/webrtc
Forked from https://webrtc.googlesource.com/src
spxen/asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
spxen/beamforming
Matlab files for various types of beamforming
spxen/Conv-TasNet
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement
spxen/crepe
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)
spxen/ddsp-pytorch
Implementation of DDSP (PyTorch), Differentiable Digital Signal Processing (ICLR 2020)
spxen/DeepComplexUNetPyTorch
Implementation of Deep Complex UNet Using PyTorch
spxen/DNS-Challenge
This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of this challenge will use the scripts from this repo to create data to train their noise suppressors. They will compare their method with our baseline noise suppressor and report the results.
spxen/Dual-Path-RNN-Pytorch
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch
spxen/dual-path-RNNs-DPRNNs-based-speech-separation
A PyTorch implementation of dual-path RNNs (DPRNNs) based speech separation described in "Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation".
spxen/fdndlp
A speech dereverberation algorithm, also called wpe
spxen/gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
spxen/GPV
Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper
spxen/meta-tasnet
A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation
spxen/nara_wpe
Different implementations of "Weighted Prediction Error" for speech dereverberation
spxen/NISQA
NISQA - Non-Intrusive Speech Quality Assessment
spxen/pytorch_stoi
STOI loss function in PyTorch
spxen/rir_simulator_python
Room impulse response simulator using python
spxen/rnnoise-wav
Recurrent neural network for audio noise reduction
spxen/sed-crnn
Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection winning method.
spxen/segan_pytorch
Speech Enhancement Generative Adversarial Network in PyTorch
spxen/setk
Tools for Speech Enhancement integrated with Kaldi
spxen/Solo
Agora Solo is an open source speech codec, it was developed based on Silk with BWE(Bandwidth Extension) and MDC(Multi Description Coding). With these technologies, Solo is enable to resist weak networks at low bitrates.
spxen/Speech-Separation-Paper-Tutorial
A must-read paper for speech separation based on neural networks
spxen/Speech_Enhancement_DNN_NMF
Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF
spxen/TAC
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
spxen/torch-stft
An STFT/iSTFT for PyTorch.
spxen/voice-filter
A unofficial Pytorch implementation of Google's VoiceFilter
spxen/Wave-U-Net-for-Speech-Enhancement
Implement [Wave-U-Net](https://arxiv.org/abs/1806.03185) by PyTorch, and migrate it to the speech enhancement area.