spxen

speech enhancement based on dsp and deep learning

ChinaBeijing

Pinned Repositories

asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
Language:Python00
beamforming
Matlab files for various types of beamforming
Language:MATLAB0 0 00
Conv-TasNet
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement
Language:Python0 0 00
crepe
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)
Language:Python00
ddsp-pytorch
Implementation of DDSP (PyTorch), Differentiable Digital Signal Processing (ICLR 2020)
Language:Python00
DeepComplexUNetPyTorch
Implementation of Deep Complex UNet Using PyTorch
Language:Python0 0 00
dereverberate
An implementation of a sound dereverberation algorithm by Gilbert Soulodre
Language:MATLAB00
DNS-Challenge
This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of this challenge will use the scripts from this repo to create data to train their noise suppressors. They will compare their method with our baseline noise suppressor and report the results.
00
wav-file
wav file read/write lib and tools
Language:C10
webrtc
Forked from https://webrtc.googlesource.com/src
Language:C++10

spxen's Repositories

spxen/wav-file
wav file read/write lib and tools
Language:C10
spxen/webrtc
Forked from https://webrtc.googlesource.com/src
Language:C++10
spxen/asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
Language:Python00
spxen/beamforming
Matlab files for various types of beamforming
Language:MATLAB0 0 00
spxen/Conv-TasNet
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement
Language:Python0 0 00
spxen/crepe
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)
Language:Python00
spxen/ddsp-pytorch
Implementation of DDSP (PyTorch), Differentiable Digital Signal Processing (ICLR 2020)
Language:Python00
spxen/DeepComplexUNetPyTorch
Implementation of Deep Complex UNet Using PyTorch
Language:Python0 0 00
spxen/DNS-Challenge
This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of this challenge will use the scripts from this repo to create data to train their noise suppressors. They will compare their method with our baseline noise suppressor and report the results.
00
spxen/Dual-Path-RNN-Pytorch
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch
Language:Python0 0 00
spxen/dual-path-RNNs-DPRNNs-based-speech-separation
A PyTorch implementation of dual-path RNNs (DPRNNs) based speech separation described in "Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation".
spxen/fdndlp
A speech dereverberation algorithm, also called wpe
spxen/gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
spxen/GPV
Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper
spxen/meta-tasnet
A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation
spxen/nara_wpe
Different implementations of "Weighted Prediction Error" for speech dereverberation
spxen/NISQA
NISQA - Non-Intrusive Speech Quality Assessment
spxen/pytorch_stoi
STOI loss function in PyTorch
spxen/rir_simulator_python
Room impulse response simulator using python
spxen/rnnoise-wav
Recurrent neural network for audio noise reduction
Language:C
spxen/sed-crnn
Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection winning method.
spxen/segan_pytorch
Speech Enhancement Generative Adversarial Network in PyTorch
spxen/setk
Tools for Speech Enhancement integrated with Kaldi
spxen/Solo
Agora Solo is an open source speech codec, it was developed based on Silk with BWE(Bandwidth Extension) and MDC(Multi Description Coding). With these technologies, Solo is enable to resist weak networks at low bitrates.
Language:C
spxen/Speech-Separation-Paper-Tutorial
A must-read paper for speech separation based on neural networks
0 0
spxen/Speech_Enhancement_DNN_NMF
Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF
spxen/TAC
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
spxen/torch-stft
An STFT/iSTFT for PyTorch.
spxen/voice-filter
A unofficial Pytorch implementation of Google's VoiceFilter
spxen/Wave-U-Net-for-Speech-Enhancement
Implement [Wave-U-Net](https://arxiv.org/abs/1806.03185) by PyTorch, and migrate it to the speech enhancement area.