speech-enhancement
There are 248 repositories under speech-enhancement topic.
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
espnet/espnet
End-to-End Speech Processing Toolkit
modelscope/ClearerVoice-Studio
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Rikorose/DeepFilterNet
Noise supression using deep filtering
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
resemble-ai/resemble-enhance
AI powered speech denoising and enhancement
haoheliu/voicefixer
General Speech Restoration
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
JusperLee/Speech-Separation-Paper-Tutorial
A must-read paper for speech separation based on neural networks
nanahou/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
breizhn/DTLN
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
k2kobayashi/sprocket
Voice Conversion Tool Kit
Audio-WestlakeU/FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
anicolson/DeepXi
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
schmiph2/pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
Xiaobin-Rong/gtcrn
The official implementation of GTCRN, an ultra-lightweight SE model.
double22a/speech_dataset
The dataset of Speech Recognition
yxlu-0102/MP-SENet
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
funcwj/setk
Tools for Speech Enhancement integrated with Kaldi
jzi040941/PercepNet
Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
yongxuUSTC/sednn
deep learning based speech enhancement using keras or pytorch, make it easy to use
shahules786/mayavoz
Pytorch based speech enhancement toolkit.
haoxiangsnr/Wave-U-Net-for-Speech-Enhancement
Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.
haoxiangsnr/A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement
A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch
seanwood/gcc-nmf
Real-time GCC-NMF Blind Speech Separation and Enhancement
aishoot/LSTM_PIT_Speech_Separation
Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.
fgnt/pb_bss
Collection of EM algorithms for blind source separation of audio signals
haoheliu/voicefixer_main
General Speech Restoration
AkojimaSLP/Beamforming-for-speech-enhancement
simple delaysum, MVDR and CGMM-MVDR
huyanxin/phasen
A unofficial Pytorch implementation of Microsoft's PHASEN
echocatzh/MTFAA-Net
Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement
jtkim-kaist/Speech-enhancement
Deep neural network based speech enhancement toolkit
audiolabs/torch-pesq
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
sekiguchi92/SoundSourceSeparation
The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.
madhavmk/Noise2Noise-audio_denoising_without_clean_training_data
Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoising networks using only noisy speech samples.
skirdey/voicerestore
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration