speech-enhancement

There are 248 repositories under speech-enhancement topic.

speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python10.4k 134 1.2k1.6k
espnet/espnet
End-to-End Speech Processing Toolkit
Language:Python9.5k 168 2.5k2.3k
modelscope/ClearerVoice-Studio
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Language:Python3.4k 30 82276
Rikorose/DeepFilterNet
Noise supression using deep filtering
Language:Python3.3k 35 317315
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language:Python2.5k 51 225440
resemble-ai/resemble-enhance
AI powered speech denoising and enhancement
Language:Python2k 21 64233
haoheliu/voicefixer
General Speech Restoration
Language:Python1.2k 16 62147
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python1.1k 14 1786
JusperLee/Speech-Separation-Paper-Tutorial
A must-read paper for speech separation based on neural networks
Language:TypeScript828 25 2137
nanahou/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Language:MATLAB786 32 5152
breizhn/DTLN
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
Language:Python645 9 83166
k2kobayashi/sprocket
Voice Conversion Tool Kit
Language:Python605 34 78116
Audio-WestlakeU/FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Language:Python576 8 63157
anicolson/DeepXi
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
Language:MATLAB516 25 49125
schmiph2/pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
Language:Python436 8 1491
Xiaobin-Rong/gtcrn
The official implementation of GTCRN, an ultra-lightweight SE model.
Language:Python425 5 6151
double22a/speech_dataset
The dataset of Speech Recognition
424 10 377
yxlu-0102/MP-SENet
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Language:Python424 7 6360
funcwj/setk
Tools for Speech Enhancement integrated with Kaldi
Language:Python420 21 1192
jzi040941/PercepNet
Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
Language:C++356 26 4294
yongxuUSTC/sednn
deep learning based speech enhancement using keras or pytorch, make it easy to use
Language:Python338 18 58123
shahules786/mayavoz
Pytorch based speech enhancement toolkit.
Language:Python337 12 1626
haoxiangsnr/Wave-U-Net-for-Speech-Enhancement
Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.
Language:Python336 10 1367
haoxiangsnr/A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement
A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch
Language:Python332 6 2760
seanwood/gcc-nmf
Real-time GCC-NMF Blind Speech Separation and Enhancement
Language:Python324 11 16134
aishoot/LSTM_PIT_Speech_Separation
Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.
Language:Jupyter Notebook309 16 2290
fgnt/pb_bss
Collection of EM algorithms for blind source separation of audio signals
Language:Python294 12 1260
haoheliu/voicefixer_main
General Speech Restoration
Language:Python283 11 1956
AkojimaSLP/Beamforming-for-speech-enhancement
simple delaysum, MVDR and CGMM-MVDR
Language:Python269 5 574
huyanxin/phasen
A unofficial Pytorch implementation of Microsoft's PHASEN
Language:Python231 9 1350
echocatzh/MTFAA-Net
Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement
Language:Python218 7 1262
jtkim-kaist/Speech-enhancement
Deep neural network based speech enhancement toolkit
Language:MATLAB217 8 2862
audiolabs/torch-pesq
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
Language:Python203 5 715
sekiguchi92/SoundSourceSeparation
The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.
Language:Python203 8 634
madhavmk/Noise2Noise-audio_denoising_without_clean_training_data
Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoising networks using only noisy speech samples.
Language:Jupyter Notebook197 6 745
skirdey/voicerestore
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration
Language:Python185 7 818

speech-enhancement

speechbrain/speechbrain

espnet/espnet

modelscope/ClearerVoice-Studio

Rikorose/DeepFilterNet

asteroid-team/asteroid

resemble-ai/resemble-enhance

haoheliu/voicefixer

ictnlp/StreamSpeech

JusperLee/Speech-Separation-Paper-Tutorial

nanahou/Awesome-Speech-Enhancement

breizhn/DTLN

k2kobayashi/sprocket

Audio-WestlakeU/FullSubNet

anicolson/DeepXi

schmiph2/pysepm

Xiaobin-Rong/gtcrn

double22a/speech_dataset

yxlu-0102/MP-SENet

funcwj/setk

jzi040941/PercepNet

yongxuUSTC/sednn

shahules786/mayavoz

haoxiangsnr/Wave-U-Net-for-Speech-Enhancement

haoxiangsnr/A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement

seanwood/gcc-nmf

aishoot/LSTM_PIT_Speech_Separation

fgnt/pb_bss

haoheliu/voicefixer_main

AkojimaSLP/Beamforming-for-speech-enhancement

huyanxin/phasen

echocatzh/MTFAA-Net

jtkim-kaist/Speech-enhancement

audiolabs/torch-pesq

sekiguchi92/SoundSourceSeparation

madhavmk/Noise2Noise-audio_denoising_without_clean_training_data

skirdey/voicerestore