Pinned Repositories
audio-visual-speech-enhancement
Official Implementation of "Visual Speech Enhancement", Interspeech 2018.
audio_visual_speech_enhancement
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
awesome-speech
this is a treasure-house of speech
awesome-speech-enhancement
speech enhancement\speech seperation\sound source localization
Beam-Guided-TasNet
Beam-guided TasNet
Calculate-SNR-SDR
Script to calculate SNR and SDR using python
Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
DeepXi
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
demucs
Code for the paper Music Source Separation in the Waveform Domain
DNN-Phase-Reconstruction
pigip's Repositories
pigip/Calculate-SNR-SDR
Script to calculate SNR and SDR using python
pigip/audio-visual-speech-enhancement
Official Implementation of "Visual Speech Enhancement", Interspeech 2018.
pigip/audio_visual_speech_enhancement
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
pigip/awesome-speech
this is a treasure-house of speech
pigip/awesome-speech-enhancement
speech enhancement\speech seperation\sound source localization
pigip/Beam-Guided-TasNet
Beam-guided TasNet
pigip/Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
pigip/DeepXi
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
pigip/demucs
Code for the paper Music Source Separation in the Waveform Domain
pigip/DNN-Phase-Reconstruction
pigip/dnn_wpe
pigip/dual-path-RNNs-DPRNNs-based-speech-separation
A PyTorch implementation of dual-path RNNs (DPRNNs) based speech separation described in "Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation".
pigip/ERNN-for-speech-enhancement
pigip/FaSNet-TAC-PyTorch
Full implementation of "End-to-end microphone permutation and number invariant multi-channel speech separation" (Interspeech 2020)
pigip/knowledge-distillation-pytorch
A PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments with flexibility
pigip/Lip_Reading_in_the_Wild_AVSR
Audio-Visual Speech Recognition using Deep Learning
pigip/mediaio
pigip/onssen
An open-source speech separation and enhancement library
pigip/open-unmix-pytorch
Open-Unmix - Music Source Separation for PyTorch
pigip/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
pigip/python-pesq
A python package for calculating the PESQ.
pigip/pytorch
pigip/SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
pigip/Sound_Localization_Algorithms
Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.
pigip/speech-dereverberation
speech-dereverberation-using-GANs
pigip/Speech-Separation-Paper
A must-read paper for speech separation based on neural networks
pigip/speech_feature_extractor
Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplitude Modulation Spectrum(AMS) and so on.
pigip/speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
pigip/spleeter
Deezer source separation library including pretrained models.
pigip/wavenet
Keras WaveNet implementation