pigip

fdu

Pinned Repositories

audio-visual-speech-enhancement
Official Implementation of "Visual Speech Enhancement", Interspeech 2018.
Language:Python0 1 00
audio_visual_speech_enhancement
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
Language:Python0 1 00
awesome-speech
this is a treasure-house of speech
0 1 00
awesome-speech-enhancement
speech enhancement\speech seperation\sound source localization
0 1 00
Beam-Guided-TasNet
Beam-guided TasNet
Language:Python0 0 00
Calculate-SNR-SDR
Script to calculate SNR and SDR using python
Language:Python1 1 00
Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
Language:Python0 1 00
DeepXi
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
Language:MATLAB0 0 00
demucs
Code for the paper Music Source Separation in the Waveform Domain
Language:Python0 1 00
DNN-Phase-Reconstruction
Language:Jupyter Notebook0 1 00

pigip's Repositories

pigip/Calculate-SNR-SDR
Script to calculate SNR and SDR using python
Language:Python1 1 00
pigip/audio-visual-speech-enhancement
Official Implementation of "Visual Speech Enhancement", Interspeech 2018.
Language:Python0 1 00
pigip/audio_visual_speech_enhancement
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
Language:Python0 1 00
pigip/awesome-speech
this is a treasure-house of speech
0 1 00
pigip/awesome-speech-enhancement
speech enhancement\speech seperation\sound source localization
0 1 00
pigip/Beam-Guided-TasNet
Beam-guided TasNet
Language:Python0 0 00
pigip/Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
Language:Python0 1 00
pigip/DeepXi
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
Language:MATLAB0 0 00
pigip/demucs
Code for the paper Music Source Separation in the Waveform Domain
Language:Python0 1 00
pigip/DNN-Phase-Reconstruction
Language:Jupyter Notebook0 1 00
pigip/dnn_wpe
Language:Python1 0
pigip/dual-path-RNNs-DPRNNs-based-speech-separation
A PyTorch implementation of dual-path RNNs (DPRNNs) based speech separation described in "Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation".
Language:Python1 0
pigip/ERNN-for-speech-enhancement
Language:Python1 0
pigip/FaSNet-TAC-PyTorch
Full implementation of "End-to-end microphone permutation and number invariant multi-channel speech separation" (Interspeech 2020)
Language:Python0 0
pigip/knowledge-distillation-pytorch
A PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments with flexibility
Language:Python1 0
pigip/Lip_Reading_in_the_Wild_AVSR
Audio-Visual Speech Recognition using Deep Learning
Language:Python1 0
pigip/mediaio
Language:Python1 0
pigip/onssen
An open-source speech separation and enhancement library
Language:Python1 0
pigip/open-unmix-pytorch
Open-Unmix - Music Source Separation for PyTorch
Language:Python1 0
pigip/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
Language:Python1 0
pigip/python-pesq
A python package for calculating the PESQ.
Language:Python1 0
pigip/pytorch
1 0
pigip/SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
Language:Python1 0
pigip/Sound_Localization_Algorithms
Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.
pigip/speech-dereverberation
speech-dereverberation-using-GANs
Language:Python1 0
pigip/Speech-Separation-Paper
A must-read paper for speech separation based on neural networks
1 0
pigip/speech_feature_extractor
Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplitude Modulation Spectrum(AMS) and so on.
Language:Python1 0
pigip/speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
Language:Python1 0
pigip/spleeter
Deezer source separation library including pretrained models.
Language:Python1 0
pigip/wavenet
Keras WaveNet implementation
Language:Python1 0