Pinned Repositories
asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
Attention
Audio-Feature-Extraction
In sound processing, the mel-frequency cepstrum (MFC) is a representation of the short-term power spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear mel scale of frequency. Mel-frequency cepstral coefficients (MFCCs) are coefficients that collectively make up an MFC.
Awesome-Deblurring
A curated list of resources for Image and Video Deblurring
DANet
Dual Attention Network for Scene Segmentation (CVPR2019)
DeepFilterNet
the fwSegSNR code
py-aec-unified2021
pyaudlib
A speech signal processing library in Python with emphasis on deep learning.
speech_feature_extractor
Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplitude Modulation Spectrum(AMS) and so on.
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
zwb0626's Repositories
zwb0626/asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
zwb0626/Attention
zwb0626/Audio-Feature-Extraction
In sound processing, the mel-frequency cepstrum (MFC) is a representation of the short-term power spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear mel scale of frequency. Mel-frequency cepstral coefficients (MFCCs) are coefficients that collectively make up an MFC.
zwb0626/Awesome-Deblurring
A curated list of resources for Image and Video Deblurring
zwb0626/DANet
Dual Attention Network for Scene Segmentation (CVPR2019)
zwb0626/DeepFilterNet
the fwSegSNR code
zwb0626/py-aec-unified2021
zwb0626/pyaudlib
A speech signal processing library in Python with emphasis on deep learning.
zwb0626/speech_feature_extractor
Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplitude Modulation Spectrum(AMS) and so on.
zwb0626/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit