runngezhang's Stars

facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python20.6k 203 3722.1k
kska32/ebooks
A collection of classic e-books on history, politics, psychology, philosophy, mathematics, and computer science (about 100,000 books)
Language:JavaScript4k 57 13556
philipperemy/keras-tcn
Keras Temporal Convolutional Network.
Language:Python1.9k 48 170455
ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language:Python1.8k 28 212527
haoheliu/versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Language:Python1.1k 24 55106
cpuimage/WebRTC_NS
Noise Suppression Module Port From WebRTC
Language:C307 15 9146
maum-ai/nuwave2
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022
Language:Python273 8 1821
slp-rl/aero
This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)
Language:Python195 6 2726
maggie0830/DCCRN
PyTorch implementation of "DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement"
Language:Python180 2 031
sp-uhh/storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
Language:Python168 12 2223
sj-li/MS-TCN2
MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation (TPAMI 2020)
Language:Python140 3 1232
fakufaku/fast_bss_eval
A fast implementation of bss_eval metrics for blind source separation
Language:Python130 4 108
google-research/seanet
Language:HTML116 13 022
fakufaku/diffusion-separation
Single channel speech source separation by diffusion process (ICASSP 2023)
Language:Python89 8 310
chomeyama/DualCycleGAN
Official implementation of DualCycleGAN for nonparallel audio super resolution
Language:Python49 2 45
sp-uhh/deep-non-linear-filter
Language:Python44 5 210
slp-rl/SC-PhASE
This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (Interspeech 2022)
Language:Python27 1 02
Hadryan/TFNet-for-Environmental-Sound-Classification
Learning discriminative and robust time-frequency representations for environmental sound classification: Convolutional neural networks (CNNs) are among the best-performing neural network architectures for environmental sound classification (ESC). Recently, attention mechanisms have been used in CNNs to capture useful information from the audio signal for sound classification, especially for weakly labelled data, where only sound class labels are available and the timing of acoustic events is not. These methods, however, do not explicitly exploit the inherent time-frequency characteristics and variations when obtaining the deep features. In this paper, we propose a new method, called the time-frequency enhancement block (TFBlock), in which temporal attention and frequency attention are employed to enhance the features from relevant frames and frequency bands. Unlike other attention mechanisms, our method constructs parallel branches that allow the temporal and frequency features to be attended separately, in order to mitigate interference from sections where no sound events occur in the acoustic environment. Experiments on three benchmark ESC datasets show that our method improves classification performance and also exhibits robustness to noise.
Language:Python26 3 04
moodoki/tfnet
Language:Python24 2 27
zeroone-universe/AECNN_for_Speech_Enhancement
Unofficial PyTorch Lightning implementation of "A New Framework for CNN-Based Speech Enhancement in the Time Domain"
Language:Python15 1 26
tan90xx/audio-super-resolution-tf
https://tan90xx.github.io/SR-display.github.io/
Language:Jupyter Notebook9 1 01
zeroone-universe/TowardsRobustSpeechSR
Unofficial PyTorch Lightning implementation of "Towards Robust Speech Super-Resolution"
Language:Python53
BerlinerA/DSVAE-NES
This repository contains the official PyTorch implementation of the paper: "Learning Discrete Structured VAE using NES".
Language:Python4 2 04
nicolas-dufour/self-supervised-low-res-speech
This project transfers the self-supervised Wav2vec2 representation to low-resource languages
Language:Jupyter Notebook3 1 01
maggie0830/WebRTC_NS
Noise Suppression Module Port From WebRTC
Language:C2 1 00
zeroone-universe/BinauralEffectSimulator
Language:Jupyter Notebook2 1 01
andreeavoicu19/Music-Recommender-System
Based on sound processing and audio feature extraction
Language:Python12
zeroone-universe/AdaSpeech
An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"
Language:Python1 0 01
zeroone-universe/GM4MNIST
Language:Python1 1 01
zeroone-universe/SRGAN
Unofficial PyTorch Lightning implementation of SRGAN
Language:Python1 1 01