zewushui's Stars
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
xmu-xiaoma666/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
hujie-frank/SENet
Squeeze-and-Excitation Networks
tomgoldstein/loss-landscape
Code for visualizing the loss landscape of neural nets
Jongchan/attention-module
Official PyTorch code for "BAM: Bottleneck Attention Module (BMVC2018)" and "CBAM: Convolutional Block Attention Module (ECCV2018)"
melodyguan/enas
TensorFlow Code for paper "Efficient Neural Architecture Search via Parameter Sharing"
tonylins/pytorch-mobilenet-v2
A PyTorch implementation of MobileNet V2 architecture and pretrained model.
jtkim-kaist/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Audio-WestlakeU/FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
DmitryRyumin/ICASSP-2023-24-Papers
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
huyanxin/DeepComplexCRN
yxlu-0102/MP-SENet
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
yluo42/TAC
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
Xiaobin-Rong/gtcrn
The official implementation of GTCRN, an ultra-lite speech enhancement model.
echocatzh/MTFAA-Net
Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement
RoyChao19477/SEMamba
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
YuZheng9/C2PNet
[CVPR 2023] Curricular Contrastive Regularization for Physics-aware Single Image Dehazing
PengtaoJiang/LayerCAM-jittor
The official code for our TIP paper 'LayerCAM: Exploring Hierarchical Class Activation Maps for Localization'
ConferencingSpeech/ConferencingSpeech2021
Conferencing Speech Challenge
Andong-Li-speech/TaylorSENet
This is the implementation of the paper ''Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement'', which was accepted by IJCAI-ECAI2022 (Long oral)
yuzhouhe2000/OMLSA-IMCRA
Python implementation of OMLSA+IMCRA algorithm for speech enhancement.
AkojimaSLP/Frame-by-frame-closed-form-update-for-mask-based-adaptive-MVDR-beamforming
speech-enhacement
SamsungLabs/ffc_se
Code for the paper "FFC-SE: Fast Fourier Convolution for Speech Enhancement" (published at Interspeech 2022 conference)
gitwukeyi/FSPEN
ModarHalimeh/COSPA
Complex-valued Spatial Autoencoders for Multichannel Speech Enhancement
phecda-xu/RIR-Generator
为音频加混响的代码
TzuchengChang/NASS
Noise-Aware Speech Separation with Contrastive Learning
Qingzheng-Wang/Dual-Window-SE
An implement of STFT-Domain Neural Speech Enhancement with Very Low Algorithmic Latency of Zhong-Qiu Wang et al.