zewushui

Student

Xi'an

zewushui's Stars

huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Language:Python26.6k 214 4.3k5.5k
xmu-xiaoma666/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
Language:Python11.6k 103 821.9k
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Language:Python8.5k 36 2981.1k
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language:Python4.5k 49 248438
hujie-frank/SENet
Squeeze-and-Excitation Networks
Language:Cuda3.4k 83 91838
tomgoldstein/loss-landscape
Code for visualizing the loss landscape of neural nets
Language:Python2.9k 33 41403
Jongchan/attention-module
Official PyTorch code for "BAM: Bottleneck Attention Module (BMVC2018)" and "CBAM: Convolutional Block Attention Module (ECCV2018)"
Language:Python2.1k 19 50402
melodyguan/enas
TensorFlow Code for paper "Efficient Neural Architecture Search via Parameter Sharing"
Language:Python1.6k 76 115390
tonylins/pytorch-mobilenet-v2
A PyTorch implementation of MobileNet V2 architecture and pretrained model.
Language:Python1.4k 26 37329
jtkim-kaist/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Language:MATLAB845 44 40235
Audio-WestlakeU/FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Language:Python556 10 62158
DmitryRyumin/ICASSP-2023-24-Papers
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
Language:Python411 29 417
huyanxin/DeepComplexCRN
Language:HTML401 9 27100
yxlu-0102/MP-SENet
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Language:Python331 7 5548
yluo42/TAC
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
Language:Python260 6 1554
Xiaobin-Rong/gtcrn
The official implementation of GTCRN, an ultra-lite speech enhancement model.
Language:Python226 5 5042
echocatzh/MTFAA-Net
Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement
Language:Python196 7 1257
RoyChao19477/SEMamba
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
Language:Python145 12 1714
YuZheng9/C2PNet
[CVPR 2023] Curricular Contrastive Regularization for Physics-aware Single Image Dehazing
Language:Python140 3 3821
PengtaoJiang/LayerCAM-jittor
The official code for our TIP paper 'LayerCAM: Exploring Hierarchical Class Activation Maps for Localization'
Language:Python117 1 1413
ConferencingSpeech/ConferencingSpeech2021
Conferencing Speech Challenge
Language:Python90 8 1232
Andong-Li-speech/TaylorSENet
This is the implementation of the paper ''Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement'', which was accepted by IJCAI-ECAI2022 (Long oral)
Language:Python63 1 212
yuzhouhe2000/OMLSA-IMCRA
Python implementation of OMLSA+IMCRA algorithm for speech enhancement.
Language:Python51 3 218
AkojimaSLP/Frame-by-frame-closed-form-update-for-mask-based-adaptive-MVDR-beamforming
speech-enhacement
Language:Python50 4 016
SamsungLabs/ffc_se
Code for the paper "FFC-SE: Fast Fourier Convolution for Speech Enhancement" (published at Interspeech 2022 conference)
Language:Python49 3 05
gitwukeyi/FSPEN
Language:Python38 3 811
ModarHalimeh/COSPA
Complex-valued Spatial Autoencoders for Multichannel Speech Enhancement
Language:Python30 3 112
phecda-xu/RIR-Generator
为音频加混响的代码
Language:C++25 2 11
TzuchengChang/NASS
Noise-Aware Speech Separation with Contrastive Learning
Language:Python166
Qingzheng-Wang/Dual-Window-SE
An implement of STFT-Domain Neural Speech Enhancement with Very Low Algorithmic Latency of Zhong-Qiu Wang et al.
Language:Python12 1 21

zewushui

zewushui's Stars

huggingface/diffusers

xmu-xiaoma666/External-Attention-pytorch

lucidrains/denoising-diffusion-pytorch

snakers4/silero-vad

hujie-frank/SENet

tomgoldstein/loss-landscape

Jongchan/attention-module

melodyguan/enas

tonylins/pytorch-mobilenet-v2

jtkim-kaist/VAD

Audio-WestlakeU/FullSubNet

DmitryRyumin/ICASSP-2023-24-Papers

huyanxin/DeepComplexCRN

yxlu-0102/MP-SENet

yluo42/TAC

Xiaobin-Rong/gtcrn

echocatzh/MTFAA-Net

RoyChao19477/SEMamba

YuZheng9/C2PNet

PengtaoJiang/LayerCAM-jittor

ConferencingSpeech/ConferencingSpeech2021

Andong-Li-speech/TaylorSENet

yuzhouhe2000/OMLSA-IMCRA

AkojimaSLP/Frame-by-frame-closed-form-update-for-mask-based-adaptive-MVDR-beamforming

SamsungLabs/ffc_se

gitwukeyi/FSPEN

ModarHalimeh/COSPA

phecda-xu/RIR-Generator

TzuchengChang/NASS

Qingzheng-Wang/Dual-Window-SE