Pinned Repositories
ATST-SED
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
audiossl
A library built for easier audio self-supervised training, downstream tasks evaluation
FN-SSL
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
FS-EEND
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming end-to-end neural diarization with online attractor extraction"
FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
McNet
The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
NBSS
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
pytorch_lightning_template_for_beginners
A pytorch template for beginners based on pytorch_lightning
RealMAN
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]
RVAE-EM
Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]
Audio-WestlakeU's Repositories
Audio-WestlakeU/FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Audio-WestlakeU/NBSS
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
Audio-WestlakeU/audiossl
A library built for easier audio self-supervised training, downstream tasks evaluation
Audio-WestlakeU/McNet
The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
Audio-WestlakeU/ATST-SED
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
Audio-WestlakeU/RealMAN
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]
Audio-WestlakeU/FS-EEND
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming end-to-end neural diarization with online attractor extraction"
Audio-WestlakeU/FN-SSL
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
Audio-WestlakeU/RVAE-EM
Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]
Audio-WestlakeU/pytorch_lightning_template_for_beginners
A pytorch template for beginners based on pytorch_lightning
Audio-WestlakeU/SAR-SSL
A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer” [TASLP 2024]
Audio-WestlakeU/UMA-ASR
This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).
Audio-WestlakeU/Narrowband_DeepFiltering
Audio-WestlakeU/RCT
This repo gives the code for the official implementation of RCT.
Audio-WestlakeU/OnlineSSL_DPRTF_EG
Audio-WestlakeU/LSTM-noisePSD
Audio-WestlakeU/Microphone-Array-Generalization-for-Multichannel-Narrowband-Deep-Speech-Enhancement
Audio-WestlakeU/bss_ctf_lasso
Audio-WestlakeU/Microphone-Array-Generalization-for-Multichannel-Narrowband-Deep-Speech-Enhancement-
Audio-WestlakeU/Audio-WestlakeU.github.io
Audio and Signal Information Processing Lab in Westlake University concentrates on speech processing algorithm
Audio-WestlakeU/DP_RTF_SSL
Audio-WestlakeU/SMIF_online_dereverb
Audio-WestlakeU/ATST-RCT
ATST-RCT model for DCASE 2022 task4.
Audio-WestlakeU/dereverb_ctf_nonneg
Audio-WestlakeU/RS_noisePSD
Audio-WestlakeU/RTF_InterFrameSpecSub
Audio-WestlakeU/BSS_CTF_EM
Audio-WestlakeU/ctf_mint