realspacepen's Stars
Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
maum-ai/voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system
WenzheLiu-Speech/awesome-speech-enhancement
speech enhancement\speech seperation\sound source localization
f90/Wave-U-Net
Implementation of the Wave-U-Net for audio source separation
DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
HarryVolek/PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
JusperLee/Conv-TasNet
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement
MoonInTheRiver/NeuralSVB
Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code
DmitryRyumin/ICASSP-2023-24-Papers
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
huyanxin/DeepComplexCRN
sharathadavanne/seld-net
Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional recurrent neural network
haoxiangsnr/A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement
A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch
fgnt/nn-gev
Neural network supported GEV beamformer
Enny1991/beamformers
Easy to use Beamformers for multi-channel speech separation/enhancement
ZitengWang/MASP
Microphone Array Speech Processing
haoxiangsnr/IRM-based-Speech-Enhancement-using-LSTM
Ideal Ratio Mask (IRM) Estimation based Speech Enhancement using LSTM
yinkalario/Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization
A two-stage polyphonic sound event detection and localization method for both SED and DOA.
Audio-WestlakeU/FN-SSL
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
zhaojw1998/Beat-Transformer
Codes for ISMIR 2022 paper: Beat Transformer: Demixed Beat and Downbeat Tracking with Dilated Self-Attention
seorim0/DCCRN-with-various-loss-functions
DCCRN with various loss functions
JupiterEthan/GCRN-complex
Dannynis/xvector_pytorch
A pytorch implementation of xvector embedding
JupiterEthan/CRN-causal
WildHoneyPie/BEAST
Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking system based on streaming Transformer
YangangCao/Causal-U-Net
unofficial PyTorch implementation of 《A Causal U-net based Neural Beamforming Network for Real-Time Multi-Channel Speech Enhancement》
yongxuUSTC/grnnbf
Generalized RNN beamformer for speech separation
TJU-haoran/VCTK-16k-simulated
Simulation data from VCTK Corpus (version 0.92) for direction of arrival (DoA) estimation, and detailed data simulation process.