realspacepen

realspacepen's Stars

Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI tool and algorithm for making ID photos.
Language: Python 13.9k 60 1221.5k
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language: Python 2.3k 52 222424
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
1.9k 168 470
maum-ai/voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system
Language: Python 1.1k 35 26228
WenzheLiu-Speech/awesome-speech-enhancement
Speech enhancement, speech separation, and sound source localization
1.1k 42 1223
f90/Wave-U-Net
Implementation of the Wave-U-Net for audio source separation
Language: Python 856 21 54177
DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
650 89 442
TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
Language: Python 624 4 83115
HarryVolek/PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
Language: Python 579 19 74165
JusperLee/Conv-TasNet
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation (PyTorch implementation)
Language: Python 444 6 5777
MoonInTheRiver/NeuralSVB
Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code
Language: Python 426 12 1952
DmitryRyumin/ICASSP-2023-24-Papers
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
Language: Python 419 29 417
huyanxin/DeepComplexCRN
Language: HTML 405 9 27100
sharathadavanne/seld-net
Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional recurrent neural network
Language: Python 343 16 2866
haoxiangsnr/A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement
A minimal unofficial PyTorch implementation of "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN)
Language: Python 317 7 2759
fgnt/nn-gev
Neural network supported GEV beamformer
Language: Python 200 14 1491
Enny1991/beamformers
Easy to use Beamformers for multi-channel speech separation/enhancement
Language: Python 191 4 448
ZitengWang/MASP
Microphone Array Speech Processing
Language: MATLAB 187 7 177
haoxiangsnr/IRM-based-Speech-Enhancement-using-LSTM
Ideal Ratio Mask (IRM) Estimation based Speech Enhancement using LSTM
Language: Python 114 3 525
yinkalario/Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization
A two-stage polyphonic sound event detection and localization method for both SED and DOA.
Language: Python 110 2 426
Audio-WestlakeU/FN-SSL
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
Language: Python 97 5 810
zhaojw1998/Beat-Transformer
Code for the ISMIR 2022 paper: Beat Transformer: Demixed Beat and Downbeat Tracking with Dilated Self-Attention
Language: Python 93 3 1919
seorim0/DCCRN-with-various-loss-functions
DCCRN with various loss functions
Language: Python 92 1 822
JupiterEthan/GCRN-complex
Language: Python 87 3 741
Dannynis/xvector_pytorch
A PyTorch implementation of x-vector embeddings
Language: Jupyter Notebook 78 4 77
JupiterEthan/CRN-causal
Language: Python 58 2 325
WildHoneyPie/BEAST
Code for the ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking system based on a streaming Transformer
Language: Python 34 2 31
YangangCao/Causal-U-Net
Unofficial PyTorch implementation of "A Causal U-net based Neural Beamforming Network for Real-Time Multi-Channel Speech Enhancement"
Language: Python 32 3 27
yongxuUSTC/grnnbf
Generalized RNN beamformer for speech separation
Language: HTML 17 3 00
TJU-haoran/VCTK-16k-simulated
Simulation data from VCTK Corpus (version 0.92) for direction of arrival (DoA) estimation, and detailed data simulation process.
Language: Python 9 1 32
