Pinned Repositories
NBSS
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
google-research
Google Research
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding
SkipConvNet
Speech Dereverberation using Fully Convolutional Networks
UFLDL-Tutorial-Exercise
SonicSim
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
WenetSpeech
A 10000+ hours dataset for Chinese speech recognition
TAC
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
haha010508's Repositories
haha010508/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding
haha010508/SkipConvNet
Speech Dereverberation using Fully Convolutional Networks
haha010508/UFLDL-Tutorial-Exercise