haha010508

Pinned Repositories

NBSS
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
Language:Python240 7 3627
google-research
Google Research
Language:Jupyter Notebook34.6k 755 1.3k8k
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding
Language:Python0 0 00
SkipConvNet
Speech Dereverberation using Fully Convolutional Networks
Language:Python0 0 00
UFLDL-Tutorial-Exercise
Language:MATLAB0 1 00
SonicSim
Language:Python211 8 725
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python7.5k 69 1.3k798
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python12.6k 210 2.3k2.6k
WenetSpeech
A 10000+ hours dataset for Chinese speech recognition
Language:Shell511 6 2449
TAC
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
Language:Python263 6 1554

haha010508's Repositories

haha010508/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding
Language:Python0 0 00
haha010508/SkipConvNet
Speech Dereverberation using Fully Convolutional Networks
Language:Python0 0 00
haha010508/UFLDL-Tutorial-Exercise
Language:MATLAB0 1 00