Pratik039's Stars
GATECH-EIC/S3-Router
[NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing" by Yonggan Fu, Yang Zhang, Kaizhi Qian, Zhifan Ye, Zhongzhi Yu, Cheng-I Lai, Yingyan Lin
skit-ai/Map-Mix
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at ICASSP-2023)
DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
jcvasquezc/DisVoice
feature extraction from speech signals
sevagh/chord-detection
DSP algorithms for chord detection + key estimation
bayesian-optimization/BayesianOptimization
A Python implementation of global optimization with gaussian processes.
Lhx94As/Awesome-Spoken-Language-Identification
An awesome spoken LID repository. (Working in progress
lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
wiseman/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
RicherMans/PLDA
An LDA/PLDA estimator using KALDI in python for speaker verification tasks
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
jagabandhumishra/E2E_LD
End to end language diarization