wxlsummer's Stars
tmux/tmux
tmux source code
espnet/espnet
End-to-End Speech Processing Toolkit
kohpangwei/influence-release
TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Demfier/multimodal-speech-emotion-recognition
Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)
ASR-project/Multilingual-PR
Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022) pretrained on a corpus of English speech that we will use in various ways to perform phoneme recognition for different languages with a network trained with Connectionist Temporal Classification (CTC) algorithm.
dborrelli/chat-intents
Clustering sentence embeddings to extract message intent
zszyellow/WER-in-python
This program calculates the word error rate of hypothesis in ASR and print the aligned result.
felixkreuk/UnsupSeg
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
JoergFranke/phoneme_recognition
Phoneme Recognition using RecNet
matthijsvk/multimodalSR
Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.
tbornt/phoneme_ctc
Bidirectional dynamic RNN + CTC for phoneme recognition
AntreasAntoniou/minimal-ml-template
A very minimal ml project template that uses HF transformers and wandb to train a simple NN and evaluate it, in a stateless manner compatible with Spot instances kubernetes workflows
foundintranslation/Kaldi
Kaldi Snapshot
AntreasAntoniou/kubejobs
fengxin-bupt/Application-of-Word2vec-in-Phoneme-Recognition
Build an attention-based model for speech recogntion.Use the Word2vec model to help to train the attention model.
cvqluu/MTL-Speaker-Embeddings
Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presented at Interspeech 2021
nkrao220/accent-classification
Accent Classification in Speech
qinxiaoyi/Cross-Age_Speaker_Verification
agrija9/Avalinguo-Dataset-Speaker-Fluency-Level-Classification-Paper-
Code for paper "Speaker Fluency Level Classification using Machine Learning Techniques."
chorowski-lab/CPC_audio
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
wiebket/bt4vt
Bias Tests for Voice Technologies (bt4vt)
LIAvignon/BA-LR
Explainable Speaker Recognition system
huashen218/Voxceleb-Fairness
ICASSP 2022 paper - improve fairness of speaker verification models
yhifny/keras-kaldi
Flexible deep neural acoustic modeling using the Keras-Kaldi toolkit
bobzsj87/pykaldi
mirkomarras/fair-voice
A Python toolbox for fairness analysis in speaker veriification
a1rishav/speaker-recognizer
mexca/mexca-sd-experiment
A repository for comparing potential speaker diarization tools to be used in the MEXCA pipeline.
wxlsummer/tmux
tmux source code