wxlsummer

University of Edinburgh

wxlsummer's Stars

tmux/tmux
tmux source code
Language:C34.9k 425 3.4k2.1k
espnet/espnet
End-to-End Speech Processing Toolkit
Language:Python8.4k 182 2.4k2.2k
kohpangwei/influence-release
Language:Jupyter Notebook771 18 22177
TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
Language:Python591 4 82112
Demfier/multimodal-speech-emotion-recognition
Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)
Language:Jupyter Notebook396 11 3484
ASR-project/Multilingual-PR
Phoneme recognition using the pre-trained models Wav2vec2, HuBERT and WavLM. The project compares three self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022), each pretrained on a corpus of English speech, and uses them in various ways to perform phoneme recognition for different languages with a network trained with the Connectionist Temporal Classification (CTC) algorithm.
Language:Python194 4 517
dborrelli/chat-intents
Clustering sentence embeddings to extract message intent
Language:Jupyter Notebook166 5 524
zszyellow/WER-in-python
This program calculates the word error rate of an ASR hypothesis and prints the aligned result.
Language:Python152 5 877
felixkreuk/UnsupSeg
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
Language:Python135 5 929
JoergFranke/phoneme_recognition
Phoneme Recognition using RecNet
Language:Python90 9 630
matthijsvk/multimodalSR
Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.
Language:Jupyter Notebook66 8 119
tbornt/phoneme_ctc
Bidirectional dynamic RNN + CTC for phoneme recognition
Language:Python44 2 412
AntreasAntoniou/minimal-ml-template
A very minimal ml project template that uses HF transformers and wandb to train a simple NN and evaluate it, in a stateless manner compatible with Spot instances kubernetes workflows
Language:Python35 2 16
foundintranslation/Kaldi
Kaldi Snapshot
Language:C++30 9 152
AntreasAntoniou/kubejobs
Language:Python27 2 38
fengxin-bupt/Application-of-Word2vec-in-Phoneme-Recognition
Build an attention-based model for speech recognition. Use the Word2vec model to help train the attention model.
Language:Python27 1 39
cvqluu/MTL-Speaker-Embeddings
Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presented at Interspeech 2021
Language:Python24 2 16
nkrao220/accent-classification
Accent Classification in Speech
Language:Python24 1 15
qinxiaoyi/Cross-Age_Speaker_Verification
22 2 42
agrija9/Avalinguo-Dataset-Speaker-Fluency-Level-Classification-Paper-
Code for paper "Speaker Fluency Level Classification using Machine Learning Techniques."
Language:Jupyter Notebook17 1 16
chorowski-lab/CPC_audio
An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
Language:Jupyter Notebook11 1 09
wiebket/bt4vt
Bias Tests for Voice Technologies (bt4vt)
Language:Python11 1 411
LIAvignon/BA-LR
Explainable Speaker Recognition system
Language:HTML6 1 01
huashen218/Voxceleb-Fairness
ICASSP 2022 paper - improve fairness of speaker verification models
Language:Python5 2 01
yhifny/keras-kaldi
Flexible deep neural acoustic modeling using the Keras-Kaldi toolkit
Language:Python5 4 21
bobzsj87/pykaldi
Language:Python2 2 00
mirkomarras/fair-voice
A Python toolbox for fairness analysis in speaker verification
Language:Jupyter Notebook2 1 02
a1rishav/speaker-recognizer
Language:Jupyter Notebook1 1 01
mexca/mexca-sd-experiment
A repository for comparing potential speaker diarization tools to be used in the MEXCA pipeline.
Language:Jupyter Notebook1 0 02
wxlsummer/tmux
tmux source code
Language:C1 0 00
