speaker-recognition
There are 281 repositories under speaker-recognition topic.
shangeth/wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
TaoRuijie/Loss-Gated-Learning
ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'
yuyq96/D-TDNN
PyTorch implementation of Densely Connected Time Delay Neural Network
GauravWaghmare/Speaker-Identification
A program for automatic speaker identification using deep learning techniques.
cvqluu/GE2E-Loss
Pytorch implementation of Generalized End-to-End Loss for speaker verification
georgygospodinov/speech_course
Deep Learning for Speech
linhdvu14/vggvox-speaker-identification
Speaker identification with VGGVox network
Speech-Interaction-Technology-Aalto-U/itsp
Introduction to Speech Processing
thuiar/MIntRec
MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)
seongmin-kye/meta-SR
Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)
grausof/keras-sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
VidyasagarMSC/WatBot
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
cyrta/voxceleb
mirror of VoxCeleb dataset - a large-scale speaker identification dataset
hyperion-ml/hyperion
Python toolkit for speech processing
mjpyeon/wavenet-classifier
Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
shangeth/SpeakerProfiling
Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf
Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
zycv/OpenSpeaker
OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognition including multi-platform deployment and model optimization.
Wadaboa/titanet
Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO
mialrr/Speaker-Recognition
声纹识别(Voiceprint Recognition, VPR),也称为说话人识别(Speaker Recognition),有两类,即说话人辨认(Speaker Identification)和说话人确认(Speaker Verification)
Adirockzz95/Piwho
Speaker recognition library based on MARF for raspberry pi and other SBCs.
andi611/Mockingjay-Speech-Representation
Official Implementation of Mockingjay in Pytorch
SuperKogito/Voice-based-speaker-identification
:sound: :boy: :girl: :woman: :man: Speaker identification using voice MFCCs and GMM
Aurora11111/speaker-recognition-pytorch
Speaker recognition ,Voiceprint recognition
pika-online/AESRC2020
a deep accent recognition network
ZhaZhaFon/resource_speech
语音算法相关资源汇总 Resource for Speech Processing || NEWS: official link of VoxCeleb fails recently and an external link is added for download
qianhwan/KaldiBasedSpeakerVerification
Kaldi based speaker verification
ranchlai/awesome-speaker-embedding
A curated list of speaker-embedding speaker-verification, speaker-identification resources.
vi7/ecoute-macos
Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggested response using OpenAI's GPT-3.5 for the user to say based on the live transcription of the conversation.
bioidiap/bob
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
cvqluu/nn-similarity-diarization
Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")
KrishnaDN/Attentive-Statistics-Pooling-for-Deep-Speaker-Embedding
Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch
maxhollmann/voxceleb-luigi
Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments
swshon/voxceleb-ivector
Voxceleb1 i-vector based speaker recognition system
wq2012/SpeakerRecognitionFromScratch
Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家
PiotrTa/Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.