speaker-recognition

There are 294 repositories under speaker-recognition topic.

NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python13.1k 216 2.4k2.7k
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python9.4k 136 1.1k1.4k
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6.9k 75 1k825
google/uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Language:Python1.6k 100 87320
mravanelli/SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
Language:Python1.2k 34 106263
clovaai/voxceleb_trainer
In defence of metric learning for speaker recognition
Language:Python1.1k 30 174276
athena-team/athena
an open-source implementation of sequence-to-sequence based speech processing engine
Language:C++935 37 137188
yeyupiaoling/VoiceprintRecognition-Pytorch
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods
Language:Python895 11 70132
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Language:Python825 17 138126
astorfi/3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
Language:Python782 58 58273
TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Language:Python640 4 84116
cvqluu/Angular-Penalty-Softmax-Losses-Pytorch
Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)
Language:Python485 11 1891
taylorlu/Speaker-Diarization
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
Language:Python478 14 61120
nuaazs/VAF_2
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
Language:Python405 4 021
google/speaker-id
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
Language:Python393 18 739
speechbrain/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Language:HTML366 41 529
manojpamk/pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
Language:Python309 9 1564
yeyupiaoling/VoiceprintRecognition-Tensorflow
使用Tensorflow实现声纹识别
Language:Python306 4 2466
yeyupiaoling/VoiceprintRecognition-PaddlePaddle
本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型，同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法
Language:Python252 6 1447
Walleclipse/Deep_Speaker-speaker_recognition_system
Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)
Language:Python248 10 7880
crouchred/speaker-recognition-py3
Base on MFCC and GMM(基于MFCC和高斯混合模型的语音识别)
Language:Python246 10 1378
SamirPaulb/real-time-voice-translator
A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.
Language:Tcl230 4 461
jymsuper/SpeakerRecognition_tutorial
Simple d-vector based Speaker Recognition (verification and identification) using Pytorch
Language:Python211 10 2146
Atul-Anand-Jha/Speaker-Identification-Python
Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library
Language:Python207 13 3075
VITA-Group/AutoSpeech
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
Language:Python207 16 1442
cvqluu/TDNN
Time delay neural network (TDNN) implementation in Pytorch using unfold method
Language:Python199 7 340
IBM-Cloud/chatbot-watson-android
An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.
Language:Java195 22 0182
NavodPeiris/speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Language:Python186 5 1617
oscarknagg/voicemap
Identifying people from small audio fragments
Language:Python170 6 773
Speaker-Identification/You-Only-Speak-Once
Deep Learning - one shot learning for speaker recognition using Filter Banks
Language:Jupyter Notebook164 6 941
lihanghang/CASR-DEMO
基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。
Language:CSS159 4 429
cvqluu/Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Language:Python143 8 534
yeyupiaoling/VoiceprintRecognition-Keras
基于Kersa实现的声纹识别模型
Language:Python137 4 428
jefflai108/pytorch-kaldi-neural-speaker-embeddings
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
Language:Perl136 9 634
Anwarvic/Speaker-Recognition
This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
Language:Python111 4 1533
bjfu-ai-institute/speaker-recognition-papers
Share some recent speaker recognition papers and their implementations.
Language:Python90 14 920

speaker-recognition

NVIDIA/NeMo

speechbrain/speechbrain

pyannote/pyannote-audio

google/uis-rnn

mravanelli/SincNet

clovaai/voxceleb_trainer

athena-team/athena

yeyupiaoling/VoiceprintRecognition-Pytorch

wenet-e2e/wespeaker

astorfi/3D-convolutional-speaker-recognition

TaoRuijie/ECAPA-TDNN

cvqluu/Angular-Penalty-Softmax-Losses-Pytorch

taylorlu/Speaker-Diarization

nuaazs/VAF_2

google/speaker-id

speechbrain/speechbrain.github.io

manojpamk/pytorch_xvectors

yeyupiaoling/VoiceprintRecognition-Tensorflow

yeyupiaoling/VoiceprintRecognition-PaddlePaddle

Walleclipse/Deep_Speaker-speaker_recognition_system

crouchred/speaker-recognition-py3

SamirPaulb/real-time-voice-translator

jymsuper/SpeakerRecognition_tutorial

Atul-Anand-Jha/Speaker-Identification-Python

VITA-Group/AutoSpeech

cvqluu/TDNN

IBM-Cloud/chatbot-watson-android

NavodPeiris/speechlib

oscarknagg/voicemap

Speaker-Identification/You-Only-Speak-Once

lihanghang/CASR-DEMO

cvqluu/Factorized-TDNN

yeyupiaoling/VoiceprintRecognition-Keras

jefflai108/pytorch-kaldi-neural-speaker-embeddings

Anwarvic/Speaker-Recognition

bjfu-ai-institute/speaker-recognition-papers