speaker-identification

There are 138 repositories under speaker-identification topic.

alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Language:Jupyter Notebook13.2k 133 1.7k1.6k
mravanelli/SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
Language:Python1.2k 33 106269
FluidInference/FluidAudio
Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.
Language:Swift626
HarryVolek/PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
Language:Python589 19 74165
google/speaker-id
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
Language:Python432 17 840
speechbrain/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Language:HTML371 40 531
Atul-Anand-Jha/Speaker-Identification-Python
Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library
Language:Python211 12 3077
jymsuper/SpeakerRecognition_tutorial
Simple d-vector based Speaker Recognition (verification and identification) using Pytorch
Language:Python211 10 2146
oscarknagg/voicemap
Identifying people from small audio fragments
Language:Python171 6 773
Speaker-Identification/You-Only-Speak-Once
Deep Learning - one shot learning for speaker recognition using Filter Banks
Language:Jupyter Notebook169 5 941
kaistmm/Audio-Mamba-AuM
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
Language:Python152 7 515
jefflai108/pytorch-kaldi-neural-speaker-embeddings
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
Language:Perl136 9 634
Warma10032/easytts
打造最简单的TTS前端集合，最简单的有声小说制作工作流。基于正则规则对小说进行分句，基于RoBERTa对小说中的对话进行说话人识别，从而实现一键式生成多人有声小说。多说话人的语音合成，高质量的有声小说制作。
Language:Python13110
SiavashShams/ssamba
[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
Language:Python126 7 711
Anwarvic/Speaker-Recognition
This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
Language:Python113 3 1533
Appen/UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Language:Forth107 7 119
FAKEBOB-adversarial-attack/FAKEBOB
Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)
Language:Python104 6 1629
funcwj/ge2e-speaker-verification
Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"
Language:Python103 5 525
cvqluu/GE2E-Loss
Pytorch implementation of Generalized End-to-End Loss for speaker verification
Language:Python86 3 116
nezhar/speech-condenser
A tool for summarizing dialogues from videos or audio
Language:Python82 4 110
cyrta/voxceleb
mirror of VoxCeleb dataset - a large-scale speaker identification dataset
Language:Shell73 3 419
Wadaboa/titanet
Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO
Language:Jupyter Notebook65 1 713
mjpyeon/wavenet-classifier
Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
Language:Python64 5 312
CouncilDataProject/speakerbox
Speakerbox: Fine-tune Audio Transformers for speaker identification.
Language:Python59 6 166
mialrr/Speaker-Recognition
声纹识别(Voiceprint Recognition, VPR)，也称为说话人识别(Speaker Recognition)，有两类，即说话人辨认(Speaker Identification)和说话人确认(Speaker Verification)
Language:Python57 2 49
SuperKogito/Voice-based-speaker-identification
:sound: :boy: :girl: :woman: :man: Speaker identification using voice MFCCs and GMM
Language:Python54 3 115
jojojaeger/whisper-streamlit
this master thesis project is based on OpenAI Whisper with the goal to transcibe interviews
Language:Python47 1 717
qianhwan/KaldiBasedSpeakerVerification
Kaldi based speaker verification
Language:C++47 6 419
KrishnaDN/Attentive-Statistics-Pooling-for-Deep-Speaker-Embedding
Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch
Language:Python45 2 310
swshon/voxceleb-ivector
Voxceleb1 i-vector based speaker recognition system
Language:Perl43 2 111
manthanthakker/speakerIdentificationNeuralNetworks
⇨ The Speaker Recognition System consists of two phases, Feature Extraction and Recognition. ⇨ In the Extraction phase, the Speaker's voice is recorded and typical number of features are extracted to form a model. ⇨ During the Recognition phase, a speech sample is compared against a previously created voice print stored in the database. ⇨ The highlight of the system is that it can identify the Speaker's voice in a Multi-Speaker Environment too. Multi-layer Perceptron (MLP) Neural Network based on error back propagation training algorithm was used to train and test the system. ⇨ The system response time was 74 µs with an average efficiency of 95%.
Language:MATLAB39 4 220
Picovoice/eagle
On-device speaker recognition engine powered by deep learning
Language:Python37 9 186
PiotrTa/Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.
Language:Jupyter Notebook36 3 010
mycrazycracy/tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and Tensorflow
Language:Python32 2 1716
PlayVoice/VI-Speaker
Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.
Language:Python30 0 24
imranparuk/speaker-recognition-3d-cnn
Keras + pyTorch implimentation of "Deep Learning & 3D Convolutional Neural Networks for Speaker Verification"
Language:Python29 3 612

speaker-identification

alphacep/vosk-api

mravanelli/SincNet

FluidInference/FluidAudio

HarryVolek/PyTorch_Speaker_Verification

google/speaker-id

speechbrain/speechbrain.github.io

Atul-Anand-Jha/Speaker-Identification-Python

jymsuper/SpeakerRecognition_tutorial

oscarknagg/voicemap

Speaker-Identification/You-Only-Speak-Once

kaistmm/Audio-Mamba-AuM

jefflai108/pytorch-kaldi-neural-speaker-embeddings

Warma10032/easytts

SiavashShams/ssamba

Anwarvic/Speaker-Recognition

Appen/UHV-OTS-Speech

FAKEBOB-adversarial-attack/FAKEBOB

funcwj/ge2e-speaker-verification

cvqluu/GE2E-Loss

nezhar/speech-condenser

cyrta/voxceleb

Wadaboa/titanet

mjpyeon/wavenet-classifier

CouncilDataProject/speakerbox

mialrr/Speaker-Recognition

SuperKogito/Voice-based-speaker-identification

jojojaeger/whisper-streamlit

qianhwan/KaldiBasedSpeakerVerification

KrishnaDN/Attentive-Statistics-Pooling-for-Deep-Speaker-Embedding

swshon/voxceleb-ivector

manthanthakker/speakerIdentificationNeuralNetworks

Picovoice/eagle

PiotrTa/Huawei-Challenge-Speaker-Identification

mycrazycracy/tf-kaldi-speaker

PlayVoice/VI-Speaker

imranparuk/speaker-recognition-3d-cnn