speaker-identification
There are 138 repositories under speaker-identification topic.
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
mravanelli/SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
FluidInference/FluidAudio
Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.
HarryVolek/PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
google/speaker-id
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
speechbrain/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Atul-Anand-Jha/Speaker-Identification-Python
Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library
jymsuper/SpeakerRecognition_tutorial
Simple d-vector based Speaker Recognition (verification and identification) using Pytorch
oscarknagg/voicemap
Identifying people from small audio fragments
Speaker-Identification/You-Only-Speak-Once
Deep Learning - one shot learning for speaker recognition using Filter Banks
kaistmm/Audio-Mamba-AuM
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
jefflai108/pytorch-kaldi-neural-speaker-embeddings
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
Warma10032/easytts
打造最简单的TTS前端集合,最简单的有声小说制作工作流。基于正则规则对小说进行分句,基于RoBERTa对小说中的对话进行说话人识别,从而实现一键式生成多人有声小说。多说话人的语音合成,高质量的有声小说制作。
SiavashShams/ssamba
[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
Anwarvic/Speaker-Recognition
This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
Appen/UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
FAKEBOB-adversarial-attack/FAKEBOB
Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)
funcwj/ge2e-speaker-verification
Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"
cvqluu/GE2E-Loss
Pytorch implementation of Generalized End-to-End Loss for speaker verification
nezhar/speech-condenser
A tool for summarizing dialogues from videos or audio
cyrta/voxceleb
mirror of VoxCeleb dataset - a large-scale speaker identification dataset
Wadaboa/titanet
Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO
mjpyeon/wavenet-classifier
Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
CouncilDataProject/speakerbox
Speakerbox: Fine-tune Audio Transformers for speaker identification.
mialrr/Speaker-Recognition
声纹识别(Voiceprint Recognition, VPR),也称为说话人识别(Speaker Recognition),有两类,即说话人辨认(Speaker Identification)和说话人确认(Speaker Verification)
SuperKogito/Voice-based-speaker-identification
:sound: :boy: :girl: :woman: :man: Speaker identification using voice MFCCs and GMM
jojojaeger/whisper-streamlit
this master thesis project is based on OpenAI Whisper with the goal to transcibe interviews
qianhwan/KaldiBasedSpeakerVerification
Kaldi based speaker verification
KrishnaDN/Attentive-Statistics-Pooling-for-Deep-Speaker-Embedding
Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch
swshon/voxceleb-ivector
Voxceleb1 i-vector based speaker recognition system
manthanthakker/speakerIdentificationNeuralNetworks
⇨ The Speaker Recognition System consists of two phases, Feature Extraction and Recognition. ⇨ In the Extraction phase, the Speaker's voice is recorded and typical number of features are extracted to form a model. ⇨ During the Recognition phase, a speech sample is compared against a previously created voice print stored in the database. ⇨ The highlight of the system is that it can identify the Speaker's voice in a Multi-Speaker Environment too. Multi-layer Perceptron (MLP) Neural Network based on error back propagation training algorithm was used to train and test the system. ⇨ The system response time was 74 µs with an average efficiency of 95%.
Picovoice/eagle
On-device speaker recognition engine powered by deep learning
PiotrTa/Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.
mycrazycracy/tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and Tensorflow
PlayVoice/VI-Speaker
Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.
imranparuk/speaker-recognition-3d-cnn
Keras + pyTorch implimentation of "Deep Learning & 3D Convolutional Neural Networks for Speaker Verification"