speaker-identification

There are 138 repositories under speaker-identification topic.

  • alphacep/vosk-api

    Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

    Language:Jupyter Notebook13.2k1331.7k1.6k
  • mravanelli/SincNet

    SincNet is a neural architecture for efficiently processing raw audio samples.

    Language:Python1.2k33106269
  • FluidAudio

    FluidInference/FluidAudio

    Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.

    Language:Swift626
  • HarryVolek/PyTorch_Speaker_Verification

    PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

    Language:Python5891974165
  • google/speaker-id

    This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

    Language:Python43217840
  • speechbrain/speechbrain.github.io

    The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

    Language:HTML37140531
  • Atul-Anand-Jha/Speaker-Identification-Python

    Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library

    Language:Python211123077
  • jymsuper/SpeakerRecognition_tutorial

    Simple d-vector based Speaker Recognition (verification and identification) using Pytorch

    Language:Python211102146
  • oscarknagg/voicemap

    Identifying people from small audio fragments

    Language:Python1716773
  • Speaker-Identification/You-Only-Speak-Once

    Deep Learning - one shot learning for speaker recognition using Filter Banks

    Language:Jupyter Notebook1695941
  • kaistmm/Audio-Mamba-AuM

    Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"

    Language:Python1527515
  • jefflai108/pytorch-kaldi-neural-speaker-embeddings

    A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.

    Language:Perl1369634
  • Warma10032/easytts

    打造最简单的TTS前端集合,最简单的有声小说制作工作流。基于正则规则对小说进行分句,基于RoBERTa对小说中的对话进行说话人识别,从而实现一键式生成多人有声小说。多说话人的语音合成,高质量的有声小说制作。

    Language:Python13110
  • SiavashShams/ssamba

    [SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model

    Language:Python1267711
  • Anwarvic/Speaker-Recognition

    This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1

    Language:Python11331533
  • Appen/UHV-OTS-Speech

    A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

    Language:Forth1077119
  • FAKEBOB-adversarial-attack/FAKEBOB

    Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)

    Language:Python10461629
  • funcwj/ge2e-speaker-verification

    Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"

    Language:Python1035525
  • cvqluu/GE2E-Loss

    Pytorch implementation of Generalized End-to-End Loss for speaker verification

    Language:Python863116
  • nezhar/speech-condenser

    A tool for summarizing dialogues from videos or audio

    Language:Python824110
  • cyrta/voxceleb

    mirror of VoxCeleb dataset - a large-scale speaker identification dataset

    Language:Shell733419
  • Wadaboa/titanet

    Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO

    Language:Jupyter Notebook651713
  • mjpyeon/wavenet-classifier

    Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks

    Language:Python645312
  • CouncilDataProject/speakerbox

    Speakerbox: Fine-tune Audio Transformers for speaker identification.

    Language:Python596166
  • mialrr/Speaker-Recognition

    声纹识别(Voiceprint Recognition, VPR),也称为说话人识别(Speaker Recognition),有两类,即说话人辨认(Speaker Identification)和说话人确认(Speaker Verification)

    Language:Python57249
  • SuperKogito/Voice-based-speaker-identification

    :sound: :boy: :girl: :woman: :man: Speaker identification using voice MFCCs and GMM

    Language:Python543115
  • jojojaeger/whisper-streamlit

    this master thesis project is based on OpenAI Whisper with the goal to transcibe interviews

    Language:Python471717
  • qianhwan/KaldiBasedSpeakerVerification

    Kaldi based speaker verification

    Language:C++476419
  • KrishnaDN/Attentive-Statistics-Pooling-for-Deep-Speaker-Embedding

    Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch

    Language:Python452310
  • swshon/voxceleb-ivector

    Voxceleb1 i-vector based speaker recognition system

    Language:Perl432111
  • manthanthakker/speakerIdentificationNeuralNetworks

    ⇨ The Speaker Recognition System consists of two phases, Feature Extraction and Recognition. ⇨ In the Extraction phase, the Speaker's voice is recorded and typical number of features are extracted to form a model. ⇨ During the Recognition phase, a speech sample is compared against a previously created voice print stored in the database. ⇨ The highlight of the system is that it can identify the Speaker's voice in a Multi-Speaker Environment too. Multi-layer Perceptron (MLP) Neural Network based on error back propagation training algorithm was used to train and test the system. ⇨ The system response time was 74 µs with an average efficiency of 95%.

    Language:MATLAB394220
  • Picovoice/eagle

    On-device speaker recognition engine powered by deep learning

    Language:Python379186
  • PiotrTa/Huawei-Challenge-Speaker-Identification

    Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.

    Language:Jupyter Notebook363010
  • mycrazycracy/tf-kaldi-speaker

    Neural speaker recognition/verification system based on Kaldi and Tensorflow

    Language:Python3221716
  • PlayVoice/VI-Speaker

    Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.

    Language:Python30024
  • imranparuk/speaker-recognition-3d-cnn

    Keras + pyTorch implimentation of "Deep Learning & 3D Convolutional Neural Networks for Speaker Verification"

    Language:Python293612