speaker-diarization

There are 125 repositories under speaker-diarization topic.

speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python9.1k 134 1.1k1.4k
espnet/espnet
End-to-End Speech Processing Toolkit
Language:Python8.6k 179 2.4k2.2k
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python7.4k 67 1.2k795
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6.6k 73 1k800
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Language:Jupyter Notebook3.9k 48 212349
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Language:Python2.1k 31 158163
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
1.6k 76 8228
google/uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Language:Python1.6k 101 87320
Purfview/whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
1.4k 34 20371
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Language:Python1.4k 17 113111
juanmc2005/diart
A python package to build AI-powered real-time audio applications
Language:Python1.1k 22 15290
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Language:Python770 16 133124
transcriptionstream/transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
Language:Python766 8 1645
yinruiqing/pyannote-whisper
Language:Python536 20 2091
wq2012/SpectralCluster
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
Language:Python518 19 4573
taylorlu/Speaker-Diarization
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
Language:Python473 14 61120
nuaazs/VAF_2
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
Language:Python403 4 021
google/speaker-id
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
Language:Python386 18 739
hitachi-speech/EEND
End-to-End Neural Diarization
Language:Python381 17 4659
revdotcom/reverb
Open source inference code for Rev's model
Language:Python347 11 1524
manojpamk/pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
Language:Python307 9 1565
DongKeon/Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
236 13 05
cvqluu/TDNN
Time delay neural network (TDNN) implementation in Pytorch using unfold method
Language:Python198 7 340
IBM-Cloud/chatbot-watson-android
An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.
Language:Java195 22 0181
NavodPeiris/speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Language:Python168 3 1515
cvqluu/Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Language:Python144 8 534
cvqluu/simple_diarizer
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Language:Python142 8 1627
yufan-aslp/AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.
Language:Python115 3 1117
Appen/UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Language:Forth100 7 119
Audio-WestlakeU/FS-EEND
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming end-to-end neural diarization with online attractor extraction"
Language:Python96 4 144
yuyq96/D-TDNN
PyTorch implementation of Densely Connected Time Delay Neural Network
Language:Python85 5 1324
cvqluu/GE2E-Loss
Pytorch implementation of Generalized End-to-End Loss for speaker verification
Language:Python83 4 116
nezhar/speech-condenser
A tool for summarizing dialogues from videos or audio
Language:Python80 4 110
FlorianKrey/DNC
Discriminative Neural Clustering for Speaker Diarisation
Language:Python78 9 714
VidyasagarMSC/WatBot
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Language:Java72 10 1753
FrenchKrab/IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
Language:Jupyter Notebook71 5 74

speaker-diarization

speechbrain/speechbrain

espnet/espnet

modelscope/FunASR

pyannote/pyannote-audio

MahmoudAshraf97/whisper-diarization

linto-ai/whisper-timestamped

wq2012/awesome-diarization

google/uis-rnn

Purfview/whisper-standalone-win

modelscope/3D-Speaker

juanmc2005/diart

wenet-e2e/wespeaker

transcriptionstream/transcriptionstream

yinruiqing/pyannote-whisper

wq2012/SpectralCluster

taylorlu/Speaker-Diarization

nuaazs/VAF_2

google/speaker-id

hitachi-speech/EEND

revdotcom/reverb

manojpamk/pytorch_xvectors

DongKeon/Awesome-Speaker-Diarization

cvqluu/TDNN

IBM-Cloud/chatbot-watson-android

NavodPeiris/speechlib

cvqluu/Factorized-TDNN

cvqluu/simple_diarizer

yufan-aslp/AliMeeting

Appen/UHV-OTS-Speech

Audio-WestlakeU/FS-EEND

yuyq96/D-TDNN

cvqluu/GE2E-Loss

nezhar/speech-condenser

FlorianKrey/DNC

VidyasagarMSC/WatBot

FrenchKrab/IS2023-powerset-diarization