speaker-diarization
There are 103 repositories under speaker-diarization topic.
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
espnet/espnet
End-to-End Speech Processing Toolkit
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
google/uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
juanmc2005/diart
A python package to build AI-powered real-time audio applications
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
transcriptionstream/transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
wq2012/SpectralCluster
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
taylorlu/Speaker-Diarization
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
nuaazs/VAF_2
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
hitachi-speech/EEND
End-to-End Neural Diarization
google/speaker-id
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
manojpamk/pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
cvqluu/TDNN
Time delay neural network (TDNN) implementation in Pytorch using unfold method
IBM-Cloud/chatbot-watson-android
An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.
cvqluu/Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
DongKeon/Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
cvqluu/simple_diarizer
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
yufan-aslp/AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.
NavodPeiris/speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Appen/UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
yuyq96/D-TDNN
PyTorch implementation of Densely Connected Time Delay Neural Network
cvqluu/GE2E-Loss
Pytorch implementation of Generalized End-to-End Loss for speaker verification
FlorianKrey/DNC
Discriminative Neural Clustering for Speaker Diarisation
nezhar/speech-condenser
A tool for summarizing dialogues from videos or audio
VidyasagarMSC/WatBot
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Audio-WestlakeU/FS-EEND
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]
vishalshar/SpeakerDiarization_RNN_CNN_LSTM
Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should state when speaker starts and ends. In this project, we analyze given audio file with 2 channels and 2 speakers (on separate channels).
wq2012/SimpleDER
A lightweight library to compute Diarization Error Rate (DER).
FrenchKrab/IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.