diarization

There are 110 repositories under diarization topic.

Purfview/whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
2.6k 51 274140
R3gm/SoniTranslate
Synchronized Translation for Videos. Video dubbing
Language:Python1.3k 22 159292
transcriptionstream/transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
Language:Python899 9 1756
microsoft/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Language:Python472 17 4774
revdotcom/reverb
Open source inference code for Rev's model
Language:Python433 12 1727
gong-io/gecko
Gecko - A Tool for Effective Annotation of Human Conversations
Language:JavaScript299 15 3144
thewh1teagle/sherpa-rs
Rust bindings to https://github.com/k2-fsa/sherpa-onnx
Language:Rust246 6 5843
SuyashMore/MevonAI-Speech-Emotion-Recognition
Identify the emotion of multiple speakers in an Audio Segment
Language:C178 8 1347
narcotic-sh/senko
Very fast, accurate speaker diarization
Language:Python164 4 312
cvqluu/simple_diarizer
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Language:Python149 7 1632
taresh18/TTSizer
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨
Language:Python12516
desh2608/dover-lap
Python package for combining diarization system outputs.
Language:Python90 5 812
thewh1teagle/pyannote-rs
pyannote audio diarization in rust
Language:Rust81 5 1411
bunyaminergen/Callytics
Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analyze phone conversations from customer service and call centers.
Language:Python71 5 39
wq2012/SimpleDER
A lightweight library to compute Diarization Error Rate (DER).
Language:Python62 3 29
JSchmie/ScrAIbe
Tool for automatic transcription and speaker diarization based on whisper and pyannote.
Language:Python61 2 1418
Picovoice/falcon
On-device speaker diarization powered by deep learning
Language:Python57 9 27
cvqluu/nn-similarity-diarization
Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")
Language:Python44 2 612
jeanjerome/EchoInStone
EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.
Language:Python40 2 86
desh2608/spyder
Simple Python package for fast DER computation
Language:C++35 2 47
empenoso/offline-audio-transcriber
Локальное и бесплатное распознавание речи с помощью OpenAI Whisper. Автоматизируйте расшифровку лекций и совещаний на вашем ПК без облачных сервисов и подписок
Language:Python317
exemplaryai/ai-engine
Easy to use Multi-Provider ASR/Speech To Text and NLP engine
27 3 01
jakariaemon/WSI
Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.
Language:Python24 3 31
chimechallenge/chime-utils
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
Language:Python23 9 04
harmlessman/PAFTS
PAFTS : Library That Preprocessing Audio For TTS.
Language:Python23 1 25
pulijon/Sttcast
Transcription from mp3 files to html with or without embedded player
Language:Jupyter Notebook20 2 25
shahruk10/kaldi-tflite
Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and diarization pipelines to tensorflow models.
Language:Python20 3 13
KaddaOK/TASMAS
Free open-source transcriber and summarizer for file-per-speaker recordings, such as Discord calls recorded by the Craig bot
Language:Python19 2 40
cadia-lvl/kaldi-speaker-diarization
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
Language:Shell17 4 23
mmaudet/speaker-splitter
A Python tool to separate audio files by speaker using diarization data.
Language:Python16 1 23
thewh1teagle/loud.cpp
Whisper.cpp with diarization
Language:C++16 2 74
ElmiraGhorbani/gpt-speaker-diarization
Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.
Language:Jupyter Notebook14 1 01
orianemartin/WhispGrid
A Whisper to TextGrid script that I use to automatize Corpus Annotation on Praat, with speaker diarization.
Language:Python12 1 03
SEERNET/Multi-Speaker-Diarization
Automated Multi Speaker diarization API for meetings, calls, interviews, press-conference etc.
11 1 10
CrispStrobe/Susurrus
speech to text gui for different (mostly Whisper, also Voxtral) models and backends, including whisper.cpp, mlx-whisper, faster-whisper, ctranslate2; applies pyannote for diarization
Language:Python10 1 00
LianaMikael/SpeechDatasets
Large publicly available speech datasets
10 1 00

diarization

Purfview/whisper-standalone-win

R3gm/SoniTranslate

transcriptionstream/transcriptionstream

microsoft/UniSpeech

revdotcom/reverb

gong-io/gecko

thewh1teagle/sherpa-rs

SuyashMore/MevonAI-Speech-Emotion-Recognition

narcotic-sh/senko

cvqluu/simple_diarizer

taresh18/TTSizer

desh2608/dover-lap

thewh1teagle/pyannote-rs

bunyaminergen/Callytics

wq2012/SimpleDER

JSchmie/ScrAIbe

Picovoice/falcon

cvqluu/nn-similarity-diarization

jeanjerome/EchoInStone

desh2608/spyder

empenoso/offline-audio-transcriber

exemplaryai/ai-engine

jakariaemon/WSI

chimechallenge/chime-utils

harmlessman/PAFTS

pulijon/Sttcast

shahruk10/kaldi-tflite

KaddaOK/TASMAS

cadia-lvl/kaldi-speaker-diarization

mmaudet/speaker-splitter

thewh1teagle/loud.cpp

ElmiraGhorbani/gpt-speaker-diarization

orianemartin/WhispGrid

SEERNET/Multi-Speaker-Diarization

CrispStrobe/Susurrus

LianaMikael/SpeechDatasets