speech-activity-detection

There are 17 repositories under the speech-activity-detection topic.

pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language: Jupyter Notebook 8.3k 78 1k938
jtkim-kaist/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Language: MATLAB 863 45 40235
ina-foss/inaSpeechSegmenter
CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
Language: Python 835 23 76141
RicherMans/GPV
Repository for our Interspeech 2020 general-purpose voice activity detection (GPVAD) paper
Language: Python 142 4 929
RicherMans/Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
Language: Python 94 7 1623
bigcash/awesome-vad
A curated list of awesome voice activity detection
62 5 04
HHousen/speaker-change-detection
Speaker change detection using SincNet and an LSTM/Transformer
Language: Jupyter Notebook 53 3 76
jsvir/vad
[Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection
Language: Python 35 2 83
vimalmanohar/kaldi
Fork of the official Kaldi repository.
Language: Shell 22 6 02
AmirHoseein99/Depression-Engine
Detecting depressed patients based on speech activity and pauses in speech, using a deep learning approach
Language: Python 20 1 05
idiap/zff_vad
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
Language: Python 19 6 01
dangvansam/pyannote-onnx
PyAnnote Voice Activity Detection (ONNX version)
Language: Jupyter Notebook 18 0 04
ina-foss/InaGVAD
Voice activity detection and speaker gender segmentation audiovisual corpus
Language: Jupyter Notebook 15 3 11
rafaelgreca/voxseg-pytorch
The Voxseg implementation in PyTorch. Voxseg is a Python library for voice activity detection (VAD), i.e. speech/non-speech segmentation.
Language: Python 12 2 34
aditya-joglekar/FS02_Scoring_Toolkit
Scoring Toolkit for the Fearless Steps Challenge Phase-02 Tasks
Language: Python 6 2 23
KF-R/turk-chat
Lightweight speech-to-speech web-based chat app combining speech recognition, LLM completion and text-to-speech. Implemented with Python (Flask) and vanilla JavaScript only.
Language: Python 3 1 00
sajR/V-SAD
Language: Python 1 1 00
