speech-activity-detection

There are 17 repositories under the speech-activity-detection topic.

  • pyannote/pyannote-audio

    Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

    Language: Jupyter Notebook
  • jtkim-kaist/VAD

    Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

    Language: MATLAB
  • ina-foss/inaSpeechSegmenter

    CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.

    Language: Python
  • RicherMans/GPV

    Repository for our Interspeech 2020 general-purpose voice activity detection (GPVAD) paper

    Language: Python
  • RicherMans/Datadriven-GPVAD

    The codebase for Data-driven general-purpose voice activity detection.

    Language: Python
  • bigcash/awesome-vad

    A curated list of awesome voice activity detection

  • HHousen/speaker-change-detection

    Speaker change detection using SincNet and an LSTM/Transformer

    Language: Jupyter Notebook
  • jsvir/vad

    [Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection

    Language: Python
  • vimalmanohar/kaldi

    Fork of the official kaldi.

    Language: Shell
  • AmirHoseein99/Depression-Engine

    Detecting depressed patients based on speech activity and pauses in speech, using a deep learning approach

    Language: Python
  • idiap/zff_vad

    Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering

    Language: Python
  • dangvansam/pyannote-onnx

    PyAnnote Voice Activity Detection (ONNX version)

    Language: Jupyter Notebook
  • ina-foss/InaGVAD

    Voice activity detection and speaker gender segmentation audiovisual corpus

    Language: Jupyter Notebook
  • rafaelgreca/voxseg-pytorch

    The Voxseg implementation in PyTorch. Voxseg is a Python library for voice activity detection (VAD) for speech/non-speech segmentation.

    Language: Python
  • aditya-joglekar/FS02_Scoring_Toolkit

    Scoring Toolkit for the Fearless Steps Challenge Phase-02 Tasks

    Language: Python
  • KF-R/turk-chat

    Lightweight speech-to-speech web-based chat app combining speech recognition, LLM completion and text-to-speech. Implemented with Python (Flask) and vanilla JavaScript only.

    Language: Python
  • sajR/V-SAD

    Language: Python