vad
There are 98 repositories under the vad topic.
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
smacke/ffsubsync
Automagically synchronize subtitles with video.
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
CheshireCC/faster-whisper-GUI
faster_whisper GUI with PySide6
k2-fsa/sherpa-ncnn
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Supports iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A, etc.
jtkim-kaist/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
amsehili/auditok
An audio/acoustic activity detection and audio segmentation tool
DmitryRyumin/ICASSP-2023-24-Papers
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
filippogiruzzi/voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
gtreshchev/RuntimeAudioImporter
Runtime Audio Importer plugin for Unreal Engine. Importing audio of various formats at runtime.
Baidu-AIP/speech-vad-demo
Integrates WebRTC's VAD to split audio files
shashikg/WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines
EtienneAb3d/WhisperHallu
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
gkonovalov/android-vad
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
eesungkim/Voice_Activity_Detector
A statistical model-based Voice Activity Detection
Picovoice/cobra
On-device voice activity detection (VAD) powered by deep learning
xiongyihui/python-webrtc-audio-processing
Python bindings of WebRTC Audio Processing
voithru/voice-activity-detection
PyTorch implementation of Self-Attentive VAD, ICASSP 2021
0vercl0k/sic
Enumerate user mode shared memory mappings on Windows.
fjchange/object_centric_VAD
A TensorFlow re-implementation of the CVPR 2019 paper "Object-centric Auto-Encoders and Dummy Anomalies for Abnormal Event Detection in Video"
xia-chu/webrtc_apm
Extraction of the APM-related code from WebRTC, including AEC/NS/AGC/VAD, plus MP3/AAC encoders and SoundTouch
NickWilkinson37/voxseg
A Python library for voice activity detection (VAD) for speech/non-speech segmentation.
spokestack/spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
EtienneAb3d/karaok-AI
Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)
lef-fan/aria
A local and uncensored AI entity.
mgonzs13/whisper_ros
Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2
mochi-neko/voice-activity-detection-unity
A voice activity detection (VAD) library for Unity.
mounalab/LSTM-RNN-VAD
Voice Activity Detection LSTM-RNN learning model
HadreamOrg/HadreamAssistant
HadreamAssistant, your smart-home/custom voice assistant, supporting Raspberry Pi/Linux
spokestack/spokestack-ios
Spokestack: give your iOS app a voice interface!
sooftware/End-to-End-Speech-Recognition-Models
PyTorch implementation of automatic speech recognition models.
shanghaimoon888/mod_vadasr
A FreeSWITCH module that performs VAD and ASR using the iFLYTEK WebSocket API.
ideastudios/Vad
Android VAD library based on WebRTC VAD
OzymandiasTheGreat/libfvad-wasm
Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeScript
baabaaox/go-webrtcvad
WebRTC Voice Activity Detection for Golang
vpegasus/xuesebot
A chatbot about 血色衣冠, built on Rasa, that supports voice conversation with the bot