speech-commands
There are 21 repositories under the speech-commands topic.
YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Audio-WestlakeU/audiossl
A library built for easier audio self-supervised training and downstream task evaluation
dobby-seo/Wav2Keyword
Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Speech Commands dataset V1 and V2.
nyumaya/nyumaya_audio_recognition
Classify audio with neural nets on embedded systems like the Raspberry Pi
philsyn/DiffWave-unconditional
PyTorch reimplementation of DiffWave unconditional generation: a high-quality waveform synthesizer.
ace19-dev/tensorflow-speech-recognition-challenge
Kaggle Competitions: TensorFlow Speech Recognition Challenge
htqin/BiFSMN
PyTorch implementation of BiFSMN, IJCAI 2022
isadrtdinov/kws-attention
Attention-based model for keyword spotting
shitian-ni/speech-recognition-transfer-learning
Speech command recognition using DenseNet transfer learning from UrbanSound8K in Keras/TensorFlow
usc-sail/gen-dmcca
Generalized Deep Multiset Canonical Correlation Analysis for Multiview Learning of Speech Representations
danieleninni/small-footprint-keyword-spotting
Effective processing pipeline and advanced neural network architectures for small-footprint keyword spotting
manojsvgit/Voice_Based_Email_For_Blind
A Python-based application designed specifically for visually impaired users, enabling them to seamlessly send and receive emails using intuitive speech commands. This innovative solution enhances accessibility and independence by allowing users to manage their email communication effortlessly, utilizing voice recognition technology to ensure a us.
mryndzionek/kws_cli
Small footprint, standalone, zero dependency, offline keyword spotting (KWS) CLI tool.
tuanio/audio-classification
Audio Classification with AlexNet and Speech Commands dataset
epfluegel/TalkMaths
A Vocola 2 (DNS) extension for creating and editing mathematics (in LaTeX) by voice, using a ZOO interface (Zoomable Online Outliner) such as WorkFlowy or Dynalist.
aminul-huq/Speech_Command_Recognition
Multi-class classification of speech command data. The dataset comes from the Kaggle speech recognition challenge, and the implementation uses PyTorch.
Akash100997/Keyword_Spotting
This project is about spotting a keyword from the Google Speech Commands Dataset.
Bill2015/Speech-Chinese-Model-Agent
A model-based agent for Chinese speech recognition.
reddiedev/197z-kws
Zero-shot keyword spotting with the KWS test dataset using ImageBind
SebastianThomas1/keyword_spotter
Speech recognition of keyword commands
hoang1007/FRIDAY
Female Replacement Intelligent Digital Assistant Youth