speechrecognition

There are 162 repositories under speechrecognition topic.

speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python9.2k 134 1.1k1.4k
speechbrain/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Language:HTML365 42 529
revdotcom/reverb
Open source inference code for Rev's model
Language:Python358 12 1525
robmsmt/KerasDeepSpeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Language:Python241 18 1778
Azure-Samples/SpeechToText-WebSockets-Javascript
SDK & Sample to do speech recognition using websockets in Javascript
Language:TypeScript217 39 65151
SamirPaulb/real-time-voice-translator
A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.
Language:Tcl214 4 456
roshan9419/PersonalAssistantChatbot
It is a personal assistant chatbot, capable to perform many tasks same as Google Assistant plus more extra features...
Language:Python128 3 1048
by2101/OpenASR
A pytorch based end2end speech recognition system.
Language:Python112 2 1224
shangeth/wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
Language:Python89 4 1513
Open-Speech-EkStep/vakyansh-wav2vec2-experimentation
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
Language:Python86 4 3237
goxr3plus/java-google-speech-api
🙊 Speech Recognition , Text To Speech , Google Translate
Language:Java80 12 1134
solyarisoftware/WeBAD
Web Browser Audio Detection/Speech Recording Events API
Language:JavaScript71 3 415
botbahlul/autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Google Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
Language:Python57 3 611
syntithenai/opensnips
Open source projects related to Snips https://snips.ai/.
Language:JavaScript54 8 321
jindongwang/EasyEspnet
Making Espnet easier to use
Language:Python53 5 33
IS2AI/ISSAI_SAIDA_Kazakh_ASR
the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.
Language:Shell48 6 76
rollingstarky/Python-Voice-Assistant
A Python based Voice Assistant like Siri
Language:Python43 4 219
AppleHolic/PytorchSR
Pytorch based phoneme recognition (TIMIT phoneme classification)
Language:Python34 3 05
ng-web-apis/speech
A library for using Web Speech API with Angular
Language:TypeScript32 6 16
botbahlul/pyvosklivesubtitle
PySimpleGUI based DESKTOP APP that can RECOGNIZE any live streaming in 23 languages that supported by VOSK then TRANSLATE (using unofficial online Google Translate API) and display it as LIVE CAPTION / LIVE SUBTITLE
Language:Python28 3 63
Kushal997-das/Pyautogui-module-using-audio
📌 This repo is all about how we implemented pyttsx3,speech_recognition,colored all three modules with pyautogui module.
Language:Python28 3 02
srinivr/kaldi-long-audio-alignment
Long audio alignment using Kaldi
Language:Shell24 4 110
G10DRAS/RoboCop
Artificially Intelligent Machine with Computer Vision, Natural Language Processing, AI, Sense and Feelings.
Language:Python23 5 3514
LinkonBSMRSTU/Speech-To-Text-App-iOS
A simple iOS App that can convert speech/voice into text. Only English voice is supported for now. Used Swift 5, AVKit and Speech.
Language:Swift22 2 13
botbahlul/whisper_autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using faster_whisper module which is a reimplementation of OpenAI Whisper module) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
Language:Python21 3 22
untemps/react-vocal
React component and hook to initiate a SpeechRecognition session
Language:JavaScript20 3 173
franchesoni/s2t
:speaking_head: :keyboard: Speech-to-text on key for Linux
Language:Shell18 3 13
robmsmt/SpeechLoop
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
Language:Python18 4 00
Abhishek-op/SR
💡Kivy-android speech recognition
Language:Python16 3 54
azu/transcript-audio
Transcript your audio files like Podcast using SpeechRecognition and Virtual Audio Device.
Language:TypeScript16 3 22
ShawnPi233/EatecPlayerMaster
食课——PyQt5多功能视频播放器（数据管理、笔记、识别字幕、视频关键词生成）
Language:Python16 1 13
scottykwok/cantonese-selfish-project
Cantonese Selfish Project 廣東話自肥企劃 at PYCON HK 2021
Language:Jupyter Notebook15 1 11
botbahlul/android-autosrt
ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files
Language:Java14 2 21
botbahlul/android-autosrt-v2
ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files using 2 ACTIVITIES
Language:Java14 2 12
Manasvi070902/Meetify
Microsoft Engage program 2021
Language:JavaScript14 2 16
IlyaZaprutski/bluetooth-lamp
Demo project for bluetooth lamp
Language:JavaScript13 2 10

speechrecognition

speechbrain/speechbrain

speechbrain/speechbrain.github.io

revdotcom/reverb

robmsmt/KerasDeepSpeech

Azure-Samples/SpeechToText-WebSockets-Javascript

SamirPaulb/real-time-voice-translator

roshan9419/PersonalAssistantChatbot

by2101/OpenASR

shangeth/wavencoder

Open-Speech-EkStep/vakyansh-wav2vec2-experimentation

goxr3plus/java-google-speech-api

solyarisoftware/WeBAD

botbahlul/autosrt

syntithenai/opensnips

jindongwang/EasyEspnet

IS2AI/ISSAI_SAIDA_Kazakh_ASR

rollingstarky/Python-Voice-Assistant

AppleHolic/PytorchSR

ng-web-apis/speech

botbahlul/pyvosklivesubtitle

Kushal997-das/Pyautogui-module-using-audio

srinivr/kaldi-long-audio-alignment

G10DRAS/RoboCop

LinkonBSMRSTU/Speech-To-Text-App-iOS

botbahlul/whisper_autosrt

untemps/react-vocal

franchesoni/s2t

robmsmt/SpeechLoop

Abhishek-op/SR

azu/transcript-audio

ShawnPi233/EatecPlayerMaster

scottykwok/cantonese-selfish-project

botbahlul/android-autosrt

botbahlul/android-autosrt-v2

Manasvi070902/Meetify

IlyaZaprutski/bluetooth-lamp