speech-to-text
There are 2737 repositories under speech-to-text topic.
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
leon-ai/leon
🧠 Leon is your open-source personal assistant.
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
TalAter/annyang
:speech_balloon: Speech recognition for your site
jianchang512/pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音
snakers4/silero-models
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
tensorflow/lingvo
Lingvo
toverainc/willow
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
pannous/tensorflow-speech-recognition
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
coqui-ai/STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
mesolitica/NLP-Models-Tensorflow
Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0
kalliope-project/kalliope
Kalliope is a framework that will help you to create your own personal assistant.
jarikomppa/soloud
Free, easy, portable audio engine for games
ahmetoner/whisper-asr-webservice
OpenAI Whisper ASR Webservice API
NVIDIA/OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
DragonComputer/Dragonfire
the open-source virtual assistant for Ubuntu based Linux distributions
sdkcarlos/artyom.js
A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
jianchang512/stt
Voice Recognition to Text Tool / 一个离线运行的本地语音识别转文字服务,输出json、srt字幕带时间戳、纯文字格式
codeforequity-at/botium-speech-processing
Botium Speech Processing
sindresorhus/awesome-whisper
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
modal-labs/quillman
A chat app that transcribes audio in real-time, streams back a response from a language model, and synthesizes this response as natural-sounding speech.
TensorSpeech/TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
mikeyy/nonoCAPTCHA
An asynchronized Python library to automate solving ReCAPTCHA v2 using audio
backmeupplz/voicy
@voicybot Telegram bot main repository