automatic-speech-recognition
There are 328 repositories under automatic-speech-recognition topic.
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
zzw922cn/awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
zzw922cn/Automatic_Speech_Recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
ahmetoner/whisper-asr-webservice
OpenAI Whisper ASR Webservice API
coqui-ai/STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
kakaobrain/pororo
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
TensorSpeech/TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
FireRedTeam/FireRedASR
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.
snakers4/open_stt
Open STT
jitsi/jiwer
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
shirayu/whispering
Streaming transcriber with whisper
EmulationAI/awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Picovoice/cheetah
On-device streaming speech-to-text engine powered by deep learning
hirofumi0810/neural_sp
End-to-end ASR/LM implementation with PyTorch
YoavRamon/awesome-kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Z-yq/TensorflowASR
一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1
Picovoice/leopard
On-device speech-to-text engine powered by deep learning
jonatasgrosman/huggingsound
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
double22a/speech_dataset
The dataset of Speech Recognition
ArthurFDLR/whisper-youtube
🔉 Youtube Videos Transcription with OpenAI's Whisper
vilassn/whisper_android
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
hirofumi0810/tensorflow_end2end_speech_recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
m3hrdadfi/soxan
Wav2Vec for speech recognition, classification, and audio classification
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
smeetrs/deep_avsr
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
NavodPeiris/speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
bricewalker/Hey-Jetson
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
sovaai/sova-asr
SOVA ASR (Automatic Speech Recognition)
anton-jeran/FAST-RIR
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
CoEDL/elpis
🙊 software for creating speech recognition models.
noco-ai/spellbook-docker
AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models
tugstugi/mongolian-speech-recognition
Mongolian speech recognition with PyTorch
at16k/at16k
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
dangvansam/viet-asr
VietASR - Vietnamese Automatic Speech Recognition
ieasybooks/tafrigh
تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit.ai.
biodatlab/thonburian-whisper
Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo on Huggingface space: