asr-model
There are 94 repositories under asr-model topic.
revdotcom/reverb
Open source inference code for Rev's model
sovaai/sova-asr
SOVA ASR (Automatic Speech Recognition)
at16k/at16k
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
vietai/ASR
End-to-End Vietnamese Speech Recognition using wav2vec 2.0
IS2AI/TurkicASR
A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.
iamjanvijay/rnnt
An implementation of RNN-Transducer loss in TF-2.0.
Hamtech-ai/wav2vec2-fa
fine-tune Wav2vec2. an ASR model released by Facebook
juan-csv/GPT3-text-summarization
Summarization, topic generation using GPT3
oleges1/quartznet-pytorch
Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]
antouanbg/Bulgarian_Linguistic
Collection and resources for Bulgarian Corpus, Datasets and Models used in ASR, TTS or NLP tasks together with the links of corresponding tools/apps.
tifaniwarnita/indonesian-asr
Automatic speech recognition (ASR) for Indonesian language built by using HTK and Julius. Web interface is built using Node.js.
robmsmt/SpeechLoop
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
ccoreilly/deepspeech-catala
Deepspeech ASR Model for the Catalan Language
kingabzpro/WOLOF-ASR-Wav2Vec2
Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.
Kirili4ik/QuartzNet-ASR-pytorch
Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.
MegEngine/End-to-end-ASR-Transformer
An end to end ASR Transformer model training repo
BudEcosystem/BarkingGPT
Audio to Audio (Whisper+ChatGPT+Bark)
ccoreilly/catalan-speech-recognition-benchmark
A benchmark of speech recognition solutions for the Catalan language
KrishnaDN/LAS-Pytorch
Implementation of the paper "Listen, Attend and Spell" Paper in Pytorch
fquirin/kaldi-adapt-lm
Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech
SzLeaves/asr-webapp
ASR Web APP 中文语音识别实验室APP,使用Django构建,包含中文语音转文字与中文语音聊天机器人模块
isadrtdinov/quartznet
QuartzNet implementation for Automatic Speech Recognition task
LuluW8071/Automatic-Speech-Recognition-with-PyTorch
Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming in Lightning AI :zap: with Training Scripts
LaurentVeyssier/Automatic-Speech-Recognizer
Build end-to-end Deep Neural Network to translate speech to text (ASR model)
Nexdata-AI/359-Hours-Indonesian-Speech-Data-by-Mobile-Phone_Reading
Indonesian Speech Dataset
Nexdata-AI/800-Hours-Sichuan-Dialect-Conversational-Speech-Data-by-Mobile-Phone
The dataset of Sichuan dialect conversational speech
Nexdata-AI/Conversational_Speech_Dataset
Mega Conversational Speech Datasets for Speech Recognition
abdeLKabir-56/Whisper-ASR-Transcription-Project
Whisper ASR Transcription Project
alwaz-shahid/whisper-asr-cli
Automatic Speech Recognition ASR / Speech To Text STT demonstration using Whisper/base model. The cli python application transcribe an audio to text, works offline.
daisyyedda/whisper-large-v2-atcosim_corpus
A fine-tuned Whisper model (whisper-large-v2) for aviation audio transcription. WER < 5%.
hammaad2002/SimpleASRmodel
A simple CRDNN based ASR model for my own understanding of how ASR works and are trained. (Work in progress) If anyone finds any error or have any suggestion please do let me know.
kalindasiaminwe/ChitongaASR
A natural language processing and machine learning project for a low resource langauge in Zambia.
Nexdata-AI/200-People-Chinese-Wake-up-Words-Speech-Data-by-Mobile-Phone
Chinese Wake-up Words Speech Dataset
Nexdata-AI/240-Hours-Hindi-Speech-Data-by-Mobile-Phone_Reading
Hindi Speech Dataset
Nexdata-AI/300-Hours-Mixed-Speech-with-Korean-and-English-Data-by-Mobile-Phone
Mixed Speech with Korean and English Dataset
Nexdata-AI/Interspeech2020-Accented-English-Speech-Recognition-Competition-Data
Interspeech2020 Accented English Speech Recognition Competition Data