asr-model
There are 99 repositories under asr-model topic.
ASR
End-to-End Vietnamese Speech Recognition using wav2vec 2.0
TurkicASR
A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.
rnnt
An implementation of RNN-Transducer loss in TF-2.0.
wav2vec2-fa
fine-tune Wav2vec2. an ASR model released by Facebook
GPT3-text-summarization
Summarization, topic generation using GPT3
quartznet-pytorch
Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]
Bulgarian_Linguistic
Collection and resources for Bulgarian Corpus, Datasets and Models used in ASR, TTS or NLP tasks together with the links of corresponding tools/apps.
indonesian-asr
Automatic speech recognition (ASR) for Indonesian language built by using HTK and Julius. Web interface is built using Node.js.
SpeechLoop
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
WOLOF-ASR-Wav2Vec2
Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.
deepspeech-catala
Deepspeech ASR Model for the Catalan Language
QuartzNet-ASR-pytorch
Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.
End-to-end-ASR-Transformer
An end to end ASR Transformer model training repo
BarkingGPT
Audio to Audio (Whisper+ChatGPT+Bark)
catalan-speech-recognition-benchmark
A benchmark of speech recognition solutions for the Catalan language
LAS-Pytorch
Implementation of the paper "Listen, Attend and Spell" Paper in Pytorch
asr-webapp
ASR Web APP 中文语音识别实验室APP,使用Django构建,包含中文语音转文字与中文语音聊天机器人模块
kaldi-adapt-lm
Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech
Automatic-Speech-Recognition-with-PyTorch
Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming Using Lightning AI ⚡ with Training Scripts
quartznet
QuartzNet implementation for Automatic Speech Recognition task
359-Hours-Indonesian-Speech-Data-by-Mobile-Phone_Reading
Indonesian Speech Dataset
800-Hours-Sichuan-Dialect-Conversational-Speech-Data-by-Mobile-Phone
The dataset of Sichuan dialect conversational speech
Conversational_Speech_Dataset
Mega Conversational Speech Datasets for Speech Recognition
Automatic-Speech-Recognizer
Build end-to-end Deep Neural Network to translate speech to text (ASR model)
whisper-large-v2-atcosim_corpus
A fine-tuned Whisper model (whisper-large-v2) for aviation audio transcription. WER < 5%.
whisper-asr-cli
Automatic Speech Recognition ASR / Speech To Text STT demonstration using Whisper/base model. The cli python application transcribe an audio to text, works offline.
Whisper-ASR-Transcription-Project
Whisper ASR Transcription Project
Nda-Nda-Force-Aligner
Forced alignment of Nda‘ Nda’ a Cameroonian language
SimpleASRmodel
A simple CRDNN based ASR model for my own understanding of how ASR works and are trained. (Work in progress) If anyone finds any error or have any suggestion please do let me know.
ChitongaASR
A natural language processing and machine learning project for a low resource langauge in Zambia.
Interspeech2020-Accented-English-Speech-Recognition-Competition-Data
Interspeech2020 Accented English Speech Recognition Competition Data
300-Hours-Mixed-Speech-with-Korean-and-English-Data-by-Mobile-Phone
Mixed Speech with Korean and English Dataset
200-People-Chinese-Wake-up-Words-Speech-Data-by-Mobile-Phone
Chinese Wake-up Words Speech Dataset
261-Hours-Japanese-Speech-Data-by-Mobile-Phone
Japanese Speech Dataset
240-Hours-Hindi-Speech-Data-by-Mobile-Phone_Reading
Hindi Speech Dataset
176-Hours-Suzhou-Dialect-Speech-Data-by-Mobile-Phone
Suzhou Dialect Speech Dataset