asr-model

There are 99 repositories under asr-model topic.

ASR
End-to-End Vietnamese Speech Recognition using wav2vec 2.0
93
TurkicASR
A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.
Language:Python59
rnnt
An implementation of RNN-Transducer loss in TF-2.0.
Language:Python45
wav2vec2-fa
fine-tune Wav2vec2. an ASR model released by Facebook
Language:Jupyter Notebook37
GPT3-text-summarization
Summarization, topic generation using GPT3
Language:Jupyter Notebook32
quartznet-pytorch
Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]
Language:Jupyter Notebook26
Bulgarian_Linguistic
Collection and resources for Bulgarian Corpus, Datasets and Models used in ASR, TTS or NLP tasks together with the links of corresponding tools/apps.
Language:Java24
indonesian-asr
Automatic speech recognition (ASR) for Indonesian language built by using HTK and Julius. Web interface is built using Node.js.
Language:Lex20
SpeechLoop
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
Language:Python18
WOLOF-ASR-Wav2Vec2
Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.
Language:Jupyter Notebook17
deepspeech-catala
Deepspeech ASR Model for the Catalan Language
Language:Python17
QuartzNet-ASR-pytorch
Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.
Language:Jupyter Notebook16
End-to-end-ASR-Transformer
An end to end ASR Transformer model training repo
Language:Python13
BarkingGPT
Audio to Audio (Whisper+ChatGPT+Bark)
Language:JavaScript11
catalan-speech-recognition-benchmark
A benchmark of speech recognition solutions for the Catalan language
8
LAS-Pytorch
Implementation of the paper "Listen, Attend and Spell" Paper in Pytorch
Language:Python7
asr-webapp
ASR Web APP 中文语音识别实验室APP，使用Django构建，包含中文语音转文字与中文语音聊天机器人模块
Language:Python6
kaldi-adapt-lm
Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech
Language:Python6
Automatic-Speech-Recognition-with-PyTorch
Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming Using Lightning AI ⚡ with Training Scripts
Language:Python5
quartznet
QuartzNet implementation for Automatic Speech Recognition task
Language:Python4
359-Hours-Indonesian-Speech-Data-by-Mobile-Phone_Reading
Indonesian Speech Dataset
3
800-Hours-Sichuan-Dialect-Conversational-Speech-Data-by-Mobile-Phone
The dataset of Sichuan dialect conversational speech
3
Conversational_Speech_Dataset
Mega Conversational Speech Datasets for Speech Recognition
3
Automatic-Speech-Recognizer
Build end-to-end Deep Neural Network to translate speech to text (ASR model)
Language:Jupyter Notebook3
whisper-large-v2-atcosim_corpus
A fine-tuned Whisper model (whisper-large-v2) for aviation audio transcription. WER < 5%.
Language:Jupyter Notebook2
whisper-asr-cli
Automatic Speech Recognition ASR / Speech To Text STT demonstration using Whisper/base model. The cli python application transcribe an audio to text, works offline.
Language:Python2
Whisper-ASR-Transcription-Project
Whisper ASR Transcription Project
Language:Python2
Nda-Nda-Force-Aligner
Forced alignment of Nda‘ Nda’ a Cameroonian language
Language:Shell2
SimpleASRmodel
A simple CRDNN based ASR model for my own understanding of how ASR works and are trained. (Work in progress) If anyone finds any error or have any suggestion please do let me know.
Language:Jupyter Notebook2
ChitongaASR
A natural language processing and machine learning project for a low resource langauge in Zambia.
Language:Jupyter Notebook2
Interspeech2020-Accented-English-Speech-Recognition-Competition-Data
Interspeech2020 Accented English Speech Recognition Competition Data
2
300-Hours-Mixed-Speech-with-Korean-and-English-Data-by-Mobile-Phone
Mixed Speech with Korean and English Dataset
2
200-People-Chinese-Wake-up-Words-Speech-Data-by-Mobile-Phone
Chinese Wake-up Words Speech Dataset
2
261-Hours-Japanese-Speech-Data-by-Mobile-Phone
Japanese Speech Dataset
2
240-Hours-Hindi-Speech-Data-by-Mobile-Phone_Reading
Hindi Speech Dataset
2
176-Hours-Suzhou-Dialect-Speech-Data-by-Mobile-Phone
Suzhou Dialect Speech Dataset
2