asr-model
There are 98 repositories under asr-model topic.
revdotcom/reverb
Open source inference code for Rev's model
sovaai/sova-asr
SOVA ASR (Automatic Speech Recognition)
at16k/at16k
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
vietai/ASR
End-to-End Vietnamese Speech Recognition using wav2vec 2.0
IS2AI/TurkicASR
A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.
iamjanvijay/rnnt
An implementation of RNN-Transducer loss in TF-2.0.
Hamtech-ai/wav2vec2-fa
fine-tune Wav2vec2. an ASR model released by Facebook
juan-csv/GPT3-text-summarization
Summarization, topic generation using GPT3
oleges1/quartznet-pytorch
Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]
antouanbg/Bulgarian_Linguistic
Collection and resources for Bulgarian Corpus, Datasets and Models used in ASR, TTS or NLP tasks together with the links of corresponding tools/apps.
tifaniwarnita/indonesian-asr
Automatic speech recognition (ASR) for Indonesian language built by using HTK and Julius. Web interface is built using Node.js.
robmsmt/SpeechLoop
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
ccoreilly/deepspeech-catala
Deepspeech ASR Model for the Catalan Language
kingabzpro/WOLOF-ASR-Wav2Vec2
Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.
Kirili4ik/QuartzNet-ASR-pytorch
Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.
MegEngine/End-to-end-ASR-Transformer
An end to end ASR Transformer model training repo
BudEcosystem/BarkingGPT
Audio to Audio (Whisper+ChatGPT+Bark)
LuluW8071/Automatic-Speech-Recognition-with-PyTorch
Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming Using PyTorch Lightning⚡
luizomf/sussu
CLI educacional para transcrição com OpenAI Whisper
ccoreilly/catalan-speech-recognition-benchmark
A benchmark of speech recognition solutions for the Catalan language
fquirin/kaldi-adapt-lm
Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech
KrishnaDN/LAS-Pytorch
Implementation of the paper "Listen, Attend and Spell" Paper in Pytorch
Nexdata-AI/359-Hours-Indonesian-Speech-Data-by-Mobile-Phone_Reading
Indonesian Speech Dataset
SzLeaves/asr-webapp
ASR Web APP 中文语音识别实验室APP,使用Django构建,包含中文语音转文字与中文语音聊天机器人模块
djelia-org/djelia-python-sdk
this repo contain packages that allow easy interaction with Djelia api.
hwk06023/SONATA
SONATA (SOund and Narrative Advanced Transcription Assistant): An advanced ASR system that captures human expressions including emotive sounds and non-verbal cues.
isadrtdinov/quartznet
QuartzNet implementation for Automatic Speech Recognition task
Nexdata-AI/Conversational_Speech_Dataset
Mega Conversational Speech Datasets for Speech Recognition
LaurentVeyssier/Automatic-Speech-Recognizer
Build end-to-end Deep Neural Network to translate speech to text (ASR model)
LianjiaTech/bella-whisper
bella-whisper是一系列基于OpenAI Whisper的变体模型,为实现精确的语音识别转写而设计。通过采用数千小时的高质量数据进行微调训练,bella-whisper在多个基准测试中表现出色,特别是在房产经纪领域。
mende237/Nda-Nda-Force-Aligner
Forced alignment of Nda‘ Nda’ a Cameroonian language
Nexdata-AI/347-Hours-Italian-Speech-Data-Collected-by-Mobile-Phone
Italian Speech Dataset
OmeshThokchom/N7speech
Manipuri ASR – A state-of-the-art, low-latency speech-to-text library with advanced voice activity detection and real-time transcription, purpose-built for the Manipuri language.
daisyyedda/whisper-large-v2-atcosim_corpus
A fine-tuned Whisper model (whisper-large-v2) for aviation audio transcription. WER < 5%.
yandex-cloud-examples/yc-speechkit-streams-recognizer
SpeechKit Streaming Recognizer.
yandex-cloud-examples/yc-speechkit-stt-java
Пример использования распознавания речи SpeechKit на Java.