asr-model

There are 98 repositories under the asr-model topic.

revdotcom/reverb
Open source inference code for Rev's model
Language: Python 433 12 1727
sovaai/sova-asr
SOVA ASR (Automatic Speech Recognition)
Language: Python 173 12 2422
at16k/at16k
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
Language: Python 130 11 1118
vietai/ASR
End-to-End Vietnamese Speech Recognition using wav2vec 2.0
103 3 210
IS2AI/TurkicASR
A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.
Language: Python 73 6 39
iamjanvijay/rnnt
An implementation of RNN-Transducer loss in TF-2.0.
Language: Python 46 3 79
Hamtech-ai/wav2vec2-fa
Fine-tune Wav2vec2, an ASR model released by Facebook
Language: Jupyter Notebook 38 2 15
juan-csv/GPT3-text-summarization
Summarization, topic generation using GPT3
Language: Jupyter Notebook 33 2 310
oleges1/quartznet-pytorch
QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261]
Language: Jupyter Notebook 27 3 37
antouanbg/Bulgarian_Linguistic
Collection and resources for Bulgarian Corpus, Datasets and Models used in ASR, TTS or NLP tasks together with the links of corresponding tools/apps.
Language: Java 25 4 02
tifaniwarnita/indonesian-asr
Automatic speech recognition (ASR) for Indonesian language built by using HTK and Julius. Web interface is built using Node.js.
Language: Lex 21 8 15
robmsmt/SpeechLoop
Many ASRs under one roof, with benchmarking, answering the question: what is the best ASR for my dataset?
Language: Python 19 3 00
ccoreilly/deepspeech-catala
Deepspeech ASR Model for the Catalan Language
Language: Python 17 5 10
kingabzpro/WOLOF-ASR-Wav2Vec2
Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.
Language: Jupyter Notebook 17 1 08
Kirili4ik/QuartzNet-ASR-pytorch
Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTorch with CTC loss and beam search.
Language: Jupyter Notebook 16 3 23
MegEngine/End-to-end-ASR-Transformer
An end to end ASR Transformer model training repo
Language: Python 13 2 03
BudEcosystem/BarkingGPT
Audio to Audio (Whisper+ChatGPT+Bark)
Language: JavaScript 11 2 03
LuluW8071/Automatic-Speech-Recognition-with-PyTorch
Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming Using PyTorch Lightning⚡
Language: Python 10 1 92
luizomf/sussu
Educational CLI for transcription with OpenAI Whisper
Language: Python 9
ccoreilly/catalan-speech-recognition-benchmark
A benchmark of speech recognition solutions for the Catalan language
8 2 00
fquirin/kaldi-adapt-lm
Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech
Language: Python 7 2 12
KrishnaDN/LAS-Pytorch
Implementation of the paper "Listen, Attend and Spell" in PyTorch
Language: Python 7 1 03
Nexdata-AI/359-Hours-Indonesian-Speech-Data-by-Mobile-Phone_Reading
Indonesian Speech Dataset
6 1 02
SzLeaves/asr-webapp
ASR Web App: a Chinese speech recognition lab app built with Django, including Chinese speech-to-text and Chinese voice chatbot modules
Language: Python 6 2 01
djelia-org/djelia-python-sdk
This repo contains packages that allow easy interaction with the Djelia API.
Language: Python 5 0 12
hwk06023/SONATA
SONATA (SOund and Narrative Advanced Transcription Assistant): An advanced ASR system that captures human expressions including emotive sounds and non-verbal cues.
Language: Python 41
isadrtdinov/quartznet
QuartzNet implementation for Automatic Speech Recognition task
Language: Python 4 1 02
Nexdata-AI/Conversational_Speech_Dataset
Mega Conversational Speech Datasets for Speech Recognition
4 1 00
LaurentVeyssier/Automatic-Speech-Recognizer
Build an end-to-end deep neural network to translate speech to text (ASR model)
Language: Jupyter Notebook 3 1 0
LianjiaTech/bella-whisper
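bella-whisper is a series of variant models based on OpenAI Whisper, designed for accurate speech recognition and transcription. Fine-tuned on thousands of hours of high-quality data, bella-whisper performs well on multiple benchmarks, especially in the real estate brokerage domain.
Language: Python 34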
mende237/Nda-Nda-Force-Aligner
Forced alignment of Nda‘ Nda’, a Cameroonian language
Language: Shell 3 1 00
Nexdata-AI/347-Hours-Italian-Speech-Data-Collected-by-Mobile-Phone
Italian Speech Dataset
3 1 0
OmeshThokchom/N7speech
Manipuri ASR – A state-of-the-art, low-latency speech-to-text library with advanced voice activity detection and real-time transcription, purpose-built for the Manipuri language.
Language: Python 3
daisyyedda/whisper-large-v2-atcosim_corpus
A fine-tuned Whisper model (whisper-large-v2) for aviation audio transcription. WER < 5%.
Language: Jupyter Notebook 2 1 00
yandex-cloud-examples/yc-speechkit-streams-recognizer
SpeechKit Streaming Recognizer.
Language: Python 2 4 0
yandex-cloud-examples/yc-speechkit-stt-java
Example of using SpeechKit speech recognition in Java.
Language: Java 2 3 00
