Pinned Repositories
LPRNet
LPRNet re-written with pytorch-lightning
whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
deep-text-recognition-benchmark
PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)
GigaAM
Foundational Model for Speech Recognition Tasks
dataset_light
solid-start
SolidStart, the Solid app framework
flowbite
Open-source UI component library and front-end development framework based on Tailwind CSS