stevenantya's Stars
amanv1906/GENAI-CareerAssistant-Multiagent
GenAI career assistant
olive-robotics/bots_bento_icra24
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
cfanatic/ocr-receipt
Perform optical character recognition on receipts
cheeaun/busrouter-sg
BusRouter SG: Singapore Bus Routes Explorer
dpar39/ppp
A passport photo ID creation tool
DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
espnet/espnet
End-to-End Speech Processing Toolkit
JazminVidal/gop-dnn-epadb
Goodness of Pronunciation using Kaldi on Epa-DB database
jimbozhang/speechocean762
A non-native English corpus for pronunciation scoring task
speechsuper/SpeechSuper-API-Samples
Deep learning based speech and pronunciation assessment API for 8 languages.
azkadev/whisper
Whisper Dart is a cross platform library for dart and flutter that allows converting audio to text / speech to text / inference from Open AI models
innovatorved/whisper.api
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
ltzehan/EE2026-Project
bradtraversy/design-resources-for-developers
Curated list of design and UI resources from stock photos, web templates, CSS frameworks, UI libraries, tools and much more