vision-ai

There are 33 repositories under vision-ai topic.

instill-ai/console
📺 Instill Console for 🔮 Instill Core: https://github.com/instill-ai/instill-core
Language:TypeScript41 11 011
yihong1120/YOLOv8-License-Plate-Insights
This repository demonstrates YOLOv8-based license plate recognition with GCP Vision AI integration, enabling versatile real-world applications like vehicle identification, traffic monitoring, and geospatial analysis while capturing vital media metadata for enhanced insights.
Language:Jupyter Notebook9 2 24
pej0918/SK-RD4AD
[CVPRW'25] Official Code For "SK-RD4AD: Skip-Connected Reverse Distillation for One-Class Anomaly Detection"
Language:Python8
choudaryhussainali/MCQ_Grading_Bot
MCQ_Grading_Bot is an AI-powered tool that grades solved MCQ exam sheets from images using Gemini Vision. It extracts student info, checks answers, calculates score, and displays detailed results—all through a simple Gradio interface in Colab.
Language:Jupyter Notebook4
Navy10021/MDDenseResNet
MDDenseResNet : Enhanced Malware Detection Using DNNs
Language:Jupyter Notebook3 1 00
ShihabYasin/STGAN
STGAN: A Unified Selective Transfer Network for Arbitrary Image Attribute Editing
Language:Python3 1 0
go-park-mail-ru/2023_2_OND_team
Backend проекта Pinterest команды OND team
Language:Go2 3 41
s59mz/eagle-eye-ai
Eagle-Eye-AI is a project designed for the Kria KR260 board that enables AI-driven camera tracking and face detection.
Language:Tcl2 1 02
simonyang0608/DeeperSimon
General vision AI defect detection engine for MLops process/simulations
Language:Python20
CodeLeom/text-in-image
Detect text in image, using Autogon AI
Language:JavaScript1 1 0
dj-ayush/MetaSynAI
MetaSynAI is an AI‑driven accessibility framework that enables seamless interaction through voice commands, hand gestures, and eye‑tracking, offering a modern and inclusive way to control web interfaces.
Language:HTML1
DrozeNzzz/SK-RD4AD
[CVPR 2025 Workshop] SK-RD4AD: Skip-Connected Reverse Distillation for One-Class Anomaly Detection
Language:Python1
HotwireRobotics/frc-bumper-vision
Real-time FRC robot detection via bumper vision using YOLOv8 + Limelight, integrated with AD* pathfinding for dynamic obstacle avoidance. WIP by the programming team!
Language:Python1
instill-ai/.github
🏡 Instill AI organisation profile and default configuration
1 11 00
srvaroa/ai-camera
People detection and notifications based on the Raspberry Pi + AI Camera
Language:Python1 1 01
Supershivam07/Vision-AI
Language:Jupyter Notebook1
YCSE/nanobanana-mcp
Gemini Vision & Image Generation MCP for Claude Desktop and Claude Code
Language:JavaScript1
iuliaL/handwriting-2-text-converter
Using Google Vision AI
Language:JavaScript0 1 00
kckang1103/ScrapeGoats
Web scraping and machine learning for sentiment analysis over the history of a term's usage on twitter.
Language:Python0 0 00
WhatIsLoveOO/NicolaBlindAssistant
"Nicola Blind Assistant" — мобільний додаток, який допомагає людям з вадами зору орієнтуватися в просторі, розпізнавати текст, об'єкти та обличчя, використовуючи сучасні технології."
Language:LLVM0 1 00
0xnomy/SnapQuery
SnapQuery is a lightweight multimodal AI application that lets you interact with images through natural language. Powered by Groq's high-speed LLMs (LLaMA 4 Scout), it supports visual question answering, image captioning, and general chat.
Language:Python
Aidoni0797/Computer_Vision_Neural_Networks
Computer_Vision_Neural_Networks
Language:Python
alwaysai/yolov4-object-detector
Yolov4 ONNX Object Detector
Language:Python1 0
drakyanerlanggarizkiwardhana/Stable-Diffusion-With-midjourney4
Language:Jupyter Notebook1 0
FaNa-AI/YOLO
A minimal YOLOv8n-based object detection project using the lightweight Nano version of the model for fast and efficient training and inference on small datasets like coco128.
Language:Jupyter Notebook
industrial-edge/vision-connector-sdk-and-plugin-examples
SDK to Implement custom Industrial camera connectors for usage in Siemens Vision Connector
Language:HTML
juancarlosqr/vf-vision
Vision AI in Voiceflow
Language:JavaScript
ksm26/Reasoning-with-o1
This repository explores OpenAI’s o1 model, a cutting-edge AI designed for abstract reasoning, coding, and vision-based tasks. It provides insights into o1’s strengths, advanced prompting techniques, task delegation, and real-world applications, enabling developers to build intelligent, high-performance AI-driven solutions.
Language:Jupyter Notebook1
MaharshPatelX/qwen-clip-multimodal
Multimodal Vision-AI: CLIP eyes + Qwen2.5 brain, 155 K-step pipeline & demo.
Language:Python
moses-varghese/Agri-Agentic-Suite
An open-source suite of three distinct, containerized AI prototypes designed to provide critical decision support for the agricultural sector. The project includes a general query agent, a financial advisor, and a vision-based crop disease diagnostic tool, all powered by local, open-source AI models.
Language:Python
nabeelshan78/yolo-object-detection-pipeline
An end‑to‑end TensorFlow/Keras implementation of the YOLO object detection pipeline. Load images, run fast and accurate bounding‑box inference, filter and refine predictions and visualize results side‑by‑side - all organized into a clean, modular workflow.
Language:Jupyter Notebook
RealUnfazed/PyCVision
PyCVision is a Python-based real-time object detection system powered by the YOLOv3 (You Only Look Once) algorithm. This project leverages the efficiency and accuracy of YOLOv3 for detecting and classifying multiple objects in live video streams or static images.
Language:Python
YooSungHyun/Transformer-OCR
Transformer OCR by Torch Lightning
Language:Python1 1

vision-ai

instill-ai/console

yihong1120/YOLOv8-License-Plate-Insights

pej0918/SK-RD4AD

choudaryhussainali/MCQ_Grading_Bot

Navy10021/MDDenseResNet

ShihabYasin/STGAN

go-park-mail-ru/2023_2_OND_team

s59mz/eagle-eye-ai

simonyang0608/DeeperSimon

CodeLeom/text-in-image

dj-ayush/MetaSynAI

DrozeNzzz/SK-RD4AD

HotwireRobotics/frc-bumper-vision

instill-ai/.github

srvaroa/ai-camera

Supershivam07/Vision-AI

YCSE/nanobanana-mcp

iuliaL/handwriting-2-text-converter

kckang1103/ScrapeGoats

WhatIsLoveOO/NicolaBlindAssistant

0xnomy/SnapQuery

Aidoni0797/Computer_Vision_Neural_Networks

alwaysai/yolov4-object-detector

drakyanerlanggarizkiwardhana/Stable-Diffusion-With-midjourney4

FaNa-AI/YOLO

industrial-edge/vision-connector-sdk-and-plugin-examples

juancarlosqr/vf-vision

ksm26/Reasoning-with-o1

MaharshPatelX/qwen-clip-multimodal

moses-varghese/Agri-Agentic-Suite

nabeelshan78/yolo-object-detection-pipeline

RealUnfazed/PyCVision

YooSungHyun/Transformer-OCR