vision-ai
There are 33 repositories under the vision-ai topic.
instill-ai/console
📺 Instill Console for 🔮 Instill Core: https://github.com/instill-ai/instill-core
yihong1120/YOLOv8-License-Plate-Insights
This repository demonstrates YOLOv8-based license plate recognition with GCP Vision AI integration, enabling versatile real-world applications like vehicle identification, traffic monitoring, and geospatial analysis while capturing vital media metadata for enhanced insights.
pej0918/SK-RD4AD
[CVPRW'25] Official Code For "SK-RD4AD: Skip-Connected Reverse Distillation for One-Class Anomaly Detection"
choudaryhussainali/MCQ_Grading_Bot
MCQ_Grading_Bot is an AI-powered tool that grades solved MCQ exam sheets from images using Gemini Vision. It extracts student info, checks answers, calculates score, and displays detailed results—all through a simple Gradio interface in Colab.
Navy10021/MDDenseResNet
MDDenseResNet: Enhanced Malware Detection Using DNNs
ShihabYasin/STGAN
STGAN: A Unified Selective Transfer Network for Arbitrary Image Attribute Editing
go-park-mail-ru/2023_2_OND_team
Backend of the Pinterest project by OND team
s59mz/eagle-eye-ai
Eagle-Eye-AI is a project designed for the Kria KR260 board that enables AI-driven camera tracking and face detection.
simonyang0608/DeeperSimon
General vision AI defect detection engine for MLOps processes and simulations
CodeLeom/text-in-image
Detect text in images using Autogon AI
dj-ayush/MetaSynAI
MetaSynAI is an AI‑driven accessibility framework that enables seamless interaction through voice commands, hand gestures, and eye‑tracking, offering a modern and inclusive way to control web interfaces.
DrozeNzzz/SK-RD4AD
[CVPR 2025 Workshop] SK-RD4AD: Skip-Connected Reverse Distillation for One-Class Anomaly Detection
HotwireRobotics/frc-bumper-vision
Real-time FRC robot detection via bumper vision using YOLOv8 + Limelight, integrated with AD* pathfinding for dynamic obstacle avoidance. WIP by the programming team!
instill-ai/.github
🏡 Instill AI organisation profile and default configuration
srvaroa/ai-camera
People detection and notifications based on the Raspberry Pi + AI Camera
YCSE/nanobanana-mcp
Gemini Vision & Image Generation MCP for Claude Desktop and Claude Code
iuliaL/handwriting-2-text-converter
Using Google Vision AI
kckang1103/ScrapeGoats
Web scraping and machine learning for sentiment analysis over the history of a term's usage on Twitter.
WhatIsLoveOO/NicolaBlindAssistant
"Nicola Blind Assistant" is a mobile app that helps people with visual impairments navigate their surroundings and recognize text, objects, and faces using modern technologies.
0xnomy/SnapQuery
SnapQuery is a lightweight multimodal AI application that lets you interact with images through natural language. Powered by Groq's high-speed LLMs (LLaMA 4 Scout), it supports visual question answering, image captioning, and general chat.
Aidoni0797/Computer_Vision_Neural_Networks
Computer_Vision_Neural_Networks
alwaysai/yolov4-object-detector
YOLOv4 ONNX Object Detector
FaNa-AI/YOLO
A minimal YOLOv8n-based object detection project using the lightweight Nano version of the model for fast and efficient training and inference on small datasets like coco128.
industrial-edge/vision-connector-sdk-and-plugin-examples
SDK to implement custom industrial camera connectors for use in Siemens Vision Connector
juancarlosqr/vf-vision
Vision AI in Voiceflow
ksm26/Reasoning-with-o1
This repository explores OpenAI’s o1 model, a cutting-edge AI designed for abstract reasoning, coding, and vision-based tasks. It provides insights into o1’s strengths, advanced prompting techniques, task delegation, and real-world applications, enabling developers to build intelligent, high-performance AI-driven solutions.
MaharshPatelX/qwen-clip-multimodal
Multimodal Vision-AI: CLIP eyes + Qwen2.5 brain, 155K-step pipeline & demo.
moses-varghese/Agri-Agentic-Suite
An open-source suite of three distinct, containerized AI prototypes designed to provide critical decision support for the agricultural sector. The project includes a general query agent, a financial advisor, and a vision-based crop disease diagnostic tool, all powered by local, open-source AI models.
nabeelshan78/yolo-object-detection-pipeline
An end‑to‑end TensorFlow/Keras implementation of the YOLO object detection pipeline. Load images, run fast and accurate bounding‑box inference, filter and refine predictions, and visualize results side‑by‑side, all organized into a clean, modular workflow.
RealUnfazed/PyCVision
PyCVision is a Python-based real-time object detection system powered by the YOLOv3 (You Only Look Once) algorithm. This project leverages the efficiency and accuracy of YOLOv3 for detecting and classifying multiple objects in live video streams or static images.
YooSungHyun/Transformer-OCR
Transformer OCR by Torch Lightning