vision-ai

There are 33 repositories under vision-ai topic.

  • instill-ai/console

    📺 Instill Console for 🔮 Instill Core: https://github.com/instill-ai/instill-core

    Language:TypeScript4111011
  • yihong1120/YOLOv8-License-Plate-Insights

    This repository demonstrates YOLOv8-based license plate recognition with GCP Vision AI integration, enabling versatile real-world applications like vehicle identification, traffic monitoring, and geospatial analysis while capturing vital media metadata for enhanced insights.

    Language:Jupyter Notebook9224
  • pej0918/SK-RD4AD

    [CVPRW'25] Official Code For "SK-RD4AD: Skip-Connected Reverse Distillation for One-Class Anomaly Detection"

    Language:Python8
  • choudaryhussainali/MCQ_Grading_Bot

    MCQ_Grading_Bot is an AI-powered tool that grades solved MCQ exam sheets from images using Gemini Vision. It extracts student info, checks answers, calculates score, and displays detailed results—all through a simple Gradio interface in Colab.

    Language:Jupyter Notebook4
  • Navy10021/MDDenseResNet

    MDDenseResNet : Enhanced Malware Detection Using DNNs

    Language:Jupyter Notebook3100
  • ShihabYasin/STGAN

    STGAN: A Unified Selective Transfer Network for Arbitrary Image Attribute Editing

    Language:Python310
  • go-park-mail-ru/2023_2_OND_team

    Backend проекта Pinterest команды OND team

    Language:Go2341
  • s59mz/eagle-eye-ai

    Eagle-Eye-AI is a project designed for the Kria KR260 board that enables AI-driven camera tracking and face detection.

    Language:Tcl2102
  • simonyang0608/DeeperSimon

    General vision AI defect detection engine for MLops process/simulations

    Language:Python20
  • CodeLeom/text-in-image

    Detect text in image, using Autogon AI

    Language:JavaScript110
  • dj-ayush/MetaSynAI

    MetaSynAI is an AI‑driven accessibility framework that enables seamless interaction through voice commands, hand gestures, and eye‑tracking, offering a modern and inclusive way to control web interfaces.

    Language:HTML1
  • DrozeNzzz/SK-RD4AD

    [CVPR 2025 Workshop] SK-RD4AD: Skip-Connected Reverse Distillation for One-Class Anomaly Detection

    Language:Python1
  • HotwireRobotics/frc-bumper-vision

    Real-time FRC robot detection via bumper vision using YOLOv8 + Limelight, integrated with AD* pathfinding for dynamic obstacle avoidance. WIP by the programming team!

    Language:Python1
  • instill-ai/.github

    🏡 Instill AI organisation profile and default configuration

  • srvaroa/ai-camera

    People detection and notifications based on the Raspberry Pi + AI Camera

    Language:Python1101
  • Supershivam07/Vision-AI

    Language:Jupyter Notebook1
  • YCSE/nanobanana-mcp

    Gemini Vision & Image Generation MCP for Claude Desktop and Claude Code

    Language:JavaScript1
  • iuliaL/handwriting-2-text-converter

    Using Google Vision AI

    Language:JavaScript0100
  • kckang1103/ScrapeGoats

    Web scraping and machine learning for sentiment analysis over the history of a term's usage on twitter.

    Language:Python0000
  • WhatIsLoveOO/NicolaBlindAssistant

    "Nicola Blind Assistant" — мобільний додаток, який допомагає людям з вадами зору орієнтуватися в просторі, розпізнавати текст, об'єкти та обличчя, використовуючи сучасні технології."

    Language:LLVM0100
  • 0xnomy/SnapQuery

    SnapQuery is a lightweight multimodal AI application that lets you interact with images through natural language. Powered by Groq's high-speed LLMs (LLaMA 4 Scout), it supports visual question answering, image captioning, and general chat.

    Language:Python
  • Aidoni0797/Computer_Vision_Neural_Networks

    Computer_Vision_Neural_Networks

    Language:Python
  • alwaysai/yolov4-object-detector

    Yolov4 ONNX Object Detector

    Language:Python10
  • FaNa-AI/YOLO

    A minimal YOLOv8n-based object detection project using the lightweight Nano version of the model for fast and efficient training and inference on small datasets like coco128.

    Language:Jupyter Notebook
  • industrial-edge/vision-connector-sdk-and-plugin-examples

    SDK to Implement custom Industrial camera connectors for usage in Siemens Vision Connector

    Language:HTML
  • juancarlosqr/vf-vision

    Vision AI in Voiceflow

    Language:JavaScript
  • ksm26/Reasoning-with-o1

    This repository explores OpenAI’s o1 model, a cutting-edge AI designed for abstract reasoning, coding, and vision-based tasks. It provides insights into o1’s strengths, advanced prompting techniques, task delegation, and real-world applications, enabling developers to build intelligent, high-performance AI-driven solutions.

    Language:Jupyter Notebook1
  • MaharshPatelX/qwen-clip-multimodal

    Multimodal Vision-AI: CLIP eyes + Qwen2.5 brain, 155 K-step pipeline & demo.

    Language:Python
  • moses-varghese/Agri-Agentic-Suite

    An open-source suite of three distinct, containerized AI prototypes designed to provide critical decision support for the agricultural sector. The project includes a general query agent, a financial advisor, and a vision-based crop disease diagnostic tool, all powered by local, open-source AI models.

    Language:Python
  • nabeelshan78/yolo-object-detection-pipeline

    An end‑to‑end TensorFlow/Keras implementation of the YOLO object detection pipeline. Load images, run fast and accurate bounding‑box inference, filter and refine predictions and visualize results side‑by‑side - all organized into a clean, modular workflow.

    Language:Jupyter Notebook
  • RealUnfazed/PyCVision

    PyCVision is a Python-based real-time object detection system powered by the YOLOv3 (You Only Look Once) algorithm. This project leverages the efficiency and accuracy of YOLOv3 for detecting and classifying multiple objects in live video streams or static images.

    Language:Python
  • YooSungHyun/Transformer-OCR

    Transformer OCR by Torch Lightning

    Language:Python11