gpt4vision
There are 14 repositories under the gpt4vision topic.
AmberSahdev/Open-Interface
Control Any Computer Using LLMs
soulteary/amazing-openai-api
Convert different model APIs into the OpenAI API format out of the box.
kyegomez/Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OPENAI
TIGER-AI-Lab/VIEScore
Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024 main)
mostafasadeghi97/auto-pilot-computer
This is a tool that uses GPT-4 Vision to operate your computer.
Azure-Samples/rag-as-a-service-with-vision
This repository offers a Python framework for a retrieval-augmented generation (RAG) pipeline using text and images from MHTML documents, leveraging Azure AI and OpenAI services. It includes ingestion and enrichment flows, a RAG with Vision pipeline, and evaluation tools.
mapluisch/GPT-4-Vision-for-HoloLens
Capture images with HoloLens and receive descriptive responses from OpenAI's GPT-4V(ision).
afonso07/ruskin
Your own personal Ruskin.
Envedity/DAIA
Digital Artificial Intelligence Agent
HaimOzer123/Automated-Construction-Site-Inspector
Developed an IoT-based construction site inspector using a Raspberry Pi 4 that autonomously navigates and inspects construction sites. The system features two DC motors for line-following and a servo-mounted ultrasonic sensor for real-time obstacle detection.
nectariferous/gpt4all-webui
A web-based user interface for GPT4All, set up to be hosted on GitHub Pages, that lets users interact with the model through a browser. It uses Flask for the backend and modern HTML/CSS/JavaScript for the frontend.
yunwoong7/VisionQuery-GPT-4v
VisionQuery GPT-4v is a cutting-edge tool that combines screenshot-based queries with OpenAI's GPT-4. It enables users to capture screens, ask questions, and receive insightful answers from GPT-4v, revolutionizing digital interaction and understanding.
hoangv97/camerAI
Camera powered with AI on the web