gpt-4v
There are 19 repositories under the gpt-4v topic.
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal chat model approaching GPT-4V performance
open-compass/VLMEvalKit
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting GPT-4V, Gemini, QwenVLPlus, 50+ HF models, and 20+ benchmarks
davideuler/awesome-assistant-api
Try OpenAI Assistant API apps on Google Colab for free. Awesome Assistant API demos!
tianyi-lab/HallusionBench
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
Denis2054/Transformers-for-NLP-and-Computer-Vision-3rd-Edition
Transformers 3rd Edition
yachty66/gpt_pdf_md
🚀 gpt_pdf_md: Convert PDF to Markdown with GPT-4V & more. Extract images, upload to Google Cloud, & generate Markdown with images. Python, GPT-4V Vision, Scala. Ideal for developers, researchers. PDF to Markdown, GPT-4V, image extraction, Python package
taogoddd/GPT-4V-API
Self-hosted GPT-4V API
jameszhou-gl/gpt-4v-distribution-shift
Code for "How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation"
autodistill/autodistill-gpt-4v
GPT-4V(ision) module for use with Autodistill.
roboflow/gpt-checkup
Monitor the performance of OpenAI's GPT-4V model over time.
logicalroot/gpt-4v-demos
🤖 GPT-4V Demos • Test the model's vision capabilities in your browser using Streamlit • Easy setup
android-com-pl/wp-ai-alt-generator
WordPress plugin that leverages OpenAI's Vision API to automatically generate descriptive alt text for images, enhancing accessibility and SEO.
afonso07/ruskin
Your own personal Ruskin.
aymenfurter/copilot-insurance-claim-demo
How a Picture of Car Damage Can File Your Insurance Claim
aymenfurter/azure-chat-with-your-photos-demo
Chatbot that comprehends uploaded images and engages in detailed conversations about their content.
gutbash/lmm-graph-vision
How well do the GPT-4V, Gemini Pro Vision, and Claude 3 Opus models perform zero-shot vision tasks on data structures?
metatatt/003-wireDiagramReader
Wiring Diagram Reader: Use GPT-4V to interpret electrical diagrams. Simplifying complex schematics for seamless high-level understanding.
ndurner/oai_chat
Multi-modal Chatbot based on OpenAI
zaidmukaddam/nmap-vision
NMAP Scan Analysis powered by GPT-4V and GPT-4 Turbo!