gpt-vision

There are 25 repositories under gpt-vision topic.

  • Lambdua/openai4j

    Java client library for OpenAI API.Full support for all OpenAI API models including Completions, Chat, Edits, Embeddings, Audio, Files, Assistants-v2, Images, Moderations, Batch, and Fine-tuning.

    Language:Java35995235
  • speak-gpt

    AndraxDev/speak-gpt

    Your personal voice assistant based on OpenAI ChatGPT.

    Language:Kotlin2991112958
  • libraryofcelsus/Aetherius_AI_Assistant

    A completely private, locally-operated Ai Assistant/Chatbot/Sub-Agent Framework with realistic Long Term Memory and thought formation using Open Source LLMs. Qdrant is used for the Vector DB.

    Language:Python28018635
  • ZhiShuYun/HubFrontend

    集成 GPT 问答、Midjourney 绘画等一站式服务的系统

    Language:Vue1646535
  • fingerthief/minimal-chat

    MinimalChat is a lightweight, open-source chat application that allows you to interact with various large language models.

    Language:Vue14935922
  • matrix_chatgpt_bot

    hibobmaster/matrix_chatgpt_bot

    A simple matrix bot that supports image generation and chatting using ChatGPT

    Language:Python8134218
  • speak-gpt-web

    AndraxDev/speak-gpt-web

    Web version of SpeakGPT created using ReactJS and Google Material Design 3.

    Language:JavaScript211211
  • arshad-yaseen/pictocode

    Convert Screenshots 📸 into Code 🧑‍💻

    Language:TypeScript18103
  • zyocum/pdf2md

    Convert PDF to Markdown via OpenAI multi-modal text/vision model.

    Language:Python17314
  • mickymultani/GPT-4-Vision-Architecture-Scanner

    A web-based tool that utilizes GPT-4's vision capabilities to analyze and describe system architecture diagrams, providing instant insights and detailed breakdowns in an interactive chat interface.

    Language:JavaScript14322
  • Uli-Z/autoPDFtagger

    autoPDFtagger is a Python tool designed for efficient home-office organization, focusing on digitizing and organizing both digital and paper-based documents. By automating the tagging of PDF files, including image-rich documents and scans of varying quality, it aims to streamline the organization of digital archives.

    Language:Python14340
  • caesarnine/llm-experiments

    Playing around with LLMs

    Language:Python8102
  • gpt-to-aws

    johntelforduk/gpt-to-aws

    Create AWS infrastructure using architecture diagrams and natural language interpreted using the OpenAI GPT model.

    Language:Python7100
  • zhudotexe/kani-vision

    Kani extension for supporting vision-language models (VLMs). Comes with model-agnostic support for GPT-Vision and LLaVA.

    Language:Python7200
  • ScorchChamp/replicate-GPT

    Replicate any image using Dall-E 3 and GPT-Vision!

    Language:Python6100
  • BardieTS

    Zoheb-Malik/BardieTS

    A powerful AI package (built using typescript), inspired by @rizzlogy/bardie, for interacting with the Google Bard API - without needing to set your own cookie!

    Language:TypeScript4100
  • adityathakurxd/make-real-polls

    Create interactive polls directly from the whiteboard content. Built on top of tldraw make-real template and live audio-video by 100ms, it uses OpenAI's GPT Vision to create an appropriate question with options to launch a poll instantly that helps engage the audience.

    Language:TypeScript2180
  • Cryserrrrr/openAi-interface

    Interface to use openAI API (GPT4, Dall-e 3, ...)

    Language:TypeScript21211
  • GeorgeNance/gpt-caption

    Auto caption images for training in Stable Diffusion

    Language:Python2300
  • YohanV1/InvoiceGPT

    AI Powered Invoice Processing! Capture data effectively through contextual OCR and then ask your AI assistant about your own past purchases.

    Language:Python10
  • arnenoori/gptv-screenshot-renamer

    gpt-4v screenshot renamer

    Language:Python101
  • arnenoori/handwriting2markdown

    handwriting2markdown

    Language:Python10
  • clessn/clellm

    This package provides functions to interact with OpenAI's GPT model for image analysis, install Ollama on Linux systems, install models with Ollama, and call the Ollama API. It is designed to facilitate easy interaction with these services through R functions.

    Language:R101
  • mayuras7685/HIS-Unkils

    Project submission for hack it sapiens hackathon.

    Language:Python10