gemini-vision-pro

There are 14 repositories under the gemini-vision-pro topic.

  • IRedDragonICY/vixevia

    An AI-powered Virtual YouTuber (Vtuber) utilizing Google's Gemini language model to create engaging, personalized, and context-aware interactions. This project explores the potential of AI in human-computer interaction and virtual content creation.

    Language:Python35154
  • nuhmanpk/VisionScriptBot

    A Telegram bot that uses Google's Gemini Pro Vision API to convert images to text

    Language:Python20218
  • Xeven777/gemini-chat

    An AI-powered chat bot built with Next.js 14 and Google Gemini, featuring real-time interaction, responsive design, and support for Gemini Pro and Gemini Vision models. This project showcases the power of AI in enhancing user engagement and providing intelligent responses.

    Language:JavaScript20127
  • Manraj29/Wardrobe-Guru

    Suggests outfits from the user's available clothing options, using the Gemini Pro and Gemini Vision Pro APIs.

    Language:Python4100
  • Manraj29/Image-Caption-Gen

    Basic image recognition application using Gemini Vision Pro that also generates captions for Instagram

    Language:Python3100
  • arslanstack/gemini-vision-pro-implementation

    Gemini Vision Pro API with Multimodal Prompts in JavaScript (Node.js & Express.js)

    Language:JavaScript2100
  • kevin-rs/gems

    💎 A CLI, TUI, and SDK for interacting with the Gemini API (WIP)

    Language:Rust2111
  • Cerne17/ProjetoNotas

    This is the final project for Alura's Imersão IA (AI Immersion) course, run in collaboration with Google. It generates PDFs from handwritten notes.

    Language:Python1100
  • rohit2k3/Gemini-Telegram-Bot

    Gemini: Your AI-Powered Q&A and Image Guru in Telegram. Ask any question, get clear answers. Upload any image, unveil its secrets. All powered by Google's next-gen AI. ✨ Join the Gemini Telegram Bot now!

    Language:JavaScript1103
  • ssabrut/medical-image-detection-llm

    This project is an advanced AI-powered tool designed to analyze medical images, leveraging the robust capabilities of Google Gemini for accurate image recognition and Streamlit for an intuitive user interface.

    Language:Python120
  • BedantaGautom/AI-Chemist

    Web application using Generative AI

    Language:Python00
  • polymathbenchmark/polymathbenchmark.github.io

    A Challenging Multi-Modal Mathematical Reasoning Benchmark

    Language:JavaScript0000
  • jhenilparihar/NutriSafe

    A PWA app that scans food for details, alerts on allergens, suggests alternatives, and provides recipes.

    Language:JavaScript00
  • NajiAboo/google-gemini

    Detailed code explanation of Google's Gemini LLM

    Language:Jupyter Notebook10