VTprompt

This repository contains the code for the paper: "Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models".

Installation

Environment Setup

Please follow the instructions in Grounded Segment Anything to set up the environment.

Usage

Building Vprompt
Using Tprompt to prompt Multimodal Large Language Models for generating answers.

Evaluation Code and Usage Tutorial

We are currently in the process of organizing detailed evaluation code and usage tutorials. Please stay tuned for updates!

jinyeying/VTprompt

VTprompt

Installation

Environment Setup

Usage

Evaluation Code and Usage Tutorial