jiangsongtao/VTprompt
Code for the paper: Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models