WisconsinAIVision/ViP-LLaVA

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

PythonApache-2.0

Readme
16Issues
179Stargazers
5Watchers

Watchers

dnth
@zenml-io
haotian-liu
UW-Madison
hsaigroup
L4zyy
mu-cai
University of Wisconsin - Madison

Contact site admin: Geeks.