vision-language-dataset

There are 3 repositories under the vision-language-dataset topic.

  • Q-Future/Q-Bench

    [ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and visual quality assessment.

    Language: Jupyter Notebook
  • SHTUPLUS/GITM-MR

    The official implementation for the ICCV 2023 paper "Grounded Image Text Matching with Mismatched Relation Reasoning".

    Language: Python
  • unitaryai/VTC-dataset

    Language: Python