Pinned Repositories
ICD-LM
An in-context learning repo for Large Multimodal Model (LMM)
OFv2_ICL_VQA
[CVPR 2024] How to Configure Good In-Context Sequence for Visual Question Answering
Pet-Soul
TSG
[ACL 2023] Transforming Visual Scene Graphs to Image Captions
TSG_model
Pet-Soul
zcccccz.github.io
GaryJiajia's Repositories
GaryJiajia/OFv2_ICL_VQA
[CVPR 2024] How to Configure Good In-Context Sequence for Visual Question Answering
GaryJiajia/TSG
[ACL 2023] Transforming Visual Scene Graphs to Image Captions
GaryJiajia/ICD-LM
An in-context learning repo for Large Multimodal Model (LMM)
GaryJiajia/Pet-Soul
GaryJiajia/TSG_model