Pinned Repositories
acl-anthology
Data and software for building the ACL Anthology.
ScienceQA
Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".
BLIVA
(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
SDA
VisLingInstruct
(NAACL 2024) VisLingInstruct: Elevating Zero-Shot Learning in Multi-Modal Language Models with Autonomous Instruction Optimization
Zhudongsheng75's Repositories
Zhudongsheng75/VisLingInstruct
(NAACL 2024) VisLingInstruct: Elevating Zero-Shot Learning in Multi-Modal Language Models with Autonomous Instruction Optimization
Zhudongsheng75/SDA