[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
hylq66 doesn’t have any repository yet.