[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
Primary LanguagePythonApache License 2.0Apache-2.0
No one’s watching this repository yet.