/multimodal-maestro

Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥

Primary Language: Python
License: MIT