/lmm-graph-vision

How well do the GPT-4V, Gemini Pro Vision, and Claude 3 Opus models perform zero-shot vision tasks on data structures?

Primary LanguagePythonMIT LicenseMIT

Watchers