MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Primary LanguagePythonMIT LicenseMIT