mllm-reasoning
There are 11 repositories under the mllm-reasoning topic.
yaotingwangofficial/Awesome-MCoT
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
ritzz-ai/GUI-R1
Official implementation of GUI-R1: A Generalist R1-Style Vision-Language Action Model for GUI Agents
Wild-Cooperation-Hub/Awesome-MLLM-Reasoning-Benchmarks
A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.
manglu097/Chiron-o1
[NeurIPS 2025] Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative Search
luo-junyu/FinMME
[ACL 2025] FinMME: Benchmark Dataset for Financial Multi-Modal Reasoning Evaluation
YutingLi0606/Vision-Matters
[arXiv 2025] Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning
Kun-Xiang/AtomThink
Official repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"
falonss703/Awesome-Uncertainty-based-Reinforcement-Learning
🔥🔥🔥 Latest papers and code on uncertainty-based RL
Jorffy/NoteMR
[CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering".
SkyworkAI/CSVQA
A Multimodal Benchmark for Evaluating Scientific Reasoning Capabilities of VLMs
vulab-AI/YESBUT-v2
We introduce YesBut-v2, a benchmark for assessing AI's ability to interpret juxtaposed comic panels with contradictory narratives. Unlike existing benchmarks, it emphasizes visual understanding, comparative reasoning, and social knowledge.