FlowVQA introduces a novel benchmark for visual question answering, emphasizing the use of flowcharts for complex reasoning and evaluation. This benchmark aims to challenge and enhance the capabilities of multimodal language models through spatial reasoning, decision-making, and logical progression tasks.
Traditional benchmarks in visual question answering do not fully test models' visual grounding and complexity, especially in spatial reasoning. FlowVQA addresses this gap by providing a comprehensive set of 2,272 human-verified flowchart images and 22,413 question-answer pairs. This new benchmark is designed for a thorough evaluation of visual and logical reasoning capabilities in AI.
Please consider citing our paper if you use FlowVQA in your research:
@article{SinghEtAlXXXX, title={Your Paper Title Here}, author={Singh, Shubhankar and others}, journal={Journal Name}, volume={XX}, number={XX}, pages={XX--XX}, year={XXXX}, publisher={Publisher} }
- Data Repository: Contains test and train JSONs, including flowchart images, mermaid scripts, tags, questions, and question types.
- Code Repository: Features code snippets, example prompts, and additional resources.
The project was developed by Shubhankar Singh, Purvi Chaurasia, Yerram Varun, Vatsal Gupta, and Pranshu Pandya, under the mentorship of Dr. Vivek Gupta and Dr. Dan Roth.
FlowVQA facilitates research in several key areas, such as:
- Flowchart Reasoning: Enhancing visual logic and reasoning capabilities of models.
- Graph-Encoder Models: Improving structural reasoning with flowchart-based models.
- Adversarial and Counterfactual Probes: Testing models with challenging questions.
- Complex Subtasks: Developing additional tasks for comprehensive training and evaluation.
- NeuroSymbolic AI: Applying neurosymbolic methods for better performance and understanding.
🚧 Under Construction. Please refrain from using the data or code until further notice.
For project updates and more details on contributing, please follow this repository.
Thank you for your interest in FlowVQA! 🎉