This repository contains tools for generating datasets and evaluating predictions for the following AI2 Leaderboards: ARC (AI2 Reasoning Challenge) OpenBook QA