Benchmark for evaluating the reasoning abilities of deep learning models.
Primary LanguageJupyter NotebookMIT LicenseMIT