The MAGMA benchmark evaluates the performance of large language models (LLMs) on classical graph algorithms using intermediate steps.
Primary language: Python
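To make "intermediate steps" concrete, here is a minimal sketch of a classical graph algorithm (breadth-first search) that records a snapshot of its state after each node expansion. This is an illustration only: the function name `bfs_with_trace`, the trace fields (`expanded`, `queue`, `visited`), and the example graph are all assumptions made here and are not taken from the MAGMA repository or its data format.

```python
from collections import deque


def bfs_with_trace(graph, source):
    """Run BFS from `source` and record one snapshot per expanded node.

    `graph` is an adjacency list: {node: [neighbors, ...]}.
    The trace format is hypothetical, chosen only for illustration.
    """
    seen = {source}          # fast membership checks
    order = [source]         # nodes in the order they were discovered
    queue = deque([source])
    trace = []

    while queue:
        node = queue.popleft()
        for neighbor in graph[node]:
            if neighbor not in seen:
                seen.add(neighbor)
                order.append(neighbor)
                queue.append(neighbor)
        # Snapshot of the algorithm state after expanding `node`.
        trace.append({
            "expanded": node,
            "queue": list(queue),
            "visited": list(order),
        })

    return order, trace


# Hypothetical 4-node undirected graph.
graph = {0: [1, 2], 1: [0, 3], 2: [0, 3], 3: [1, 2]}
order, trace = bfs_with_trace(graph, 0)
for step in trace:
    print(step)
```

A step-level evaluation could then compare a model's predicted snapshots against a reference trace of this kind, rather than checking only the final visiting order.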