This repository is not active
xianghuisun/math-evaluation-harness
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
PythonMIT
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
PythonMIT
This repository is not active