xianghuisun/math-evaluation-harness

A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨

Python · MIT

This repository is not active