agential-ai/agential

[Feature Request]: MATH Benchmark

Opened this issue · 0 comments

Feature Description

MATH benchmark is harder than GSM8K. May be worth including down the line.

Reason

No response