GanjinZero/math401-llm
Source code and datasets for "How well do Large Language Models perform in Arithmetic tasks?"
Issues
Duplicate cases in the test set
#4 opened by yanyc428
Definition of `big` number?
#3 opened by sieu-n
Evaluation code
#1 opened by cobraheleah