
Repository for the paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models".

Primary language: Python. License: MIT.

TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models


This repository contains the evaluation data, instructions, and demonstrations for the paper TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models (Chu et al., 2023).

Datasets

Symbolic Temporal Reasoning

Commonsense Temporal Reasoning

Event Temporal Reasoning

Models

Performance

[Image: performance results]

Citation

If you find our work helpful, you can cite this paper as:

@misc{chu2023timebench,
      title={TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models}, 
      author={Zheng Chu and Jingchang Chen and Qianglong Chen and Weijiang Yu and Haotian Wang and Ming Liu and Bing Qin},
      year={2023},
      eprint={2311.17667},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2311.17667}
}