/lm-evaluation-harness

A framework for few-shot evaluation of autoregressive (generative) large language models.

Primary language: Python
License: MIT
