/lm-evaluation-harness

A framework for few-shot evaluation of autoregressive language models.

Primary language: Python
License: MIT
