/lm-evaluation-harness

Few-shot evaluation of language models. Fork for the BabyLM competition (CoNLL '23).

Primary LanguagePythonMIT LicenseMIT

No issues in this repository yet.