/lm-evaluation-harness

Few-shot evaluation of language models. Fork for the BabyLM competition (CoNLL '23).

Primary LanguagePythonMIT LicenseMIT

Stargazers

No one’s star this repository yet.