amariucaitheodor/lm-evaluation-harness
Few-shot evaluation of language models. Fork for the BabyLM competition (CoNLL '23).
PythonMIT
No issues in this repository yet.
Few-shot evaluation of language models. Fork for the BabyLM competition (CoNLL '23).
PythonMIT
No issues in this repository yet.