amariucaitheodor/lm-evaluation-harness
Few-shot evaluation of language models. Fork for the BabyLM competition (CoNLL '23).
PythonMIT
Stargazers
No one’s star this repository yet.
Few-shot evaluation of language models. Fork for the BabyLM competition (CoNLL '23).
PythonMIT
No one’s star this repository yet.