nadyka/lm-evaluation-harness-bg
A framework for few-shot evaluation of language models, extended with benchmarks by https://insait.ai/
PythonMIT
A framework for few-shot evaluation of language models, extended with benchmarks by https://insait.ai/
PythonMIT