Pinned Repositories
tinyBenchmarks
Evaluating LLMs with fewer examples
lm-evaluation-harness
A framework for few-shot evaluation of language models.
tinyBenchmarks
Evaluating LLMs with fewer examples
tinyBenchmarks
Evaluating LLMs with fewer examples
seiljus's Repositories
seiljus/tinyBenchmarks
Evaluating LLMs with fewer examples