Pinned Repositories
lm-evaluation-harness
A framework for few-shot evaluation of language models.
generative_data_prep
lm-evaluation-harness
A framework for few-shot evaluation of language models.
auto_search_web
SambaNova_vs_Groq_Evals
A framework for few-shot evaluation of language models.
A framework for few-shot evaluation of language models.