Pinned Repositories
lm-evaluation-harness
A framework for few-shot evaluation of language models.
s1
s1: Simple test-time scaling
simplescaling.github.io
simplescaling's Repositories
simplescaling/s1
s1: Simple test-time scaling
simplescaling/simplescaling.github.io
simplescaling/lm-evaluation-harness
A framework for few-shot evaluation of language models.