Pinned Repositories
BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models.
caliban
Research workflows made easy, locally and in the Cloud.
uv-metrics
Composable metric reporters in Python.
ajslone's Repositories
ajslone doesn’t have any repositories yet.