Pinned Repositories
t5x
BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
hwchung27.github.io
evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
hwchung27's Repositories
hwchung27/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
hwchung27/hwchung27.github.io