Pinned Repositories
aaa_23_submission
arc-oocr
arena
ARENA_2.0
astra-owain
blargh
bridgeboards
BridgeHand2Vec
evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
reversal-curse-toy-model
Toy models for Reversal Curse (https://owainevans.github.io/reversal_curse.pdf)
johny-b's Repositories
johny-b/astra-owain
johny-b/BridgeHand2Vec
johny-b/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
johny-b/reversal-curse-toy-model
Toy models for Reversal Curse (https://owainevans.github.io/reversal_curse.pdf)
johny-b/sa-oocr
johny-b/aaa_23_submission
johny-b/arc-oocr
johny-b/arena
johny-b/ARENA_2.0
johny-b/blargh
johny-b/bridgeboards
johny-b/dds-transformer
johny-b/yapapi
Python high-level API for Golem.
johny-b/inductive-oocr
johny-b/modiff
johny-b/mouse-goal-misgeneralization
johny-b/multi-model-api
johny-b/procgen-tools
Tools for running experiments on RL agents in procgen environments