danbraunai-apollo

danbraunai-apollo's Stars

noanabeshima/tinymodel
A TinyStories LM with SAEs and transcoders
Language:Python7
timothee-chauvin/eyeballvul
future-proof vulnerability detection benchmark, based on CVEs in open-source repos
Language:Python446
hijohnnylin/automated-interpretability
Language:Python64
jbloomAus/SAELens
Training Sparse Autoencoders on Language Models
Language:Jupyter Notebook455121
ai-safety-foundation/sparse_autoencoder
Sparse Autoencoder for Mechanistic Interpretability
Language:Python18839