jbloomAus

Pinned Repositories

alphabetical_probe
Experimental code which trains 26 linear probes to detect the presence of alphabetic letters in GPT-J token strings, given their embeddings. Exploring the resulting vector arithmetic and its impact on GPT-J spelling abilities
Language:Jupyter Notebook2 1 01
ARENA_2.0-RLHF
Preparing content for the ARENA RLHF day.
Language:Jupyter Notebook1 3 01
DecisionTransformerInterpretability
Interpreting how transformers simulate agents performing RL tasks
Language:Jupyter Notebook77 3 7518
SAEDashboard
Language:Python29 4 43
SAELens
Training Sparse Autoencoders on Language Models
Language:Jupyter Notebook616 7 126142
SparseAutoencoderSuperposition
Language:Jupyter Notebook1 2 00
SpellingSAEExperiment
Language:Python1 1 00
toy_model_interpretability
I'd like to start playing around with toy models to better understand results in recent papers.
Language:Python1 2 00
TransformerLens
Language:Python1 1 00
TransformerLens
A library for mechanistic interpretability of GPT-style language models
Language:Python1.9k 17 276334

jbloomAus's Repositories

jbloomAus/SAELens
Training Sparse Autoencoders on Language Models
Language:Jupyter Notebook616 7 126142
jbloomAus/DecisionTransformerInterpretability
Interpreting how transformers simulate agents performing RL tasks
Language:Jupyter Notebook77 3 7518
jbloomAus/SAEDashboard
Language:Python29 4 43
jbloomAus/alphabetical_probe
Experimental code which trains 26 linear probes to detect the presence of alphabetic letters in GPT-J token strings, given their embeddings. Exploring the resulting vector arithmetic and its impact on GPT-J spelling abilities
Language:Jupyter Notebook2 1 01
jbloomAus/ARENA_2.0-RLHF
Preparing content for the ARENA RLHF day.
Language:Jupyter Notebook1 3 01
jbloomAus/SparseAutoencoderSuperposition
Language:Jupyter Notebook1 2 00
jbloomAus/SpellingSAEExperiment
Language:Python1 1 00
jbloomAus/toy_model_interpretability
I'd like to start playing around with toy models to better understand results in recent papers.
Language:Python1 2 00
jbloomAus/TransformerLens
Language:Python1 1 00
jbloomAus/WindWhisper
Wrapper for whisper that lets you record and transcribe to a clipboard.
Language:Python1
jbloomAus/arena-v1
Language:Jupyter Notebook0 1 00
jbloomAus/arena-v1-ldn
Language:Jupyter Notebook0 1 00
jbloomAus/ARENA_2.0
I'm teaching ARENA 2.0 and providing students with direction on careers and personal development.
Language:Python0 1 00
jbloomAus/babyai
BabyAI platform. A testbed for training agents to understand and execute language commands.
Language:Python1 0
jbloomAus/Backwards
Language:Python1 0
jbloomAus/Exploring-2L-SAE
Language:HTML1 01
jbloomAus/geom_median
Fast and differentiable geometric median, a multivariate median analogue. Install with `pip install geom-median`
Language:Python1 0
jbloomAus/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
Language:Python1 0
jbloomAus/Module-1
Module 1 - Autodifferentiation
Language:Python1 0
jbloomAus/post--memory-dt-features
Language:HTML1 01
jbloomAus/protein-inference
A python package for protein inference in Mass Spectrometric data analysis.
Language:Python1 0
jbloomAus/rust_cli_project
I'm teaching myself Rust.
Language:Rust2 0
jbloomAus/rust_text_editor
Learning by doing with Rust. Following along the Hecto tutorial https://www.philippflenker.com/hecto/
Language:Rust2 0
jbloomAus/SAE_Bench_Template
Language:Python
jbloomAus/sparse_autoencoder
Sparse Autoencoder for Mechanistic Interpretability
Language:Python1 0