/hooked_transformer

GPT2 style transformer with hooks for caching residual stream activations

Primary LanguagePython

This repository is not active