/Transformer-Interpretability

Transformer Interpretability using transformer lens. Plotting all the heads of a single layer to see different attentions and ablating different heads to see how much they affect output

Primary LanguageJupyter Notebook

Stargazers

No one’s star this repository yet.