Pinned Repositories
Advent_of_code_2023
automated_interpretability
CAA
Steering Llama 2 with Contrastive Activation Addition
estagio
estagio2
geometry-of-truth
GpiT
llm-latent-language
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
mambaChess
SrGonao's Repositories
SrGonao/llm-latent-language
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
SrGonao/Advent_of_code_2023
SrGonao/automated_interpretability
SrGonao/CAA
Steering Llama 2 with Contrastive Activation Addition
SrGonao/estagio
SrGonao/estagio2
SrGonao/geometry-of-truth
SrGonao/GpiT
SrGonao/mambaChess
SrGonao/nnsight
The nnsight package enables interpreting and manipulating the internals of deep learned models.
SrGonao/paper-learning-to-pivot
Repository for the paper "Learning to Pivot with Adversarial Networks"
SrGonao/srgonao.github.io
SrGonao/StochasticIntruderExtruder
SrGonao/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
SrGonao/tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer