/sae-probe

Investigating the feasibility of using sparse autoencoder (SAE) features as a basis for sparse reconstructions of linear probes

Primary Language: Python
