/abliterator

Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens

Primary LanguagePythonMIT LicenseMIT

Stargazers

No one’s star this repository yet.