/abliterator

Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens

Primary LanguagePythonMIT LicenseMIT

This repository is not active