A library for mechanistic interpretability of GPT-style language models
Primary LanguagePythonMIT LicenseMIT