/implementing_activation_steering

A collection of different ways to implement accessing and modifying internal model activations for LLMs

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Stargazers