Codebase for the NeurIPS 2024 paper: "Analyzing the Generalization and Reliability of Steering Vectors"
Primary LanguageJupyter Notebook