/Adversarial-Representation-Engineering

Official implementation repository for the paper "Towards General Conceptual Model Editing via Adversarial Representation Engineering".

Primary language: Python
License: MIT