CaoYuanpu/BackdoorUnalign
Code of NAACL 2024 paper "Stealthy and Persistent Unalignment on Large Language Models via Backdoor Injections".
Python
Code of NAACL 2024 paper "Stealthy and Persistent Unalignment on Large Language Models via Backdoor Injections".
Python