/Self-Reminder

Code for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder" in NMI.

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Stargazers