/-llm--I-FSJ

Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses (NextGenAISafety @ ICML 2024)

Primary Language: Python
License: MIT
