/SafeDecoding

Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding

Primary LanguageJupyter NotebookMIT LicenseMIT

Stargazers