/SemanticSmooth

Implementation of the paper 'Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing'.

Primary language: Python
License: MIT
