UCSB-NLP-Chang/SemanticSmooth

Implementation of paper 'Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing'

PythonMIT

Issues

emsemble_policy
#3 opened 5 months ago by huifeng3
0
Issues with running code
#1 opened 9 months ago by yewang
2
Attack String Length for GCG
#2 opened 8 months ago by SinHanYang
2