/CipherChat

A framework to evaluate the generalization capability of safety alignment for LLMs

Primary LanguagePythonMIT LicenseMIT

No issues in this repository yet.