/CipherChat

A framework to evaluate the generalization capability of safety alignment for LLMs

Primary language: Python
License: MIT
