Northind/CipherChat
A framework to evaluate the generalization capability of safety alignment for LLMs
PythonMIT
No issues in this repository yet.
A framework to evaluate the generalization capability of safety alignment for LLMs
PythonMIT
No issues in this repository yet.