roose12/CipherChat
A framework to evaluate the generalization capability of safety alignment for LLMs
Python · MIT