/red-instruct

Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment

Primary LanguagePythonApache License 2.0Apache-2.0

Issues