declare-lab/red-instruct

Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment

PythonApache-2.0

Issues

Internal thought in CoU, but not in training data
#10 opened 19 days ago by Rogerspy
1
api_key
#8 opened 5 months ago by HITLittleZheng
0
RuntimeError: No GPU found.
#9 opened 5 months ago by HITLittleZheng
0
api_key
#6 opened 6 months ago by richhh520
3
Is it possible to use chatgpt's API instead of gpt4's?
#7 opened 6 months ago by HITLittleZheng
1
missing chat template when querying open-sourced models
#5 opened 6 months ago by wangruohui
1
Loss formulation of Strategy-B: Alignment using red data
#4 opened 6 months ago by YJWon99
1
The Red-Eval Method seems to not work in ChatGPT
#2 opened a year ago by ZiyueWang25
2
Query about the Dataset Structure
#1 opened a year ago by fabrahman
3