thu-coai/SafetyBench
Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]
PythonMIT
Issues
- 4
leaderboard 似乎不能登入?
#7 opened by FarnHua - 3
zero shot结果对不齐的原因
#4 opened by Alexyuanfun - 1
测试集没有提供答案么
#6 opened by Chipmunkkk - 1
Submission
#5 opened by HuTongxin - 1
- 3
提交结果以后,能否更新?
#3 opened by web199195 - 1