Pinned Repositories
Flames
Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.
MLLMGuard
Fake-Alignment
ESC-Eval
[EMNLP 2024] ”ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models“
Reflection-Bench
probing AI intelligence with reflection
AIFlames's Repositories
AIFlames doesn’t have any repository yet.