princeton-nlp/LLMBar
[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following
PythonMIT
Issues
- 1
复现对不齐问题
#2 opened by Joe-Hall-Lee - 5
Scripts to generate adversarial data
#1 opened by HuihuiChyan
[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following
PythonMIT