open-compass/CriticBench
A comprehensive benchmark for evaluating critique ability of LLMs
PythonApache-2.0
Stargazers
- chenxu05037
- Echo-minnOpenMMLab
- Erwin-X
- evdcush
- Ezra-Yushanghai
- gmftbyGMFTBYBeijing Institute of Technology
- jbwang1997NWPU -> NKU
- JeffCarpenterCanada
- Kunlun-ZhuMila-Quebec AI Institute; UdeM
- MatCaviar
- Meteor-xxBeijing,China
- monmonliUniversity of Michigan, Ann Arbor
- SeungoneKimKAIST AI
- shyramSamsung Research HQ
- vivian1928
- xnzacUK
- zehuichen123USTC & SenseTime
- ZhaoQiiiiShanghai AI Lab
- zhimin-zQueen's University
- zwvc
- ZwwWayneMMLab, NTU