open-compass/CriticBench
[NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs
PythonApache-2.0
Stargazers
- chenxu05037
- Echo-minnOpenMMLab
- Erwin-X
- evdcush
- Ezra-Yushanghai
- gmftbyGMFTBYBeijing Institute of Technology
- jbwang1997NWPU -> NKU
- JeffCarpenterCanada
- Kunlun-ZhuMila-Quebec AI Institute; UdeM
- MatCaviarTONGJI UNIVERSITY
- Meteor-xxBeijing,China
- monmonliUniversity of Michigan, Ann Arbor
- SeungoneKimCarnegie Mellon University
- shyramSamsung Research HQ
- vivian1928
- xnzacUK
- zehuichen123USTC
- ZhaoQiiiiShanghai AI Lab
- zhimin-zSoftware Analysis and Intelligence Lab
- zwvc
- ZwwWayneMMLab, NTU