mtbench101/mt-bench-101
[ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
Apache-2.0
Stargazers
- apache2046
- bittersweet1999
- bryanyzhuAmazon AI
- dongguantingRenmin University of China
- doublejingz
- huiwy
- liujiahengBeihang University (BUAA)
- liuyaox
- lmc8133Beijing University of Posts and Telecommunications
- mtbench101
- penglin03
- sefiraAlibaba
- shyramSamsung Research HQ
- tiezhuguangtailang
- victorjiax
- warpmatrixSun Yat-sen University
- wwn1233
- yifan123The Chinese University of Hong Kong
- zemerovMoscow, Russia
- ZhuochengZhang98University of Chinese Academy of Sciences