babelcloud/LLM-RGB
LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.
TypeScriptMIT
Stargazers
- 0xluckycodercolombo , Sri Lanka
- AaaaashAlibaba
- afc163Alipay
- atomd
- Bazinga-Wang
- BigBird01Microsoft
- bodynoSoftwareEngineer
- bubuhui
- creke
- fatjyc
- haxShanghai, China
- huajiankan
- hxfdarling@bytedance
- ishotoli
- jiraiyame
- leiluxChengdu, China
- momoxia
- moorejeeZJU
- Ryqsky广州
- ShenQingchuanSHEIN
- shiqiuwang
- szy0syz@UrbanCompass
- Undertone0809Mars
- vangie@aliyun @babelcloud
- war40870527951job
- WhenWenShenzhen
- wille-42
- woc2006ShenZhen, China
- xiangst0816@Bytedance
- Yahiy
- yashdevelopmentVirginia, USA
- ycjcl868BAT
- yxjoey
- zhenweiwang1990@he3-app
- zhlmmcBabel Inc.
- zthreefires