Issues
- 1
Submit your results to C-Eval多久会有结果
#88 opened by zellelin - 1
C-Eval排行榜已经填写问卷,排行榜什么时候可以刷新
#87 opened by Waneila - 18
- 1
示例代码无法正常加载数据集
#86 opened by ningmenghongcha - 1
发现几个问题好像标注答案不太对
#84 opened by zky001 - 1
上传模型评测结果到网页上报错 Any subject must contain at least 50 questions to be calculated
#85 opened by mmarzl17 - 1
为什么test数据集里没有正确答案的信息呢
#83 opened by nanzhaogang - 1
您好,榜单什么时候更新,我们提交了三天了,能麻烦更新下吗
#82 opened by Zheng-Jay - 1
申请公开榜单
#81 opened by lingbaishun - 1
C-Eval排行榜提交,排行榜什么时候可以刷新
#80 opened by Waneila - 1
测试结果提交报错,点击process时,提示:Any subject must contain at least 50 questions to be calculated
#79 opened by hailing2024 - 1
Leaderboard Update
#77 opened by better-one - 3
gpt-4-1106-preview 有人测试过 test 的分数吗?
#68 opened by theblackcat102 - 1
The leaderboard in GitHub is out of sync with the latest version on the official website
#76 opened by zhimin-z - 0
Leaderboard Update
#75 opened by 13416157913 - 1
HOW TO EVALUATE STEM???
#74 opened by AkKari808 - 0
Leaderboard Update
#72 opened by lijiahuan-01 - 0
C-Eval榜单从提交评测到榜单上能看到成绩大概需要多久?
#70 opened by blueseasky - 3
- 2
- 1
什么时候更新榜单呢?
#69 opened by matrix-yang - 0
Leaderboard Update
#66 opened by chn91127 - 1
Leaderboard Update
#67 opened by chn91127 - 1
public display
#65 opened by jiahui098 - 3
模型是否真正掌握了相关知识而不是在猜答案?
#61 opened by yucc-leon - 1
请问chatglm3-6b-base发布在哪里?
#64 opened by yayaQAQ - 1
- 1
prompt行尾含有空格会发生什么?为什么不能有空格
#51 opened by cangyi071 - 1
自然语言处理的相关任务属于知识型还是推理型任务呢?
#59 opened by liumingzhu6060 - 1
llama和其他模型评测时不同点
#63 opened by Chandler-Bing - 3
关于确认CEval可以被hack之后的计划
#41 opened by yucc-leon - 0
- 8
官方示例加载数据集报错
#58 opened by JensenDong - 5
为什么我用c-eavl测试chatglm2-6B 在zero-shot 下的分数很低?
#53 opened by EdisonWujr - 2
Atom-13B不是公开访问的模型
#60 opened by TerraceCN - 3
- 1
- 4
测试集中的部分错误。
#52 opened by hanjr92 - 1
只能单选吗?可以多选吗?
#56 opened by xxm1668 - 2
public display
#54 opened by ZHangZHengEric - 4
申请公开
#55 opened by huayicong23 - 3
C-Eval 提交规则限制
#47 opened by suolyer - 1
请问模型公开结果需要做哪些动作呀?
#50 opened by xyzhou-puck - 4
chatglm2-6b在valid set上的zero-shot结果似乎有问题
#46 opened by ylwangy - 1
lm-evaluation-harness 是用test集测评的吗?
#44 opened by ChangyuanWu - 0
官网无法登录,无法提交答案
#45 opened by wuliaoren05 - 2
middle_school_history_test.csv 中有题目错误
#40 opened by AiLMe-AI - 1
Problematic question in test set
#42 opened by wgb14 - 7
提交结果问题
#43 opened by 18811449050 - 2
结果提交的疑问
#39 opened by renmengjie7