MLGroupJLU/LLM-eval-survey
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
Issues
- 0
Paper Title Change
#29 opened by nlee-208 - 0
The leaderboard website is down...
#30 opened by zhimin-z - 1
Can you add SpyGame to your survey?
#26 opened by Skytliang - 0
Can you add our recent work to your survey?
#25 opened by grayground - 4
Add CMB to your paper
#22 opened by g-h-chen - 1
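Question: is data contamination detection for LLMs (determining whether a dataset was seen during training) a viable technical direction? Are there any recommended papers?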
#21 opened by gongjunjin - 1
Add Llama 2 as model evaluated?
#15 opened by tiansiyuan - 2
Suggestion for adding OpenCompass to survey
#11 opened by gaotongxiao - 1
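Question: what research directions exist for using evaluation feedback to optimize LLMs, i.e., having evaluation produce suggestions for improving the model?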
#20 opened by gongjunjin - 1
Add paper, ALIGNING AI WITH SHARED HUMAN VALUES
#18 opened by tiansiyuan - 2
ARB Benchmark
#10 opened by kennethleungty - 2
Add a new paper.
#3 opened by Wangpeiyi9979