codefuse-ai/codefuse-devops-eval
Industrial-first evaluation benchmark for LLMs in the DevOps/AIOps domain.
PythonNOASSERTION
Issues
- 0
tool call模型
#21 opened by zcgit001 - 1
function call dataset
#20 opened by paleblackless - 1
有增加其他模型比较的计划吗?
#7 opened by leiwen83 - 1
您好,请问fcdata-zh-luban和fcdata-zh-codefuse的区别是啥?
#16 opened by GeniusYx - 0
- 0
Any related arxiv?
#17 opened by zhimin-z - 0
DevOps Summary Benchmark
#15 opened by lightislost - 1
toollearning数据集是否可以提供?
#13 opened by yangyuxiang1996 - 1
hf 地址失效了 求更新
#8 opened by Mr1994 - 1
hello~hf链接失效了,页面打开都是404哦~
#6 opened by yangbiaoqiange - 1
支持多种开源模型prompt格式
#3 opened by donttal - 1
数据集需要进一步清洗
#2 opened by hhk123 - 0