FudanSELab/ClassEval

Do you plan to update the benchmark?

rodion-m opened this issue · 1 comments

This is a good benchmark, thank you for that. Do you plan to add modern models like Opus, llama-3, granite, codeqwen1.5-chat and so on to the benchmark?

Thanks for your feedback! We are currently working on this and plan to add the latest models to the benchmark soon.