huhusmang/llm-code-eval
LLMCodeEval: An Execution-Based Multilingual Multitask Multidimensional Benchmark for Evaluating Large Language Models on Code Understanding and Generation
Python