CLUE benchmark
Organization of Language Understanding Evaluation benchmark for Chinese: tasks & datasets, baselines, pre-trained Chinese models, corpus and leaderboard
Pinned Repositories
CLUE
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
CLUECorpus2020
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
CLUEDatasetSearch
搜索所有中文NLP数据集,附常用英文NLP数据集
CLUENER2020
CLUENER2020 中文细粒度命名实体识别 Fine Grained Named Entity Recognition
CLUEPretrainedModels
高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型
SuperCLUE
SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese
SuperCLUE-Agent
SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准
SuperCLUE-Auto
汽车行业中文大模型测评基准,基于多轮开放式问题的细粒度评测
SuperCLUE-RAG
中文原生检索增强生成测评基准
SuperCLUE-Safety
SC-Safety: 中文大模型多轮对抗安全基准
CLUE benchmark's Repositories
CLUEbenchmark/CLUEDatasetSearch
搜索所有中文NLP数据集,附常用英文NLP数据集
CLUEbenchmark/CLUE
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
CLUEbenchmark/SuperCLUE
SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese
CLUEbenchmark/CLUENER2020
CLUENER2020 中文细粒度命名实体识别 Fine Grained Named Entity Recognition
CLUEbenchmark/CLUECorpus2020
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
CLUEbenchmark/FewCLUE
FewCLUE 小样本学习测评基准,中文版
CLUEbenchmark/pCLUE
pCLUE: 1000000+多任务提示学习数据集
CLUEbenchmark/SimCLUE
3000000+语义理解与匹配数据集。可用于无监督对比学习、半监督学习等构建中文领域效果最好的预训练模型
CLUEbenchmark/SuperCLUElyb
SuperCLUE琅琊榜:中文通用大模型匿名对战评价基准
CLUEbenchmark/PyCLUE
Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark
CLUEbenchmark/SuperCLUE-Llama2-Chinese
Llama2开源模型中文版-全方位测评,基于SuperCLUE的OPEN基准 | Llama2 Chinese evaluation with SuperCLUE
CLUEbenchmark/SuperCLUE-Safety
SC-Safety: 中文大模型多轮对抗安全基准
CLUEbenchmark/SuperCLUE-Agent
SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准
CLUEbenchmark/SuperCLUE-Open
中文通用大模型开放域多轮测评基准 | An Open Domain Benchmark for Foundation Models in Chinese
CLUEbenchmark/SuperCLUE-RAG
中文原生检索增强生成测评基准
CLUEbenchmark/MobileQA
离线端阅读理解应用 QA for mobile, Android & iPhone
CLUEbenchmark/modelfun
一站式自动化开源标注平台
CLUEbenchmark/SuperCLUE-Math6
SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅
CLUEbenchmark/SuperCLUE-Auto
汽车行业中文大模型测评基准,基于多轮开放式问题的细粒度评测
CLUEbenchmark/LGEB
LGEB: Benchmark of Language Generation Evaluation
CLUEbenchmark/SuperCLUE-Llama3-Chinese
Llama3开源模型中文版-全方位测评,基于SuperCLUE基准 | Llama3 Chinese Evaluation with SuperCLUE
CLUEbenchmark/SuperCLUE-Video
中文原生多层次文生视频测评基准
CLUEbenchmark/SuperCLUEgkzw
SuperCLUE高考作文机器自动阅卷系统
CLUEbenchmark/SuperCLUE-Role
SuperCLUE-Role中文原生角色扮演测评基准
CLUEbenchmark/SuperCLUE-Industry
中文原生工业测评基准
CLUEbenchmark/SuperCLUE-Code3
中文原生等级化代码能力测试基准
CLUEbenchmark/SuperCLUE-Fin
中文金融大模型测评基准,六大类二十五任务、等级化评价,国内模型获得A级
CLUEbenchmark/SuperCLUE-Image
中文原生文生图测评基准
CLUEbenchmark/SuperCLUE-ICabin
汽车智能座舱大模型测评基准
CLUEbenchmark/SuperCLUE-Long
中文原生长文本测评基准