chinese-dataset
There are 12 repositories under chinese-dataset topic.
brightmart/nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
chaoswork/sft_datasets
开源SFT数据集整理,随时补充
zake7749/Gossiping-Chinese-Corpus
PTT 八卦版問答中文語料
secsilm/zi-dataset
汉字数据集,包括汉字的相关信息,例如笔画数、部首、拼音、英文释义/同义词等。
CLUEbenchmark/QBQTC
QBQTC: 大规模搜索匹配数据集
lvyufeng/SciBERT_CN
Pretrained model for Chinese Scientific Text
Eurus-Holmes/CHABCNet
[CHABCNet] ABCNet on the Chinese dataset, building on Detectron2 (Facebook AI Research)
hsinmin/HanSig
A large-scale offline Chinese handwritten signature dataset
seanpm2001/AI2001_Category-Linguistics-SC-Chinese-Traditional
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🔠️🔢️ The linguistic:Chinese-Traditional category for AI2001, containing Chinese (Traditional) language linguistic datasets
seanpm2001/AI2001_Category-Linguistics-SC-Chinese-Simplified
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🔠️🔢️ The linguistic:Chinese-Simplified category for AI2001, containing Chinese (Simplified) language linguistic datasets
DwendwenHappy/Chumor
Chumor 1.0: A Truly Funny and Challenging Chinese Humor Understanding Dataset from Ruo Zhi Ba