Issues
- 0
有文件缺失/损坏
#28 opened by BestAnHongjun - 0
- 4
图文数据集multi-modal下的15M和22M是什么关系
#13 opened by Legen927 - 0
数据集噪声很大
#26 opened by vanpersie32 - 2
教材数据集TextBook-cn无法下载
#23 opened by forgottenlodger - 0
目前数据集无法下载,该怎么解决一下
#25 opened by zhangfan-algo - 0
请问能提供一下ChinaNews数据集每一个id对应文章的日期吗?
#24 opened by mzh1996 - 1
请问common crawl数据的年份是?有做句子级的清洗吗?
#15 opened by guang11644331 - 0
请问有没有清洗前的数据可以提供下载
#22 opened by loredunk - 0
WanJuan1.0 data error
#21 opened by simplew2011 - 0
数据清洗
#20 opened by mynameischaos - 0
视频数据中,有台词或者其他文本信息嘛?
#19 opened by lucasjinreal - 2
- 0
请问提供了分词器tokenizer 的下载吗?
#18 opened by JerryDaHeLian - 1
怎么作为训练数据给到模型微调
#14 opened by gaoasi - 1
你好,可以问下为什么下载链接打不开吗?
#16 opened by berooo - 3
数据集下载链接404了
#2 opened by silverriver - 1
multi-modal中的图片总量有多少呢?
#9 opened by guozhiyao - 1
数据集和开源数据重复问题
#10 opened by songge25 - 1
考题数据std_ans字段含义?
#11 opened by JingyiWang3 - 1
请问law-cn中的数据均为法律文书吗
#12 opened by zhangyu68 - 2
- 2
请问文本数据集哪里可以下载呢。
#1 opened by enze5088 - 2
数据集下载地址无法下载
#3 opened by CoderJackZhu - 1
how to download?
#4 opened by hzy312 - 1
where is the dataset download link?
#6 opened by lucasjinreal - 1
- 0