/cnSoftBei

Primary LanguagePythonMIT LicenseMIT

cnSoftBei 2021 News Classification

Dataset Description

toutiao.txt

Notice that this file is encoded in UTF-8, so make sure that you have set the correct encoding.

Data Format:

Label Content Keyword 1 ... Keyword n
其他 京城最值得你来场文化之旅的博物馆 保利集团 ... 新**

Labels:
财经、房产、教育、科技、军事、汽车、体育、游戏、娱乐、其他

These labels are totally same as labels in the official dataset.