This is Dataset of our paper Hashtags, Emotions, and Comments: A Large-Scale Dataset to Understand Fine-Grained Social Emotions to Online Topics [pdf]
You can download it from OneDrive
There are two files of our datasets:
- hashtag&emotion.txt It contains hashtags and their emotion votes.
- hashtag&comment.txt It contains user comments involved in the discussion initialized by a hashtag.
The data structure is described in the following.
- hashtag&emotion.txt Each line consists of six fields: , , <Total # of Voters>, <Rank 1 Emotion>, <Rank 2 Emotion>, <Rank 3 Emotion>. Fields are devided by Tab. The top three emotions shown with the emoji (in []) and # of voters, seperated with a colon.
- hashtag&comment.txt Each line consists of three files: , , . Fields are divided by Tab.
- Both hashtag&emotion.txt and hashtag&comment are in Chinese and encoded with UTF-8.
- The data of our dataset is sorted in alphabetical order.
- The dataset is released under a Creative Commons Attribution 3.0 Unported License (http://creativecommons.org/licenses/by/3.0/).