about dataset
sjm1992st opened this issue · 1 comments
There are four files in the folder data_"preprocessed", and i think the 0, 1, 2 in the opinion.txt should be the pos,neg,neu, said in the paper. But i don't understand what's the meanful about 0, 1, 2 in the target.txt and the 0, 1, 2, 3, 4 in the target_polarity.txt?
The 0, 1, 2 in target.txt and opinion.txt give the annotations of aspect terms and opinion terms respectively. A word labeled with 0 means that it is outside the span of an aspect (opinion) term. A word labeled with 1 indicates that it is the beginning word of an aspect (opinion) term, and a word labeled with 2 indicates that it is inside of an aspect (opinion) term.
In target_polarity.txt, 0 means that the word is outside the span of any aspect term as only aspect terms have sentiment labels. 1, 2, 3, 4 refer to pos, neg, neu, and conflict sentiment polarity respectively.
You can refer to section 3.1 and 4.1 in the paper for more details on the data preparation and experimental details.