This repository contains the data for the paper "Span Detection for Vietnamese Aspect-Based Sentiment Analysis".
UIT-ViSD4SA is a benchmark dataset of Vietnamese smartphone feedback for aspect-based sentiment analysis (ABSA) and span detection. It consists of 35,396 human-annotated spans across 11,122 feedback comments; each comment is manually annotated with spans labeled by one of ten fine-grained aspect categories and a sentiment polarity. We randomly split the dataset into a training set (7,784 comments), a development set (1,113) and a test set (2,225).
!pip install jsonlines

import jsonlines

data = []
with jsonlines.open('train.jsonl') as f:
    for line in f.iter():
        # Keep each comment as a (text, {'labels': ...}) pair.
        data.append((line['text'], {'labels': line['labels']}))
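Once the records are loaded, the annotated spans can be turned back into text. The sketch below is only an illustration under stated assumptions: it assumes each entry in `labels` is a `[start, end, tag]` triple with character offsets into `text` and a tag of the form `ASPECT#POLARITY`. Check this against a record from your copy of the data before relying on it. The helper name `extract_spans` and the sample record are hypothetical and are not taken from the dataset.

```python
def extract_spans(text, labels):
    """Return (span_text, aspect, polarity) for each label.

    Assumption: every label is [start, end, 'ASPECT#POLARITY'],
    where start/end are character offsets into `text`.
    """
    spans = []
    for start, end, tag in labels:
        # Split the tag at the first '#' into aspect and polarity.
        aspect, _, polarity = tag.partition('#')
        spans.append((text[start:end], aspect, polarity))
    return spans


# Hypothetical record, invented for illustration only.
sample_text = 'pin trau, camera hoi mo'
sample_labels = [[0, 8, 'BATTERY#POSITIVE'], [10, 23, 'CAMERA#NEGATIVE']]

for span_text, aspect, polarity in extract_spans(sample_text, sample_labels):
    print(aspect, polarity, repr(span_text))
```

With the `data` list built above, the same helper could be applied as `extract_spans(text, ann['labels'])` for each `(text, ann)` pair, provided the label layout matches the assumption.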
Please cite the following paper if you find this dataset useful in your work.