/UIT-ViSD4SA

ViSD4SA, a Vietnamese Span Detection for Aspect-based sentment analysis dataset

UIT-ViSD4SA PACLIC 35

General Introduction

This repository contains the data of the paper: Span Detection for Vietnamese Aspect-Based Sentiment Analysis.

UIT-ViSD4SA is a benchmark Vietnamese smartphone feedback dataset for ABSA and span detection. UIT-ViSD4SA consisting of 35,396 human-annotated spans on 11,122 feedback comments, and each is manually annotated according to its spans towards ten fine-grained aspect categories with sentiment polarities. We split the dataset into a training set (7,784), a development set (1,113) and a test set (2,225) randomly.

Data Example

examle

Read File

!pip install jsonlines

import jsonlines

data = []

with jsonlines.open('train.jsonl') as f:

    for line in f.iter():
       
        data.append((line['text'], {'labels': line['labels']}))

Citation

Please cite the following paper if you found it useful in your work.

Contact