[Arxiv] [Paper] [Codes] [Benchmark]
Sentiment analysis is a crucial task that aims to understand people's emotional states and predict emotional categories based on multimodal information. It consists of several subtasks, such as emotion recognition in conversation (ERC), aspect-based sentiment analysis (ABSA), and multimodal sentiment analysis (MSA). However, unifying all subtasks in sentiment analysis presents numerous challenges, including modality alignment, unified input/output forms, and dataset bias. To address these challenges, we propose a Task-Specific Prompt method to jointly model subtasks and introduce a multimodal generative framework called UniSA. Additionally, we organize the benchmark datasets of main subtasks into a new Sentiment Analysis Evaluation benchmark, SAEval. We design novel pre-training tasks and training methods to enable the model to learn generic sentiment knowledge among subtasks to improve the model's multimodal sentiment perception ability. Our experimental results show that UniSA performs comparably to the state-of-the-art on all subtasks and generalizes well to various subtasks in sentiment analysis.
Our work represents a step forward in the unified modeling of sentiment analysis subtasks, and we hope that it will inspire future research in this direction. We encourage more researchers to join the study on multi-tasks unified modeling for sentiment analysis, and contribute their wisdom to build sentiment intelligence.
-
Clone the repository recursively
git clone https://github.com/dawn0815/UniSA.git
-
Create conda environment
cd config conda env create -f environment.yaml
In this paper, we use the proposed SAEval Benchmark to train and evaluate our UniSA. The SAEval is a benchmark for sentiment analysis to evaluate the model's performance on various subtasks. All datasets were standardized to the same format and divided into training, validation and test sets. For more information about SAEval Benchmark, please refer to the link.
Here we provide download link for some of the processed datasets. Please obtain licenses for the original datasets before downloading our processed data!
For IEMOCAP, MELD, MOSI, and MOSEI, you can download textual data via the following link:
and, you can download the multimodal features according to this link.
You can download the model weights we provide for research only!
Model | Stage | File Size | Link |
---|---|---|---|
model_0 | pretrain_stage_1 | 1.7G | [Google Drive] |
model_1 | pretrain_stage_2 | 1.7G | [Google Drive] |
model_2 | finetune | 3.4G | [Google Drive] |
- Download the model weights and unzip the folder
- Create a folder named
checkpoint_saved
under this repos - Move the model weights folder into
checkpoint_saved
The entire training process is divided into two phases of pre-training and fine-tuning. You can train your own model step by step according to steps 1,2,3 or you can use the model weights we provide for fine-tuning/inference.
Before you proceed to the following steps, you need to prepare your dataset according to SAEval.
-
pretrain-stage-1
bash run_pretrain_stage1.sh
-
pretrain-stage-2
bash run_pretrain_stage2.sh
-
fine-tune
bash run_finetune.sh
-
inference
bash run_inference.sh
-
few-shot
bash run_few_shot.sh
Here are the experimental results of UniSA on the SAEval compared to SOTA models of various subtasks.
We encourage more researchers to join the study on multi-tasks unified modeling for sentiment analysis, and contribute their wisdom to build sentiment intelligence.
If you found this work useful, consider giving this repository a star and citing our paper as followed:
@inproceedings{li-unisa,
author = {Li, Zaijing and Lin, Ting-En and Wu, Yuchuan and Liu, Meng and Tang, Fengxiao and Zhao, Ming and Li, Yongbin},
title = {UniSA: Unified Generative Framework for Sentiment Analysis},
year = {2023},
publisher = {Association for Computing Machinery},
url = {https://doi.org/10.1145/3581783.3612336},
booktitle = {Proceedings of the 31st ACM International Conference on Multimedia},
pages = {6132–6142},
numpages = {11},
series = {MM '23}
}
- Thank UniMSE for their contributions to the processing of datasets (e.g., MOSI, MOSEI, IEMOCAP and MELD).
- Thank KM-BART for their contributions to the multi-modal BART framework.
- Thanks Tianshu Yu for his constructive comments.
UniSA is released without any restrictions but restrictions may apply to individual tasks (which are derived from existing datasets) or backbone (e.g., GPT2, T5, and BART). We refer users to the original licenses accompanying each dataset and backbone.