/CubeMLP

The implementation of CubeMLP

Primary LanguagePython

CubeMLP

The implementation of CubeMLP: A MLP-based Model for Multimodal Sentiment Analysis and Depression Estimation (Accepted as full paper in ACM Multimedia 2022)

The CubeMLP is a pure MLP structure for multimodal processing, which is simple yet effective. Specifically, CubeMLP consists of three independent MLP units, each of which has two affine transformations. CubeMLP accepts all relevant modality features as input and mixes them across three axes (sequential axis, channel axis, and modality axis). After extracting the characteristics using CubeMLP, the mixed multimodal features are flattened for task predictions. The overview of CubeMLP is shown as: CubeMLP Overview And each MLP unit is composed of two separate fully-connected layer with activations: CubeMLP Detail

The codes require the PyTorch and numpy installation.

To run the codes:

python Train.py --dataset mosi_SDK --batch_size 128 --features_compose_t mean --features_compose_k cat --d_hiddens 50-3-128=10-3-32  --d_outs 50-3-128=10-3-32 --res_project 1-1 --bias --ln_first  --dropout_mlp 0.1-0.1-0.1 --dropout 0.1-0.1-0.1-0.1 --bert_freeze part --bert_lr_rate 0.01  --learning_rate 4e-3

The final results may be slightly fluctuating, but the overall results should correspond to the metrics in the paper.(Sometimes, the results are even better than the paper's.)

The processed MOSI and MOSEI dataset has been uploaded to Google Drive: https://drive.google.com/drive/folders/1MNp1qycJLfY87xUDouNU9gm2O5BsYRFE and Baidu Disk: https://pan.baidu.com/s/1CRbE4rPUhEfmCysuY4_q3A (Code: rif6).

If you have any questions, just contact me: sunhaoxx@zju.edu.cn or create an issue in this Github repository.