/MultiModal-SentimentAnalysis

This GitHub repository hosts the code and resources for a Multi-Modal Sentiment Analysis project. Sentiment analysis is a crucial task in natural language processing, and this project takes a multi-modal approach by combining both text and image data to better understand and classify sentiment.

Primary LanguageJupyter NotebookMIT LicenseMIT

MultiModal-SentimentAnalysis

Project Structure

├─ data        # Data directory containing text and image data
├─ dataset     # Module constructing Dataset class for iteration
├─ models      # Main models directory
│  ├─ attentionModel
│  ├─ baseline
│  ├─ flava
├─ output      # Output directory containing predicted files and relevant figures
├─ papers      # Directory for storing relevant papers
├─ preprocess  # Data preprocessing module
├─ test        # Directory for testing
├─ .gitignore
├─ constant.py 
├─ flava.ipynb        # FLAVA model related file
├─ LICENSE
├─ main.py            # Core file
├─ predict.sh         # Prediction script
├─ README.md 
├─ requirements.txt 
├─ test_without_label.txt
├─ train.sh           # Training script
├─ train.txt
├─ utils.py           # Store Trainer and other utility functions
└─ __init__.py

Models

Baseline

Attention Model

FLAVA

Results

  • Baseline

    • Cat: 65.00%

      • only_img: 65.75%
      • only_txt: 58.00%
    • Add: 65.00%

  • EncoderLayer: 63.38%

How to Run

  1. For training, simply run:
bash train.sh

You can choose the model type and adjust other hyperparameters as needed.

  1. For prediction, run:
bash predict.sh

This script will generate the predictions and output files to the output/test.txt