This repository contains a pytorch implementation of the paper "Multiple Images Joint Learning for Image Aesthetic Assessment"(Subject to ToMM)
We proposed MILNet employ multiple images for aesthetic assessment. First, we adopt semantic information retrieval reference; Then, we use Graph Convolution Network(GCN) to reason the relation of different nodes; Finally, we utilize the improved loss function AdaEMD to stabilize training process.
- System: Linux(e.g. Ubuntu/CentOS/Arch), macOS, or Windows Subsystem of Linux (WSL)
- Python version >=3.6
- Pytorch == 1.10.2 Cuda == 10.2
- TensorboardX
- Opencv == 4.5.5
- Clone repo
git clone https://github.com/dafeigediaozhatian/MILNet
cd MILNet
- Install dependencies(pytorch, scikit-learn, opencv-python, pandas. Recommend to use Anaconda.)
# Create a new conda environment
conda create -n MILNet python==3.8
conda activate MILNet
# Install other packages
pip install -r requirements.txt
-
AVA dataset(19998)
- Download the original AVA dataset and image featre split into data/. The directory structure should be like:
data feature_16928_dataloader.py gcn_dataloader_6144.py ILG_train.txt ILG_test.txt
- Download the original AVA dataset and image featre split into data/. The directory structure should be like:
-
AVA dataset(19928)
- the same as AVA dataset(19998)
-
Extract feature We extract the AVA dataset to MLSP, and feature reduce to 6144 for training. all the files can be download on here,code is #ddkd#.
Traning scripts for two datasets can be found in #MILNet.py#. The dataroot argument should be modified to path_to_<dataset_name>. Run the follwing command for training:
# Training and test on AVA_19998
python MILNet.py
@inproceedings{shi2022multiple,
title={Multiple image joint learning for image aesthetic assessment},
author={Tengfei Shi, Zhenyu Wu, Chenglizhao Chen, Aiming Hao, Yuming Fang},
booktitle={ToMM},
year={2023}
}