Here is the official Pytorch implementation of AGM proposed in "Boosting Multi-modal Model Performance with Adaptive Gradient Modulation".
Paper Title: Boosting Multi-modal Model Performance with Adaptive Gradient Modulation
Authors: Hong Li * , Xingyu Li * , Pengbo Hu, Yinuo Lei, Chunxiao Li, Yi Zhou
Accepted by: ICCV 2023
[arXiv]
This dataset can be downloaded from here.
This dataset can be downloaded from here. Data preprocessing can refer to here.
This raw dataset can be downloaded from here. Also, the processed data can be obtained from here.
This dataset can be downloaded from here.
This dataset can be downloaded from here.
- Python: 3.9.13
- CUDA Version: 11.3
- Pytorch: 1.12.1
- Torchvision: 0.13.1
To train the model using the following command:
python main.py --data_root '' --device cuda:0 --methods Normal --modality Multimodal --fusion_type late_fusion --random_seed 999 --expt_dir checkpoint --expt_name test --batch_size 64 --EPOCHS 100 --learning_rate 0.0001 --dataset AV-MNIST --alpha 2.5 --SHAPE_contribution False
@article{li2023boosting,
title={Boosting Multi-modal Model Performance with Adaptive Gradient Modulation},
author={Li, Hong and Li, Xingyu and Hu, Pengbo and Lei, Yinuo and Li, Chunxiao and Zhou, Yi},
journal={arXiv preprint arXiv:2308.07686},
year={2023}
}