jasongief/APL

[2024 AAAI] Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering


Object-aware Adaptive-Positivity Learning for Audio-Visual Question Answering (AAAI 2024) [arXiv]

Authors: Zhangbin Li, Dan Guo, Jinxing Zhou, Jing Zhang, and Meng Wang

Requirements

Python 3.7+
PyTorch 1.7.1
numpy
ast (part of the Python standard library, no separate install needed)
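
To confirm an environment matches the list above before training, a small check like the following can help. This is a minimal sketch and not part of the repository; the function name `check_environment` and the constants are hypothetical, and it only tests that the modules are importable, not that PyTorch is exactly version 1.7.1.

```python
import importlib.util
import sys

# Minimum interpreter version taken from the requirements list above.
REQUIRED_PYTHON = (3, 7)
# Import names for the listed packages ("torch" is the import name of PyTorch).
REQUIRED_MODULES = ("torch", "numpy", "ast")


def check_environment(modules=REQUIRED_MODULES, min_python=REQUIRED_PYTHON):
    """Return a list of problems found; an empty list means the check passed."""
    problems = []
    if sys.version_info < min_python:
        wanted = ".".join(str(part) for part in min_python)
        problems.append(f"Python {wanted}+ required")
    for name in modules:
        # find_spec returns None when a top-level module cannot be located.
        if importlib.util.find_spec(name) is None:
            problems.append(f"missing module: {name}")
    return problems
```

Calling `check_environment()` and printing the returned list shows which requirements, if any, are still missing.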

Usage

Cloning this repo

git clone https://github.com/zhangbin-ai/APL.git

Getting Started

Training

python train.py \
--batch-size 64 \
--epochs 30 \
--lr 1e-4 \
--gpu 0 \
--checkpoint APL_dir \
--checkpoint_file checkpoint01 \
--save_model_flag True
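
For readers wondering how flags like these are typically consumed, here is a hedged sketch of an `argparse` parser that accepts the same options. It is not the repository's actual `train.py`: the types, defaults, and the `str2bool` helper are assumptions that simply mirror the values in the command above. The helper is included because `argparse` with `type=bool` treats any non-empty string, including "False", as true.

```python
import argparse


def str2bool(value):
    """Parse 'True'/'False'-style strings into a real boolean (hypothetical helper)."""
    if isinstance(value, bool):
        return value
    text = value.strip().lower()
    if text in ("true", "1", "yes", "y"):
        return True
    if text in ("false", "0", "no", "n"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser():
    """Build a parser for the flags shown in the training command (assumed types)."""
    parser = argparse.ArgumentParser(description="Illustrative training options")
    parser.add_argument("--batch-size", type=int, default=64)
    parser.add_argument("--epochs", type=int, default=30)
    parser.add_argument("--lr", type=float, default=1e-4)
    parser.add_argument("--gpu", type=str, default="0")
    parser.add_argument("--checkpoint", type=str, default="APL_dir")
    parser.add_argument("--checkpoint_file", type=str, default="checkpoint01")
    parser.add_argument("--save_model_flag", type=str2bool, default=True)
    return parser
```

With this sketch, `build_parser().parse_args(["--save_model_flag", "False"])` yields `save_model_flag` as the boolean `False`, and `--batch-size` is exposed as the attribute `batch_size`.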

Testing
```
python test.py
```

Citing Us

If you find this work useful, please consider citing it.

@article{li2023object,
 title={Object-aware Adaptive-Positivity Learning for Audio-Visual Question Answering},
 author={Li, Zhangbin and Guo, Dan and Zhou, Jinxing and Zhang, Jing and Wang, Meng},
 journal={arXiv preprint arXiv:2312.12816},
 year={2023}
}

Acknowledgement

The computation is supported by the HPC Platform of Hefei University of Technology.