/ICLR_VTNet

Primary LanguagePython

VTNet: Visual Transformer Network for Object Goal Navigation

Installation

The code is tested with Ubuntu18.04 and CUDA10.2.

pip install -r requirements.txt

Training

Before pre-training the VT, you could download the dataset here.

Pre-training

python main_pretraining.py --gpu-ids 0 --workers 4 --model PreTrainedVisualTransformer --detr --title a3c --work-dir ./work_dirs/

The training dataset could be downloaded here and the link of DETR features is here.

A3C training

python main.py --gpu-ids 0 --workers 4 --model VTNetModel --detr --title a3c_vtnet --work-dir ./work_dirs/

Testing

python full_eval.py --gpu-ids 0 --detr --save-model-dir {SAVE_MODEL_DIR} --results-json ./result.json --model VTNetModel --title a3c_previstrans_base

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{
    du2021vtnet,
    title={{\{}VTN{\}}et: Visual Transformer Network for Object Goal Navigation},
    author={Heming Du and Xin Yu and Liang Zheng},
    booktitle={International Conference on Learning Representations},
    year={2021},
    url={https://openreview.net/forum?id=DILxQP08O3B}
}