/Learning_Similarity_between_Graphs_Images

Pytorch Implementation of Learning Similarity between Scene Graphs and Images with Transformers (GICON))

Primary LanguagePythonMIT LicenseMIT

Learning Similarity between Scene Graphs and Images

PyTorch Implementation of the Paper Learning Similarity between Scene Graphs and Images with Transformers.

The project page is AVAILABLE!

Now it is just the draft version. You can use the code to evaluate your SGG model:D

1. Installation

Download GICON Repo with:

git clone https://github.com/yrcong/Learning_Similarity_between_Graphs_Images.git

To run the code, we use

python==3.7
pytorch==1.11.0
torchvision==0.12.0 

2. Data Preparation for Visual Genome:

a) Download the the images of Visual Genome Part1 and Part2. Unzip and place all images in a folder data/vg/images/

b) Download the annotations of VG (in COCO-format) and unzip it in the data/ forder.

c) Download the pretrained model location_bound and location_free and put the checkpoints under ckpt/.

3. TODO: OpenImages

4. Benchmarking

a) Create a pickle file containing the predictions of the scene graph generation model. We provide a template (BGNN prediction PKL file), You can download it and save it under the working path.

please check the data format and ensure your prediction file is consistent with this template!

To compute R-Precision for Locaiton-Free Graphs (K=100):

python benchmark.py --batch_size 100 --image_layer_num 6 --graph_layer_num 6 --resume ckpt/location_free.pth --eval --prediction bgnn_prediction.pkl

To compute R-Precision for Locaiton-Bound Graphs (K=100):

python benchmark.py --batch_size 100 --image_layer_num 6 --graph_layer_num 6 --resume ckpt/location_bound.pth --eval --prediction bgnn_prediction.pkl --node_bbox

5. TODO: Training

6. TODO: Evaluation