lucazanella/lavad

A problem about step 2: create_index for XD.

Closed this issue · 2 comments

Hello, thank you for your work. I ran into a problem while reproducing the experiments:
As mentioned in your supplementary material, for the XD-Violence dataset, using blip2-flan-t5-xxl alone gives the best results. However, slurm/xd_violence/02_create_index.sh still loads the captions generated by all five models at the same time. How should I modify the code?
Looking forward to your reply, thank you!

# in slurm/xd_violence/02_create_index.sh
cap_model_names=(
    "$xd_violence_dir/captions/raw/Salesforce/blip2-opt-6.7b-coco/"
    "$xd_violence_dir/captions/raw/Salesforce/blip2-opt-6.7b/"
    "$xd_violence_dir/captions/raw/Salesforce/blip2-flan-t5-xxl/"
    "$xd_violence_dir/captions/raw/Salesforce/blip2-flan-t5-xl/"
    "$xd_violence_dir/captions/raw/Salesforce/blip2-flan-t5-xl-coco/"
)

# in src/models/create_index.py
for captions_dir in captions_dirs:
    captions_dir = Path(captions_dir)
    cap_model_name = captions_dir.name
    assert cap_model_name in CAP_MODEL_NAMES  # five models

Can I temporarily delete the other models directly in the code?
I am a student and my GPU resources are very limited, so I cannot afford much trial and error. Looking forward to your reply, thank you very much!

# in slurm/xd_violence/02_create_index.sh
cap_model_names=(
    "$xd_violence_dir/captions/raw/Salesforce/blip2-flan-t5-xxl/"
)

Hi, and thank you for your interest! As you correctly pointed out, the current script creates an index containing captions from all blip2 models, but for XD-Violence you should only use captions from blip2-flan-t5-xxl. Indeed, index_name is set to flan-t5-xxl in slurm/xd_violence/03_clean_captions.sh, which ensures that it uses the index created from blip2-flan-t5-xxl captions only.

The snippet of code you suggest:

cap_model_names=(
    "$xd_violence_dir/captions/raw/Salesforce/blip2-flan-t5-xxl/"
)

is correct.
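As a rough check that the assertion quoted from src/models/create_index.py should still pass with a single directory, here is a minimal sketch. The contents of CAP_MODEL_NAMES, the helper name validate_captions_dirs, and the /data/xd_violence path are assumptions made for illustration; the actual definitions in the repository may differ.

```python
from pathlib import Path

# Assumption: CAP_MODEL_NAMES holds the five BLIP-2 directory names from the
# shell script; the real constant in create_index.py may be defined differently.
CAP_MODEL_NAMES = [
    "blip2-opt-6.7b-coco",
    "blip2-opt-6.7b",
    "blip2-flan-t5-xxl",
    "blip2-flan-t5-xl",
    "blip2-flan-t5-xl-coco",
]


def validate_captions_dirs(captions_dirs):
    """Hypothetical helper mirroring the loop quoted in the question."""
    names = []
    for captions_dir in captions_dirs:
        captions_dir = Path(captions_dir)
        # Path ignores the trailing "/" so .name is the last directory name.
        cap_model_name = captions_dir.name
        # Membership check: each name must be known, not all names present.
        assert cap_model_name in CAP_MODEL_NAMES
        names.append(cap_model_name)
    return names


# Placeholder path standing in for $xd_violence_dir.
single = ["/data/xd_violence/captions/raw/Salesforce/blip2-flan-t5-xxl/"]
print(validate_captions_dirs(single))
```

Because the check is a membership test on each directory name, passing only the blip2-flan-t5-xxl directory should not trip it, provided the rest of create_index.py does not rely on all five models being present.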

Thank you for pointing that out!

Thank you very much!