Question About Spurious OOD Dataset Generation for WaterBirds

Question

Question About Spurious OOD Dataset Generation for WaterBirds

Closed this issue 3 years ago · 4 comments

Hi,

Thanks for your nice work in pointing out the impact of spurious effect on the OOD. I have a question about the generation of OOD dataset for WaterBirds.

The code seems that the OOD dataset is directly the Place365 dataset instead of the example shown in the figure 1 (Right), a boat in the lake. I think the shown example in the paper indicates that the Spurious OOD figure should contain an OOD object inside. Is my understanding correct? Feel free to correct me if there is any problem. Thanks!

Answer 1 · 2022-04-18T14:17:18.000Z

Thanks for raising this issue. Yes, as also mentioned in readme, generate_placebg.py is used to generate background images. Feel free to place different OOD objects of your interest on top of background images and see how the detection performance is influenced. In our initial experiments, we tried several objects but it turns out they do out have much impact on the performance especially when the network relies on background spurious correlation during training.

Answer 2 · 2022-04-19T02:43:47.000Z

Thanks for your reply and it's clear. One follow-up question is what kinds of the 'topping' objects/datasets you used in the trials? I would like to try with different 'toppings' as well.

Answer 3 · 2022-04-23T06:52:59.000Z

Thanks for the question! By the definition of OOD, objects with labels other than the ID labels are considered valid OOD. For Waterbirds, objects like "boat", "cat" or "dogs" are all suitable toppings.

Answer 4 · 2022-04-23T17:29:20.000Z

Clear. Thanks for your reply!