allenai/satlas

Why does the marine infra dataset contain extra TCI images when the config specifies that 3 time steps are used?

Thanks again for all the great work on satlas! I have a question about the marine infra dataset.

I think this config line indicates that the tci, virtual:overlap_tci_0, and virtual:overlap_tci_1 files are used for training:

"Channels": ["tci", "fake", "fake", "virtual:overlap_tci_0", "fake", "fake", "virtual:overlap_tci_1", "fake", "fake"],

but the v1 marine infra dataset contains 5 tci files:

satlas-dataset-marine-infrastructure-v1/1302_2844_f0d695513b9e44ea873e2a1cecce6f3f
├── gt.json
└── images
    └── f0d695513b9e44ea873e2a1cecce6f3f
        ├── tci.png
        ├── virtual:overlap_tci_0.png
        ├── virtual:overlap_tci_1.png
        ├── virtual:overlap_tci_2.png
        └── virtual:overlap_tci_3.png


Are the other two tci files used in training or inference?

We started with five images for annotation, but we ended up targeting the annotations towards the first three images in the time series and trained using those same images. Many later annotations only used three images.

So it is correct that the other two are not used in training. During inference, input only the latest three images at each location.
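
For anyone who wants to check this against a downloaded tile, here is a rough sketch (not code from the satlas repo) that compares the "Channels" list quoted above with the PNG files in an image directory. It assumes that "fake" entries are placeholders with no file on disk and that each remaining entry matches a file name without the .png extension; the helper names `referenced_images` and `split_image_files` are made up for this example.

```python
from pathlib import Path

# Channel list copied from the config line quoted in the question.
CHANNELS = [
    "tci", "fake", "fake",
    "virtual:overlap_tci_0", "fake", "fake",
    "virtual:overlap_tci_1", "fake", "fake",
]


def referenced_images(channels):
    """Return the distinct non-"fake" channel names, in first-seen order.

    Assumption: "fake" marks a placeholder channel with no image file.
    """
    seen = []
    for name in channels:
        if name != "fake" and name not in seen:
            seen.append(name)
    return seen


def split_image_files(image_dir, channels):
    """Partition the PNG files in image_dir into (used, unused) lists.

    A file counts as "used" when its name without the .png suffix
    appears in the channel list.
    """
    wanted = set(referenced_images(channels))
    used, unused = [], []
    for path in sorted(Path(image_dir).glob("*.png")):
        (used if path.stem in wanted else unused).append(path.name)
    return used, unused
```

Calling `split_image_files` on the `images/f0d695513b9e44ea873e2a1cecce6f3f` directory from the listing above should put `virtual:overlap_tci_2.png` and `virtual:overlap_tci_3.png` in the unused list, which matches the explanation that only the first three images are used.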

got it, thank you!