Question about 13th band

Question

Question about 13th band

PlekhanovaElena opened this issue 7 months ago · 2 comments

Hi there,

I've noticed that there is in the code of transforms.py in the get_pretrained_s2_train_transform function there is an imput of 0s-filled B10 band:

B10 = np.zeros((1, *image.shape[1:]), dtype=image.dtype)
image = np.concatenate([image[:10], B10, image[10:]], axis=0)

I'm just curious - why do you do this?

Kind regards,
Elena

Answer 1 · 2024-05-30T18:37:00.000Z

The SSL4EO vision encoders we use are pretrained on 13 channels (https://torchgeo.readthedocs.io/en/stable/api/models.html#sentinel-2), but our S2-100K inputs are just 12 channels so we zero-pad one channel.

Answer 2 · 2024-06-03T06:24:26.000Z

Aha, got it, thank you for the explanation!