NVlabs/SegFormer

How the final output is converted to full resolution mask?

working12 opened this issue · 0 comments

If the input is HxWx3 (RGB Image) the output is (H/4 x W/4 x num_classes). But the ground truth mask is (H,W,num_classes). Can you point out the code where this is being done? I am unable to locate that portion after reading the code multiple times.