When VRP encoder is DINOv2, how to define mid-level and high-level features?

Question

When VRP encoder is DINOv2, how to define mid-level and high-level features?

suyan451 opened this issue 6 months ago · 1 comments

Answer 1 · 2024-07-15T08:09:28.000Z

self.layer1, self.layer2, self.layer3, self.layer4 = nn.Sequential(*self.dinov2.blocks[:8]), nn.Sequential(*self.dinov2.blocks[8:12]), nn.Sequential(*self.dinov2.blocks[12:21]), nn.Sequential(*self.dinov2.blocks[21:24])