question about the input img size
yuzehui1996 opened this issue · 8 comments
Hey! I am confused about the input size of the training data. I am training on a dataset whose input size is (3, 112, 112). Do I have to make some changes to the model?
This model is made for CIFAR, whose inputs are (3, 32, 32), so the final average pooling operation will not work for larger inputs. You can just replace it with .mean(-1).mean(-1). It does the same thing as average pooling, but without complaining about the image size.
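To illustrate the equivalence, here is a minimal sketch (the tensor shape is an assumption, chosen to mimic a feature map): averaging over both spatial dimensions gives the same result as a full-size average pooling window, but works for any input size.

```python
import torch
import torch.nn.functional as F

# Hypothetical feature map: batch 2, 64 channels, 28x28 spatial
x = torch.randn(2, 64, 28, 28)

# Average pooling with a window equal to the spatial size (global pooling);
# this only works if the kernel size matches the feature map exactly
pooled = F.avg_pool2d(x, x.shape[-1]).view(2, 64)

# Size-agnostic equivalent: mean over the last two (spatial) dimensions
mean_pooled = x.mean(-1).mean(-1)

print(torch.allclose(pooled, mean_pooled))  # True
```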
I can't follow what you mean. Could you please explain it?
x = self.conv_1_3x3.forward(x)
x = F.relu(self.bn_1.forward(x), inplace=True)
x = self.stage_1.forward(x)
x = self.stage_2.forward(x)
x = self.stage_3.forward(x)
x = F.avg_pool2d(x, 8, 1)
x = x.view(-1, self.stages[3])
return self.classifier(x)
Do you mean the F.avg_pool2d(x, 8, 1) line?
Yes! Just replace it with x.mean(3).mean(2)
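A minimal sketch of the suggested change (the channel count and spatial size are assumptions, standing in for the stage_3 output with a 112x112 input):

```python
import torch

# Hypothetical stage_3 output for a (3, 112, 112) input
x = torch.randn(1, 1024, 28, 28)

# Instead of the fixed 8x8 window:
#   x = F.avg_pool2d(x, 8, 1)
# use a size-agnostic global average over the spatial dims:
x = x.mean(3).mean(2)  # -> shape (1, 1024), regardless of spatial size

print(x.shape)  # torch.Size([1, 1024])
```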
I replaced the avg_pool with x = x.mean(-1).mean(-1), but it doesn't work. Did I misunderstand your suggestion? BTW, the error says "CUDA out of memory", and it occurs in stage_1.forward.
Well, that's another matter. Can you try with batch_size = 1?
It still doesn't work!
The error is:
ValueError: Expected input batch_size (441) to match target batch_size (1).
And when I resize the image to (3, 32, 32), it works.
If you do:
model(image)
where image is of size (1, 3, 112, 112), it should work.
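The batch_size (441) in the error is consistent with the original pooling being left in place: with a 112x112 input the stage_3 feature map is larger than 8x8, so F.avg_pool2d(x, 8, 1) leaves a spatial grid behind, and the following view(-1, C) folds those positions into the batch dimension. A sketch (channel count and spatial size are assumptions):

```python
import torch
import torch.nn.functional as F

# Hypothetical stage_3 output for a (1, 3, 112, 112) input
x = torch.randn(1, 1024, 28, 28)

# Original pooling: 8x8 window, stride 1 -> (1, 1024, 21, 21)
y = F.avg_pool2d(x, 8, 1)

# view(-1, 1024) then folds the 21*21 = 441 spatial positions into
# the batch dimension, which triggers the batch-size mismatch error
flat = y.view(-1, 1024)
print(flat.shape)  # torch.Size([441, 1024])

# With the .mean fix the batch dimension is preserved
fixed = x.mean(-1).mean(-1)
print(fixed.shape)  # torch.Size([1, 1024])
```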