Does pretrained ResNet-50 actually use BatchNorm?

Question

Does pretrained ResNet-50 actually use BatchNorm?

Solomon1588 opened this issue 6 years ago · 1 comments

@zhaoweicai You said "For ResNet, the BatchNorm layers are merged into Scale layers". But I can't find any Batch Norm operation in the Scale Layer? Just in terms of the train.prototxt, it looks like that the training of Resnet-50 doesn't use BN?

Answer 1 · 2018-12-28T04:10:44.000Z

"BN merged to Scale layer" means there is no BN layer anymore. Those two layers become a single Scale layer, because at inference BN is just a Scale layer. And that Scale layer is frozen during training. This is a typical strategy for object detection