Does pretrained ResNet-50 actually use BatchNorm?
Solomon1588 opened this issue · 1 comments
Solomon1588 commented
@zhaoweicai You said "For ResNet, the BatchNorm layers are merged into Scale layers". But I can't find any Batch Norm operation in the Scale Layer? Just in terms of the train.prototxt, it looks like that the training of Resnet-50 doesn't use BN?
zhaoweicai commented
"BN merged to Scale layer" means there is no BN layer anymore. Those two layers become a single Scale layer, because at inference BN is just a Scale layer. And that Scale layer is frozen during training. This is a typical strategy for object detection