The learning rate for different parts of the parameters

Question

YangWangsky opened this issue 6 years ago · 4 comments

YangWangsky commented 6 years ago

Nice work, and thanks to the owner for providing the code. I noticed the following comment:

According from the prototxt in Caffe implement, learning rate must multiply by 10.0 in pyramid module.

I found the Caffe implementation by Hao Zhang, but I can't find the prototxt used for training. How can I get it?

Thanks.

Answer 1 · 2018-10-09T02:19:10.000Z

@YangWangsky ,

The original author didn't provide the training code, so I tried to implement it by following the description in the paper.

Answer 2 · 2018-10-09T02:28:19.000Z

@hellochick
Thanks for your reply. I also read the original paper, and it doesn't seem to describe using different learning rates for different parts of the network. Could you please help me find where the learning rate setting comes from?

Answer 3 · 2018-10-13T14:18:35.000Z

Hey,

If you take a look at the details in the original Caffe prototxt, you can find these settings.

For example, at lines 7062-7070 you can see this:

layer {
  name: "conv5_4/bn"
  type: "BN"
  bottom: "conv5_4"
  top: "conv5_4"
  param {
    lr_mult: 10
    decay_mult: 0
  }

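Here `lr_mult: 10` means the learning rate for this layer's parameters is the solver's base learning rate multiplied by 10, and `decay_mult: 0` means no weight decay is applied to them.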
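To make the effect of those two multipliers concrete, here is a minimal sketch in plain Python. It is not Caffe's code and not what this repo does internally; it only illustrates how a per-parameter `lr_mult` and `decay_mult` scale the solver-wide values. The names `ParamSpec`, `effective_rates`, and `sgd_step` are hypothetical, the solver values are made up for illustration, and the update is plain SGD without momentum.

```python
from dataclasses import dataclass


@dataclass
class ParamSpec:
    """Mirrors a Caffe `param { lr_mult ... decay_mult ... }` block."""
    lr_mult: float = 1.0
    decay_mult: float = 1.0


def effective_rates(base_lr, weight_decay, spec):
    """Return (local learning rate, local weight decay) for one parameter."""
    return base_lr * spec.lr_mult, weight_decay * spec.decay_mult


def sgd_step(weight, grad, base_lr, weight_decay, spec):
    """One plain SGD update (no momentum) with L2 weight decay."""
    local_lr, local_decay = effective_rates(base_lr, weight_decay, spec)
    return weight - local_lr * (grad + local_decay * weight)


# Hypothetical solver values, chosen only for illustration.
BASE_LR = 0.001
WEIGHT_DECAY = 0.0001

# Default multipliers, e.g. for layers that have no special setting.
backbone = ParamSpec(lr_mult=1.0, decay_mult=1.0)
# Multipliers copied from the prototxt snippet above.
pyramid_bn = ParamSpec(lr_mult=10.0, decay_mult=0.0)

for name, spec in [("backbone", backbone), ("pyramid_bn", pyramid_bn)]:
    lr, decay = effective_rates(BASE_LR, WEIGHT_DECAY, spec)
    print(name, "lr =", lr, "weight decay =", decay)
```

With these illustrative values, the parameters tagged `lr_mult: 10` train with ten times the base learning rate and skip weight decay, while everything else uses the solver defaults.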

Answer 4 · 2018-10-14T04:59:40.000Z

@hellochick
OK, I get it. I haven't used Caffe before. Thank you for your reply!