problem in finetune

Question

problem in finetune

136823xuewei opened this issue 6 years ago · 6 comments

I use the original flickr30k dataset rather than precomputed features ,and use the pretrained model resnet50.I set finetune True. But, i get poor result of recall. R@1,R@5,R@10 is nearly zero.I can't find the reason about the bad result.How do you set the parameters when you use resnet with finetune?
Thank you.

Answer 1 · 2019-03-18T14:52:29.000Z

Try learning rate 0.00002. Look at section 3.2 for more details of training.

Answer 2 · 2019-03-20T01:05:48.000Z

Thanks for your reply!

Answer 3 · 2019-03-29T07:56:49.000Z

@136823xuewei Did you succeed in getting good results using raw images and finetuning rather than using precomputed features?

@fartashf I trained on the COCO dataset without precomputed features but failed to get good results (@1 less than 1.). Using precomputed features is all good.

The learning rate is 0.00002 when training on raw images.

Could someone help me?

Answer 4 · 2019-04-02T04:49:14.000Z

I can still reproduce the result of the following command using the latest commit on the master:

python train.py --data_name coco --logger_name runs/X --max_violation

It takes at least 1000 iterations to reach above 1% R@1 that is expected.

By default the learning rate is 0.0002. I reproduced it using python 2.7.14 and pytorch 0.3.1. For python 3 and pytorch 1 checkout the details on their corresponding branches.

Answer 5 · 2019-04-11T08:59:23.000Z

if we fix the image full encoder in the first epoch and train the joint embedding model, do we need to write some code to force the image cnn encoder always in evaluation mode? i.e. no dropout and ...

Answer 6 · 2019-04-15T13:53:08.000Z

Not necessarily. Even if the image encoder is not being trained, dropout might help in training robust embeddings.