kjunelee/MetaOptNet

MiniImageNet 5-way 1-shot accuracy

chmxu opened this issue · 8 comments

chmxu commented

I followed the default settings to train on miniImageNet and evaluated the resulting best_model.pth, which gives 59.28% accuracy, a large gap from the reported number that I don't think can be explained by the random choice of test episodes. Any ideas?

Thank you for your interest in our code base.

First, note that each meta-training run can yield a different result. I have experienced similar issues with many other few-shot learning algorithms. Also, different versions of packages may cause different behavior (e.g., Python 3 might use a different random seed than Python 2).

However, the accuracy of 59% seems to be lower than what I and other users of the repository have experienced. Would it be possible for you to report the configuration you used?
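
For what it's worth, if you want to narrow the run-to-run variation that comes from random seeding, pinning all the RNGs before training usually helps. A minimal sketch follows; the helper and the seed value are my own illustration, not part of the repository's train.py:

```python
# Minimal sketch: pin RNG seeds so episode sampling and weight init are
# reproducible across runs. Not part of MetaOptNet's train.py; the helper
# name and the seed value are illustrative only.
import random
import numpy as np
import torch

def set_seed(seed: int = 0) -> None:
    random.seed(seed)                  # Python-level episode/class sampling
    np.random.seed(seed)               # NumPy-based augmentation
    torch.manual_seed(seed)            # CPU (and CUDA) weight initialization
    torch.cuda.manual_seed_all(seed)   # all GPUs when using DataParallel
    torch.backends.cudnn.deterministic = True  # deterministic conv kernels
    torch.backends.cudnn.benchmark = False     # disable autotuner nondeterminism

set_seed(0)
```

Note that cuDNN determinism can slow training, and some GPU ops remain nondeterministic, so this reduces the variance rather than eliminating it.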

chmxu commented

Python 3.6, PyTorch 1.2.0, qpth 0.0.15.
This is my training script:

```
python train.py --gpu 1,2 --save-path "./experiments/miniImageNet_MetaOptNet_SVM" --train-shot 15 --head SVM --network ResNet --dataset miniImageNet --eps 0.1 --episodes-per-batch 2
```

I think the result is suboptimal because episodes-per-batch is set to 2. In our experiments, we set episodes-per-batch to 8 by default.
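
For reference, keeping your flags unchanged except for that one option, the command would look something like this (the GPU indices are whatever your machine has, and a larger episode batch may need more GPU memory):

```
python train.py --gpu 1,2 --save-path "./experiments/miniImageNet_MetaOptNet_SVM" --train-shot 15 --head SVM --network ResNet --dataset miniImageNet --eps 0.1 --episodes-per-batch 8
```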

chmxu commented

I trained a model on 4 1080Ti GPUs with episodes-per-batch set to 8 and used the best_model checkpoint, which reaches 64.13% accuracy on the meta-val set. The model gets 60.42% accuracy on the meta-test set, 2.2% lower than the reported number.

As mentioned in #8, each meta-training run can produce a slightly different result. Also, #25 suggests that the results of both ProtoNet and MetaOptNet can vary across environments. I have experienced similar issues with many other few-shot learning algorithms, and I suspect package versions matter as well. In my environment, I never saw <61% accuracy with MetaOptNet-SVM when label smoothing is applied.

The message of our paper is about the gap between non-parametric base learners and parametric base learners, and I believe that the gap should exist within the same environment.
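
One way to see how much of a gap episode sampling alone can explain is to report a 95% confidence interval over the test episodes, as few-shot papers typically do. A small sketch is below; the episode_accs array is a hypothetical stand-in for your per-episode accuracies, not something the repository emits in this form:

```python
# Sketch: mean test accuracy with a 95% confidence interval over episodes.
# `episode_accs` is a hypothetical list of per-episode accuracies in [0, 1].
import numpy as np

def mean_and_ci95(episode_accs):
    accs = np.asarray(episode_accs, dtype=np.float64)
    mean = accs.mean()
    # standard error of the mean, scaled by the normal 97.5% quantile
    ci95 = 1.96 * accs.std(ddof=1) / np.sqrt(len(accs))
    return mean, ci95

episode_accs = np.random.uniform(0.4, 0.8, size=1000)  # replace with real results
mean, ci = mean_and_ci95(episode_accs)
print(f"test acc: {mean * 100:.2f} +/- {ci * 100:.2f} %")
```

If the interval over, say, 1000 test episodes is only a fraction of a percent wide, a 2% gap is more likely due to the training run or the environment than to the particular episodes drawn.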

chmxu commented

No offence, but this means we can't compare different methods fairly :)

That's a good point. This is why I carefully read the ablation studies section when I read few-shot recognition papers.

Different papers use different engineering choices, such as regularization and data loaders, which makes a fair comparison very difficult.

chmxu commented

Yes, that's helpful. I share your opinion. Thanks!