cifar10 baseline performance
bobye opened this issue · 10 comments
Hi,
I was trying to reproduce the baseline performance of ConvNet (reported in your NIPS paper). I used the default setting (cifar10_full_multistep_solver.prototxt) provided in the directory, but only got around 79–80% accuracy. Can you give me some advice on how to recover the extra 2 percentage points on the test data?
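For reference, this is how I launched training (a sketch; paths assume the usual Caffe layout and may need adjusting):

```bash
# From the Caffe root, after building and preparing the lmdb data
./build/tools/caffe train \
    --solver=examples/cifar10/cifar10_full_multistep_solver.prototxt
```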
I also tried to train the SSL model from scratch (without initializing from a baseline model) and still got 79–80% accuracy.
Jianbo
@wenwei202 Hi, which script should I run to prepare the data: create_cifar10.sh or create_padded_cifar10.sh? I used the former to create the lmdb.
create_cifar10.sh is good.
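Roughly, the standard Caffe data preparation (run from the Caffe root):

```bash
# Download the CIFAR-10 binaries, then convert them to lmdb
./data/cifar10/get_cifar10.sh
./examples/cifar10/create_cifar10.sh
```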
@wenwei202 Should I use a very small learning rate in the later phase of training? I observed that the test accuracy did not improve for 200000 steps with cifar10_full_multistep_solver.prototxt.
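For context, the multistep policy drops the learning rate at fixed iterations; the schedule has roughly this shape (values here are illustrative, not necessarily the exact ones in the file):

```protobuf
# Illustrative multistep schedule: lr is multiplied by gamma at each stepvalue
base_lr: 0.001
lr_policy: "multistep"
gamma: 0.1
stepvalue: 100000
stepvalue: 150000
max_iter: 200000
```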
@wenwei202 Thanks in advance. In your paper "Learning Structured Sparsity in Deep Neural Networks", you conduct the CIFAR-10 experiment with "ConvNet" in Caffe; is it the same "cifar10_full" net architecture as in Caffe?
@upupnoel No, it's different; we added a dropout layer. @bobye Did you finally duplicate the results? The cifar10 results should be easy to duplicate.
@wenwei202 The dropout is added after the last layer, but the last layer only has 10 neurons, which seems a little strange. Does this boost your performance in practice?
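In prototxt terms, I read the structure as something like this (a sketch based on the standard cifar10_full layer names; not copied from your repo):

```protobuf
# Final 10-way classifier followed by dropout, as I understand the description
layer {
  name: "ip1"
  type: "InnerProduct"
  bottom: "pool3"
  top: "ip1"
  inner_product_param { num_output: 10 }
}
layer {
  name: "drop1"
  type: "Dropout"
  bottom: "ip1"
  top: "ip1"
  dropout_param { dropout_ratio: 0.5 }
}
```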
@wenwei202 Not yet :< Can you give a standalone script that reproduces the result on your machine? I tried cifar10_full_multistep_solver.prototxt but have not waited for another 100000 steps. It has also been helpful that you wrote some documentation on the usage.
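In case it helps others: rather than restarting, I resume interrupted runs from the solver state (the snapshot filename below is hypothetical; it depends on snapshot_prefix in the solver):

```bash
# Resume training from a saved solver state
./build/tools/caffe train \
    --solver=examples/cifar10/cifar10_full_multistep_solver.prototxt \
    --snapshot=examples/cifar10/cifar10_full_iter_100000.solverstate
```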
@bobye I checked those scripts and found that the net and solver were both for SSL. Please use cifar10_full_baseline_multistep_solver.prototxt to duplicate the baseline; it is the same as the one in https://github.com/BVLC/caffe/ but with the number of iterations prolonged a bit.
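That is, something like (adjust paths to your checkout):

```bash
# Train the baseline from scratch with the baseline solver
./build/tools/caffe train \
    --solver=examples/cifar10/cifar10_full_baseline_multistep_solver.prototxt
```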
@upupnoel The dropout helped for SSL, but it didn't seem to help the baseline much.
@wenwei202 Thx, I will try them out!