happynear/FaceVerification

Confusion about training loss with CASIA_train_test.prototxt


vzvzx commented

I used CASIA_train_test.prototxt with my own dataset (2,000 persons, 50 images each, about 100K images in total).
I want to train a DeepID network using CASIA_train_test.prototxt, changing only the "ip1" layer's output num from 10575 to 2000 (see the sketch below).
However, the softmax loss does not decrease, and the test accuracy is very low.
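
For reference, the change looks roughly like this (a sketch, not the exact layer definition from the repo; the bottom blob name "fc5" is just a placeholder):

```
# Sketch of the only change I made, assuming a standard Caffe InnerProduct layer;
# the real "ip1" definition in CASIA_train_test.prototxt may have extra params.
layer {
  name: "ip1"
  type: "InnerProduct"
  bottom: "fc5"            # placeholder name for the preceding feature layer
  top: "ip1"
  param { lr_mult: 1 }
  param { lr_mult: 2 }
  inner_product_param {
    num_output: 2000       # changed from 10575 (CASIA-WebFace identities) to my 2000 identities
    weight_filler { type: "xavier" }
    bias_filler { type: "constant" }
  }
}
```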

In a successful run, what accuracy should be reached?
And what does the loss look like?

My aligned data looks like this:
aaron_eckhart_001
aaron_eckhart_002

This is a very difficult problem, which has confused deep learning researchers for years. All I can do is give you some suggestions.

  1. Reduce the initial lr, e.g. set lr=0.001 at first.
  2. Have you used the 'msra' weight filler? In my experience, it works well for initializing the weights (see the sketch after this list).
  3. Use a smaller network first.
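
As a rough illustration of points 1 and 2, the solver and filler settings would look something like this (values are illustrative placeholders, not the exact settings shipped with the repo):

```
# solver sketch (placeholder values)
net: "CASIA_train_test.prototxt"
base_lr: 0.001           # suggestion 1: start with a smaller learning rate
lr_policy: "step"
gamma: 0.1
stepsize: 100000
max_iter: 400000
momentum: 0.9
weight_decay: 0.0005
snapshot: 10000
snapshot_prefix: "casia"
solver_mode: GPU

# suggestion 2: 'msra' filler inside a layer definition, e.g.
# layer {
#   name: "conv1"
#   type: "Convolution"
#   bottom: "data"
#   top: "conv1"
#   convolution_param {
#     num_output: 32
#     kernel_size: 3
#     weight_filler { type: "msra" }
#   }
# }
```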
vzvzx commented

1. Yes, I tried lr values from 0.1 down to 0.0001, but it did not work well.
2. Yes, I used the 'msra' weight filler and also tried the 'xavier' weight filler.
3. On a small dataset (20 persons) it can reach 80% accuracy, but it does not work on the whole dataset.
4. I tried the MNIST network; it can reach 50% accuracy, but that is not good enough.

Could you share a trained model with me? Then I could look at some details, or do some fine-tuning work on top of it.

Why not train a model using CASIA-Webface by yourself?

vzvzx commented

Yes, I'm training the model, but it would be very useful to have some information about your successful models.
In the new run I have managed to make the loss decrease. :) The test accuracy has reached 70% (on the whole dataset).

In your successful model, what test accuracy is reached?


I did not set up a separate test set. I used all of CASIA-Webface as the training set and copied it to serve as the test set.
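
In prototxt terms that just means pointing both phases at the same data source, roughly like this (a sketch; the LMDB path and batch sizes are placeholders):

```
layer {
  name: "data"
  type: "Data"
  top: "data"
  top: "label"
  include { phase: TRAIN }
  data_param {
    source: "casia_webface_lmdb"   # placeholder path
    batch_size: 128
    backend: LMDB
  }
}
layer {
  name: "data"
  type: "Data"
  top: "data"
  top: "label"
  include { phase: TEST }
  data_param {
    source: "casia_webface_lmdb"   # same LMDB reused as the "test" set
    batch_size: 100
    backend: LMDB
  }
}
```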

The final accuracy on the training set is about 89.5%~91.5%. As I remember, after training with lr=0.01, the accuracy is about 80%.

vzvzx commented

Thanks, this is very useful information.
With lr=0.01 my accuracy is 82.81% on the training set, but only 66.08% on the test set.

Does that mean I can do verification based on this model?

You should at least continue training with lr=0.001 for about 150,000 iterations.
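
Roughly, the second stage looks like this in the solver (a sketch; everything except base_lr and the iteration budget is a placeholder):

```
# Second-stage solver sketch: continue from the lr=0.01 snapshot at a lower rate.
net: "CASIA_train_test.prototxt"
base_lr: 0.001
lr_policy: "fixed"        # keep 0.001 for the whole stage
max_iter: 150000
momentum: 0.9
weight_decay: 0.0005
snapshot: 10000
snapshot_prefix: "casia_lr0.001"   # placeholder
solver_mode: GPU
```

Training can then be resumed from the previous snapshot with something like `caffe train -solver solver_lr0.001.prototxt -snapshot casia_iter_XXXX.solverstate` (the file names here are placeholders).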

vzvzx commented

After the lr=0.001 stage, my training accuracy reaches ~100% and the test accuracy reaches ~77%.
But verification on LFW with this model only reaches ~60% using VerificationDemo.m.

This is very strange; there must be something wrong with my setup.

Here are some differences from your process.
With your alignment code, the output data looks like this:
aaron_eckhart_003_1
But mine crops the face to fill the whole image; is this right?
aaron_eckhart_003

I just followed CASIA-Webface when setting the parameters. You should align the faces in the same way you aligned your training data.