Light CNN for Deep Face Recognition, in pytorch

A pytorch implementation of A Light CNN for Deep Face Representation with Noisy Labels from the paper by Xiang Wu, Ran He, Zhenan Sun and Tieniu Tan. The official and original Caffe code can be found here.

Installation
Datasets
Training
Evaluate
Performance
Citation
References

Installation

Install pytorch following the website.
Clone this repository.
- Note: We currently only run it on Python 2.7.

Datasets

Download face dataset such as CASIA-WebFace, VGG-Face and MS-Celeb-1M.
- The MS-Celeb-1M clean list is uploaded: Baidu Yun, Google Drive.
All face images are converted to gray-scale images and normalized to 144x144 according to landmarks.
According to the five facial points, we not only rotate two eye points horizontally but also set the distance between the midpoint of eyes and the midpoint of mouth(ec_mc_y), and the y axis of midpoint of eyes(ec_y) .
The aligned LFW images are uploaded on Baidu Yun.

Dataset size ec_mc_y ec_y

Training set 144x144 48 48

Testing set 128x128 48 40

Dataset	size	ec_mc_y	ec_y
Training set	144x144	48	48
Testing set	128x128	48	40

Training

To train Light CNN using the train script simply specify the parameters listed in train.py as a flag or manually change them.

python train.py --root_path=/path/to/your/datasets/ \
		--train_list=/path/to/your/train/list.txt \
		--val_list=/path/to/your/val/list.txt \
		--save_path=/path/to/your/save/path/ \
		--num_classes=n

Tips:
- The lists of train and val datasets are followed by the format of caffe. The details of data loader is shown in load_imglist.py. Or you can use torchvision.datasets.ImageFolder to load your datasets.
- The num_classes denotes the number of identities in your training dataset.
- When training by pytorch, you can set a larger learning rate than caffe and it is faster converaged by pytorch than caffe for Light CNN.
- We enlarge the learning rate for the parameters of fc2 which may lead better performance. If the training is collapsed on your own datasets, you can decrese it.
- We modify the implementation of SGD with momentum since the official pytorch implementation is different from Sutskever et. al. The details are shown in here.

Evaluation

To evaluate a trained network:

python extract_features.py --resume=/path/to/your/model \
			   --root_path=/path/to/your/datasets/ \
			   --img_list=/path/to/your/list.txt \
			   --save_path=/path/to/your/save/path/ \
			   --num_classes=n

You can use vlfeat or sklearn to evaluate the features on ROC and obtain EER and TPR@FPR for your testing datasets.
The model is released on Google Drive.
- Note that the released model contains the whole state of the light CNN module and optimizer. The details of loading model can be found in train.py.
The features of lfw and megaface are released.

Performance

The Light CNN performance on lfw 6,000 pairs.

Model	100% - EER	TPR@FAR=1%	TPR@FAR=0.1%	TPR@FAR=0
caffe(original)	98.80%	98.60%	96.77%	94.97%
pytorch	98.70%	98.47%	95.13%	89.53%

The Light CNN performance on lfw BLUFR protocols

Model	VR@FAR=0.1%	DIR@FAR=1%
caffe(original)	97.45%	84.89%
pytorch	96.80%	83.06%

The Light CNN performance on MegaFace

Model	Rank-1	TPR@FAR=1e-6
caffe(original)	65.532%	75.854%
pytorch	65.782%	76.288%

Citation

If you use our models, please cite the following paper:

@article{wulight,
  title={A Light CNN for Deep Face Representation with Noisy Labels},
  author={Wu, Xiang and He, Ran and Sun, Zhenan and Tan, Tieniu}
  journal={arXiv preprint arXiv:1511.02683},
  year={2015}
}

yichuan9527/LightCNN