# If TensorFlow fails to run
echo 'export PATH=/usr/bin/:$PATH' >> ~/.bashrc
# Training
cd /mnt/disk/facenet
export PYTHONPATH=/mnt/disk/facenet/src
# Face detection and alignment
python src/align/align_dataset_mtcnn.py /mnt/disk/datasets/CASIA-maxpy-clean/ /mnt/disk/datasets/casia_maxpy_mtcnnpy_182 --image_size 182 --margin 44
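The `--margin 44` and `--image_size 182` flags above control how each detected face is cropped before resizing. A minimal sketch of the cropping arithmetic, assuming the margin is split evenly between opposite sides of the detection box and clipped at the image border (the function name `expand_box` and the sample coordinates are hypothetical, not taken from the repository):

```python
def expand_box(det, margin, img_width, img_height):
    """Grow a detection box by margin/2 pixels per side, clipped to the image.

    det is (x1, y1, x2, y2) in pixels. This is an illustrative helper,
    not the alignment script itself.
    """
    x1, y1, x2, y2 = det
    half = margin / 2
    return (
        int(max(x1 - half, 0)),
        int(max(y1 - half, 0)),
        int(min(x2 + half, img_width)),
        int(min(y2 + half, img_height)),
    )


# Hypothetical detection inside a 250x250 image.
print(expand_box((100, 80, 200, 220), margin=44, img_width=250, img_height=250))
# (78, 58, 222, 242)

# A detection near the border is clipped rather than extended past the image.
print(expand_box((10, 5, 240, 245), margin=44, img_width=250, img_height=250))
# (0, 0, 250, 250)
```

The cropped region would then be resized to `image_size` x `image_size` pixels. Aligning at 182 pixels while training at `--image_size 160` leaves room for `--random_crop` to pick a 160-pixel window from each image.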
# Train; set the number of epochs with --max_nrof_epochs
python src/train_softmax.py --logs_base_dir /mnt/disk/logs/facenet/ --models_base_dir /mnt/disk/models/facenet/ --data_dir /mnt/disk/datasets/casia_maxpy_mtcnnpy_182 --image_size 160 --model_def models.inception_resnet_v1 --lfw_dir /mnt/disk/datasets/lfw/lfw_mtcnnpy_160 --optimizer RMSPROP --learning_rate -1 --max_nrof_epochs 80 --keep_probability 0.8 --random_crop --random_flip --learning_rate_schedule_file data/learning_rate_schedule_classifier_casia.txt --weight_decay 5e-5 --center_loss_factor 1e-2 --center_loss_alfa 0.9
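With `--learning_rate -1`, the learning rate is taken from the file passed to `--learning_rate_schedule_file` instead of a fixed value. A minimal sketch of such a lookup, assuming the file holds one `epoch: rate` pair per line in ascending epoch order with `#` comments (the function name and the schedule values below are hypothetical, not the contents of `learning_rate_schedule_classifier_casia.txt`):

```python
def learning_rate_for_epoch(schedule_text, epoch):
    """Return the rate of the last schedule entry whose start epoch is <= epoch.

    Assumes entries are sorted by ascending epoch. Returns None if no
    entry applies.
    """
    rate = None
    for line in schedule_text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        start, value = line.split(":")
        if int(start) <= epoch:
            rate = float(value)
        else:
            break
    return rate


# Hypothetical schedule; values are for illustration only.
EXAMPLE_SCHEDULE = """
# epoch: learning rate
0: 0.1
60: 0.01
75: 0.001
"""

print(learning_rate_for_epoch(EXAMPLE_SCHEDULE, 0))   # 0.1
print(learning_rate_for_epoch(EXAMPLE_SCHEDULE, 64))  # 0.01
print(learning_rate_for_epoch(EXAMPLE_SCHEDULE, 79))  # 0.001
```

Keeping the schedule in a text file means the rate can be adjusted per epoch without changing the training command.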
# View TensorBoard: open a new terminal
ssh -L 16006:127.0.0.1:6006 -p 10199 root@megatron.pkuml.org
export LC_ALL=C
tensorboard --logdir=/mnt/disk/logs/facenet
# Open http://127.0.0.1:16006 to view
# Face detection
export PYTHONPATH=/mnt/disk/facenet/src
for N in {1..4}; do python src/align/align_dataset_pku.py /share/dataset/train/1_1_04_0/prob /mnt/disk/datasets/dongnanmen_160 --image_size 160 --margin 32 --random_order --gpu_memory_fraction 0.25 & done
This is a TensorFlow implementation of the face recognizer described in the paper "FaceNet: A Unified Embedding for Face Recognition and Clustering". The project also uses ideas from the paper "A Discriminative Feature Learning Approach for Deep Face Recognition" as well as the paper "Deep Face Recognition" from the Visual Geometry Group at Oxford.
The code is tested using Tensorflow r1.2 under Ubuntu 14.04 with Python 2.7 and Python 3.5. The test cases can be found here and the results can be found here.
Date | Update |
---|---|
2017-05-13 | Removed a bunch of older non-slim models. Moved the last bottleneck layer into the respective models. Corrected normalization of Center Loss. |
2017-05-06 | Added code to train a classifier on your own images. Renamed facenet_train.py to train_tripletloss.py and facenet_train_classifier.py to train_softmax.py. |
2017-03-02 | Added pretrained models that generate 128-dimensional embeddings. |
2017-02-22 | Updated to Tensorflow r1.0. Added Continuous Integration using Travis-CI. |
2017-02-03 | Added models where only trainable variables have been stored in the checkpoint. These are therefore significantly smaller. |
2017-01-27 | Added a model trained on a subset of the MS-Celeb-1M dataset. The LFW accuracy of this model is around 0.994. |
2017-01-02 | Updated the code to run with Tensorflow r0.12. Not sure if it runs with older versions of Tensorflow though. |
Model name | LFW accuracy | Training dataset | Architecture |
---|---|---|---|
20170511-185253 | 0.987 | CASIA-WebFace | Inception ResNet v1 |
20170512-110547 | 0.992 | MS-Celeb-1M | Inception ResNet v1 |
The code is heavily inspired by the OpenFace implementation.
The CASIA-WebFace dataset has been used for training. After face detection, this training set consists of a total of 453,453 images over 10,575 identities. Some performance improvement has been seen when the dataset is filtered before training. More information about how this was done will come later. The best performing model has been trained on a subset of the MS-Celeb-1M dataset. This dataset is significantly larger but also contains significantly more label noise, so it is crucial to filter it before training.
One problem with the above approach seems to be that the Dlib face detector misses some of the hard examples (partial occlusion, silhouettes, etc.). This makes the training set too "easy", which causes the model to perform worse on other benchmarks. To solve this, other face landmark detectors have been tested. One face landmark detector that has proven to work very well in this setting is the Multi-task CNN (MTCNN). A Matlab/Caffe implementation can be found here and this has been used for face alignment with very good results. A Python/Tensorflow implementation of MTCNN can be found here. This implementation does not give identical results to the Matlab/Caffe implementation, but the performance is very similar.
Currently, the best results are achieved by training the model as a classifier with the addition of Center loss. Details on how to train a model as a classifier can be found on the page Classifier training of Inception-ResNet-v1.
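The center loss idea can be sketched in plain Python. This is an illustration under stated assumptions, not the repository's implementation: the loss is taken as the mean squared difference between each feature and its class center (normalized by batch size times feature dimension), and each center moves a fraction `1 - alfa` toward the features of its class. The function name and the sample data are hypothetical.

```python
def center_loss_step(features, labels, centers, alfa):
    """Return (loss, updated_centers) for one batch.

    features: list of feature vectors (lists of floats)
    labels:   class index for each feature vector
    centers:  list of per-class center vectors
    alfa:     center decay in [0, 1]; assumed to mean that centers move
              by a fraction (1 - alfa) of the difference per sample
    """
    dim = len(features[0])

    # Loss: mean squared difference between features and their class centers.
    total = 0.0
    for x, y in zip(features, labels):
        total += sum((xi - ci) ** 2 for xi, ci in zip(x, centers[y]))
    loss = total / (len(features) * dim)

    # Update: differences are computed against the centers as they were
    # at the start of the batch, then subtracted from a copy.
    new_centers = [list(c) for c in centers]
    for x, y in zip(features, labels):
        for k in range(dim):
            new_centers[y][k] -= (1 - alfa) * (centers[y][k] - x[k])
    return loss, new_centers


# Hypothetical batch: two 2-D features, one per class, centers at the origin.
loss, centers = center_loss_step(
    features=[[1.0, 0.0], [0.0, 1.0]],
    labels=[0, 1],
    centers=[[0.0, 0.0], [0.0, 0.0]],
    alfa=0.9,
)
print(loss, centers)
```

In training, a term like this would be scaled by a weighting factor (for example `--center_loss_factor 1e-2`) and added to the softmax classification loss, so the classifier is encouraged to produce compact per-identity clusters.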
A couple of pretrained models are provided. They are trained using softmax loss with the Inception-Resnet-v1 model. The datasets have been aligned using MTCNN.
The accuracy on LFW for the model 20170512-110547 is 0.992 ± 0.003. A description of how to run the test can be found on the page Validate on LFW.
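A figure of this form is typically a mean and a spread over cross-validation folds. A minimal sketch of how such a summary could be computed, assuming the second number is the population standard deviation of per-fold accuracies (the text above does not state this, and the fold values below are hypothetical, not results from this model):

```python
import statistics


def summarize_folds(fold_accuracies):
    """Return (mean, population standard deviation) of per-fold accuracies."""
    mean = statistics.mean(fold_accuracies)
    std = statistics.pstdev(fold_accuracies)
    return mean, std


# Hypothetical per-fold accuracies, for illustration only.
folds = [0.990, 0.995, 0.985, 0.990]
mean, std = summarize_folds(folds)
print("%.3f +- %.3f" % (mean, std))
```

Reporting the spread alongside the mean makes it easier to judge whether a difference between two models is larger than the fold-to-fold variation.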