
Replicated the paper "Attribute and Simile Classifiers for Face Verification" and matched its reported accuracy. Improved accuracy by a further 10% by integrating end-to-end, scalable deep learning methods.


Attribute and Simile Classifiers for Face Verification

Directory Structure

  • src folder contains the source code.
  • data folder contains .npy files which contain calculated histograms.

Overview

  1. Low level feature extraction: In this part, we extract a feature vector for each region of a face.
  2. Attribute classifier: The extracted vectors for a face are checked for the presence and strength of various attributes such as Male, Attractive, Asian, Indian, etc.
  3. Simile classifier: The extracted vectors for a face are compared against reference people to measure how similar each face region is to theirs.
  4. Verification classifier: Two faces F1 and F2 are passed through all the learnt attribute and simile classifiers to obtain a trait vector for each face. A verification classifier is then trained to decide whether the two trait vectors belong to the same person.

Datasets used

  1. LFW: Used to train the attribute classifier and verification classifier.
  2. CelebA: Used to train the attribute classifier.
  3. Celebrity Face Recognition Dataset: Used to train the simile classifier.

Low Level Feature extraction details

  1. Face landmark detection using the pretrained model shape_predictor_81_face_landmarks.dat.
  2. Face alignment to rotate each face so that it is parallel to the horizontal axis.
  3. Face region extraction.
  4. Low level features extracted by computing the edge magnitude and edge orientation spaces.
  5. For each face region and each space considered, a histogram is calculated over 100 bins (see the sketch after the figures below).
Figure: Face alignment
Figure: Face region extraction
Figure: Low level feature extraction
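A minimal sketch of this feature extraction pipeline, assuming dlib, OpenCV, and NumPy are available. The landmark indices used to crop each region and the alignment details are illustrative choices, not necessarily those used in the src folder.

```python
import cv2
import dlib
import numpy as np

detector = dlib.get_frontal_face_detector()
predictor = dlib.shape_predictor("shape_predictor_81_face_landmarks.dat")

def edge_histograms(region, bins=100):
    """Histogram the edge magnitude and edge orientation of one face region."""
    gray = cv2.cvtColor(region, cv2.COLOR_BGR2GRAY)
    gx = cv2.Sobel(gray, cv2.CV_64F, 1, 0, ksize=3)
    gy = cv2.Sobel(gray, cv2.CV_64F, 0, 1, ksize=3)
    h_mag, _ = np.histogram(np.hypot(gx, gy), bins=bins)
    h_ori, _ = np.histogram(np.arctan2(gy, gx), bins=bins, range=(-np.pi, np.pi))
    return np.concatenate([h_mag, h_ori]).astype(np.float64)

def extract_features(image):
    """Return per-region edge histograms for every face in a BGR image."""
    features = []
    for face in detector(image, 1):
        pts = np.array([[p.x, p.y] for p in predictor(image, face).parts()],
                       dtype=np.int32)
        # Align: rotate so the line between the eye centres is horizontal.
        left_eye, right_eye = pts[36:42].mean(axis=0), pts[42:48].mean(axis=0)
        dx, dy = right_eye - left_eye
        center = (float((left_eye[0] + right_eye[0]) / 2),
                  float((left_eye[1] + right_eye[1]) / 2))
        M = cv2.getRotationMatrix2D(center, np.degrees(np.arctan2(dy, dx)), 1.0)
        aligned = cv2.warpAffine(image, M, image.shape[1::-1])
        # Rotate the landmarks with the same transform, then crop each region.
        pts_rot = (M @ np.hstack([pts, np.ones((len(pts), 1))]).T).T.astype(np.int32)
        for region in (slice(36, 48), slice(27, 36), slice(48, 68)):  # eyes, nose, mouth (illustrative)
            x, y, w, h = cv2.boundingRect(pts_rot[region])
            features.append(edge_histograms(aligned[y:y + h, x:x + w]))
    return features
```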

Attribute Classifier details

  1. We loaded the saved histograms.
  2. The histograms are split into a training set and a validation set with test size = 0.3.
  3. We picked a handful of attributes from the available set, chosen for their relevance to the face verification problem.
  4. For each chosen attribute, we selected a subset of features and trained an SVM classifier on them.
  5. While training the SVMs, we performed hyperparameter tuning using grid search.
  6. After training, the models are saved.
  7. The models are then tested for accuracy on the validation set (see the sketch below).
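A minimal sketch of training one attribute SVM with the split and grid search described above. The .npy file names and the hyperparameter grid are illustrative placeholders, not the exact ones used in src.

```python
import joblib
import numpy as np
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.svm import SVC

histograms = np.load("data/attribute_histograms.npy")  # hypothetical file name
labels = np.load("data/attribute_labels_male.npy")     # +1 / -1 for one attribute

X_train, X_val, y_train, y_val = train_test_split(
    histograms, labels, test_size=0.3, random_state=42)

# Hyperparameter tuning with grid search over the usual RBF-SVM parameters.
param_grid = {"C": [0.1, 1, 10, 100], "gamma": ["scale", 0.01, 0.001]}
grid = GridSearchCV(SVC(kernel="rbf"), param_grid, cv=3, n_jobs=-1)
grid.fit(X_train, y_train)

joblib.dump(grid.best_estimator_, "models/attribute_male.pkl")
print("Validation accuracy:", grid.best_estimator_.score(X_val, y_val))
```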

Simile Classifier details

  1. We loaded the saved histograms.
  2. We generated labels for this dataset: if a histogram belongs to the reference person, the label is +1; otherwise it is -1.
  3. The dataset is split into training and test data: the -1 examples with test size = 0.3 and the +1 examples with test size = 0.15.
  4. We chose a handful of reference persons from the dataset.
  5. We trained SVM models for the face regions eyes, nose, and mouth.
  6. While training the SVMs, we performed hyperparameter tuning using grid search.
  7. After training, the models are saved.
  8. The models are then tested for accuracy on the held-out test data (see the sketch below).
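A minimal sketch of training one simile SVM for a single face region and reference person, using the split sizes from steps 2 and 3. File names and the hyperparameter grid are again illustrative.

```python
import joblib
import numpy as np
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.svm import SVC

pos = np.load("data/simile_eyes_reference_person.npy")  # hypothetical: histograms of the reference person
neg = np.load("data/simile_eyes_others.npy")            # hypothetical: histograms of other people

# Positive (+1) and negative (-1) examples are split with different test sizes.
Xp_tr, Xp_te = train_test_split(pos, test_size=0.15, random_state=42)
Xn_tr, Xn_te = train_test_split(neg, test_size=0.30, random_state=42)

X_train = np.vstack([Xp_tr, Xn_tr])
y_train = np.concatenate([np.ones(len(Xp_tr)), -np.ones(len(Xn_tr))])
X_test = np.vstack([Xp_te, Xn_te])
y_test = np.concatenate([np.ones(len(Xp_te)), -np.ones(len(Xn_te))])

grid = GridSearchCV(SVC(kernel="rbf"),
                    {"C": [0.1, 1, 10, 100], "gamma": ["scale", 0.01]},
                    cv=3, n_jobs=-1)
grid.fit(X_train, y_train)

joblib.dump(grid.best_estimator_, "models/simile_eyes_reference_person.pkl")
print("Test accuracy:", grid.best_estimator_.score(X_test, y_test))
```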

Verification Classifier

  1. We divided the dataset randomly into positive and negative image pairs.
  2. We extracted the low level features for these pairs of images.
  3. We fetched the outputs of the simile and attribute classifiers for these pairs (using the low level features extracted above).
  4. The outputs of the simile and attribute classifiers for the two images in a pair are concatenated.
  5. We generated labels for this dataset: if the two trait vectors belong to the same person, the label is +1; otherwise it is 0.
  6. We trained the SVM (see the sketch after the results figure below).
Figure: Example true positive, true negative, false positive, and false negative verification pairs
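A minimal sketch of building trait vectors and training the verification SVM. Here `attribute_models`, `simile_models`, `pairs`, and `same_person` are hypothetical placeholders for the saved classifiers and the sampled image pairs with their labels.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

def trait_vector(features, attribute_models, simile_models):
    """Score one face's low level features with every trained classifier.
    (In the real code each classifier sees only its chosen feature subset;
    here every model scores the full vector for brevity.)"""
    return np.array([m.decision_function([features])[0]
                     for m in attribute_models + simile_models])

def train_verifier(pairs, same_person, attribute_models, simile_models):
    """pairs: list of (features_1, features_2); same_person: +1 / 0 labels."""
    # Concatenate the two trait vectors of each pair into one training example.
    X = np.array([np.concatenate([trait_vector(f1, attribute_models, simile_models),
                                  trait_vector(f2, attribute_models, simile_models)])
                  for f1, f2 in pairs])
    y = np.array(same_person)
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.3, random_state=42)
    verifier = SVC(kernel="rbf").fit(X_train, y_train)
    print("Verification accuracy:", verifier.score(X_test, y_test))
    return verifier
```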

BONUS

Since the paper is from 2009, it relies entirely on SVM classification. After completing its implementation, we looked for better techniques to improve verification accuracy. We applied various deep learning techniques and, in the process, gained a much better understanding of deep learning and of how convolutional neural networks help improve accuracy.

- 10% Overall accuracy improvement with Deep Learning -
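As one illustration of the kind of CNN-based approach explored here, the sketch below maps aligned face crops to unit-length embeddings and verifies a pair by thresholding cosine similarity. The architecture, input size, and threshold are assumptions for illustration, not the exact model used in the notebooks.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FaceEmbedder(nn.Module):
    """Small CNN mapping a 3x128x128 aligned face crop to a 128-d embedding."""
    def __init__(self, dim=128):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(64, 128, 3, padding=1), nn.ReLU(), nn.AdaptiveAvgPool2d(1),
        )
        self.fc = nn.Linear(128, dim)

    def forward(self, x):
        emb = self.fc(self.features(x).flatten(1))
        return F.normalize(emb, dim=1)  # unit-length embeddings

def verify(model, faces1, faces2, threshold=0.5):
    """Declare a match when the cosine similarity of the embeddings is high."""
    with torch.no_grad():
        e1, e2 = model(faces1), model(faces2)
    return (e1 * e2).sum(dim=1) > threshold
```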