CS596-029 Project Topic 1 - Face Recognition

Brief Intro

This project (Topic 1 - Face Recognition) was implemented under the guidance of Professor Eugene Chang for the course CS596-029, Machine Learning and Data Mining, at svuca.edu.

Project Directory Structure

├── data # TRAINING & TEST DATA, AND SNAPSHOT OF TRAINED MODELS
│   ├── testing # TEST DATA
│   │   ├── 00-friends-Cropped # UNCLASSIFIED IMAGES
│   │   ├── 00-friends-Raw # UNCLASSIFIED IMAGES
│   │   ├── 01-Tracy-Cropped # CLASS 1 TEST DATA
│   │   ├── 02-Trish-Cropped # CLASS 2 TEST DATA
│   │   └── 03-Steven-Cropped # CLASS 3 TEST DATA
│   ├── trained_models
│   └── training # TRAINING DATA
│       ├── 01-Tracy-Cropped # CLASS 1 CROPPED FACES
│       ├── 01-Tracy-Raw # CLASS 1 RAW DATA
│       ├── 02-Trish-Cropped # CLASS 2 CROPPED FACES
│       ├── 02-Trish-Raw # CLASS 2 RAW DATA
│       ├── 03-Steven-Cropped # CLASS 3 CROPPED FACES
│       └── 03-Steven-Raw # CLASS 3 RAW DATA
├── docs # PROJECT POWERPOINT, DEVELOPMENT LOGS
├── references # CODE OR DOCUMENTS USED AS REFERENCES
└── src # PROGRAM SOURCE CODE
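The folder names above encode a class ID, a person name, and the image kind (Cropped or Raw). As a minimal sketch, assuming that naming scheme holds for every data folder, the three parts could be recovered with a regular expression. The pattern and the function name `parse_folder_name` are illustrative assumptions, not taken from the project source.

```python
import re

# Hypothetical pattern for folder names such as "01-Tracy-Cropped".
FOLDER_PATTERN = re.compile(r"^(?P<id>\d{2})-(?P<name>[^-]+)-(?P<kind>Cropped|Raw)$")


def parse_folder_name(folder):
    """Split a data folder name into (id, name, kind); return None if it does not match."""
    match = FOLDER_PATTERN.match(folder)
    if match is None:
        return None
    return match.group("id"), match.group("name"), match.group("kind")


if __name__ == "__main__":
    for folder in ["01-Tracy-Cropped", "00-friends-Raw", "trained_models"]:
        print(folder, "->", parse_folder_name(folder))
```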

Usage

Dependencies

Run all python commands from inside the src folder.
The scripts use these libraries: cv2, re, numpy, pandas, pickle, sklearn, uuid. Make sure they are available before running anything.
- OpenCV: the following commands install OpenCV with Homebrew on OS X.
```
brew tap homebrew/science
brew install opencv
```
- Other Python packages: run pip install -r requirements.txt. The requirements file is only a recommendation; you can install the packages yourself if you prefer.

Execution

The program has 2 python scripts:

process_images.py: This script handles all image processing tasks: face extraction (in black and white), squaring, flipping, and equalization. It reads images from the Raw folders (see Project Directory Structure) and writes the cropped faces to the matching Cropped folders.
- Configurations: all parameters are set inside the Python file. You can change ID and Label to process training data for a different class (person). Make sure they are consistent.
- Usage: python process_images.py. When an image pops up, click on the face you want to extract and press 's' to save it. Press 'n' (next) to skip the current image, or 'q' to quit at any point. Your progress is kept in the *.npy files in the Raw image folder, so you can resume later.
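The squaring, equalization, and flipping steps can be sketched with NumPy alone. This is an illustration under stated assumptions, not the script's actual code: the script presumably relies on OpenCV for these operations, and the function names `square_crop`, `equalize`, and `preprocess` are hypothetical. The face box `(x, y, w, h)` is assumed to come from a face detector, and the input is assumed to be an 8-bit grayscale array.

```python
import numpy as np


def square_crop(gray, x, y, w, h):
    """Crop a square region centred on the box (x, y, w, h), kept inside the image."""
    side = min(max(w, h), gray.shape[0], gray.shape[1])
    cx, cy = x + w // 2, y + h // 2
    left = min(max(cx - side // 2, 0), gray.shape[1] - side)
    top = min(max(cy - side // 2, 0), gray.shape[0] - side)
    return gray[top:top + side, left:left + side]


def equalize(gray):
    """Histogram-equalize an 8-bit grayscale image via its cumulative histogram."""
    hist = np.bincount(gray.ravel(), minlength=256)
    cdf = hist.cumsum()
    cdf_min = cdf[cdf > 0][0]
    total = cdf[-1]
    if total == cdf_min:  # constant image: nothing to stretch
        return gray.copy()
    lut = np.round((cdf - cdf_min) / (total - cdf_min) * 255)
    lut = lut.clip(0, 255).astype(np.uint8)
    return lut[gray]


def preprocess(gray, box):
    """Return the equalized square face and its horizontal mirror image."""
    face = equalize(square_crop(gray, *box))
    return face, np.fliplr(face)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    image = rng.integers(0, 256, size=(100, 120), dtype=np.uint8)  # stand-in for a photo
    face, mirrored = preprocess(image, (10, 20, 30, 40))
    print(face.shape, mirrored.shape)
```

Saving both the face and its mirror doubles the number of training images per person, which is the purpose of the flipping step.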
recognize_faces.py: This script is the core of learning and validation. It loads the training data, computes the PCA, generates the eigenfaces and the classifier, snapshots the classifier and PCA object, and predicts results for the test data.
- Configurations: all parameters are set inside the Python file. The parameter REUSE_TRAINED_MODEL_FILE controls whether a saved model is reused; change it to an empty string if you want to recompute the model.
- Usage: python recognize_faces.py
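The pipeline described for recognize_faces.py (PCA, eigenfaces, an RBF-kernel SVM tuned with GridSearchCV, a pickle snapshot, then prediction) could look roughly like the sketch below. It runs on synthetic vectors instead of real face images, and the number of components, the parameter grid, and all function names are assumptions chosen for illustration, not the values used by the script.

```python
import pickle

import numpy as np
from sklearn.decomposition import PCA
from sklearn.metrics import classification_report, confusion_matrix
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC


def make_fake_faces(rng, means, n_per_class):
    """Generate noisy flattened 'images' around one mean vector per class."""
    X = np.vstack([m + rng.normal(size=(n_per_class, means.shape[1])) for m in means])
    y = np.repeat(np.arange(len(means)), n_per_class)
    return X, y


def train(X_train, y_train, n_components=10):
    """Fit PCA, then grid-search C and gamma for an RBF SVM on the projected data."""
    pca = PCA(n_components=n_components, whiten=True).fit(X_train)
    param_grid = {"C": [1, 10, 100], "gamma": [0.001, 0.01, 0.1]}  # assumed grid
    search = GridSearchCV(SVC(kernel="rbf"), param_grid, cv=3)
    search.fit(pca.transform(X_train), y_train)
    return pca, search.best_estimator_


def evaluate(pca, clf, X_test, y_test):
    """Project the test data and return the confusion matrix and text report."""
    y_pred = clf.predict(pca.transform(X_test))
    return confusion_matrix(y_test, y_pred), classification_report(y_test, y_pred)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    means = rng.normal(scale=3.0, size=(3, 64))  # three classes, 8x8 "images"
    X_train, y_train = make_fake_faces(rng, means, 30)
    X_test, y_test = make_fake_faces(rng, means, 10)

    pca, clf = train(X_train, y_train)
    eigenfaces = pca.components_.reshape((-1, 8, 8))  # one eigenface per component

    # Snapshot the PCA object and classifier, then restore them as a later run would.
    snapshot = pickle.dumps((pca, clf))
    pca, clf = pickle.loads(snapshot)

    matrix, report = evaluate(pca, clf, X_test, y_test)
    print(matrix)
    print(report)
```

Pickling the PCA object together with the classifier matters because test images must be projected with the same components the classifier was trained on.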

Progress & Features

Completed features
1. Using OpenCV to detect face
2. Crop face and pre-process faces
3. Preparing database for training & testing
4. Generate more training data by Facebook Data Scraping
5. Face Flipping: to produce 2 images of the same person (for better recognition of both sides)
6. Ability to click on the square box in OpenCV to choose which face should be extracted (since OpenCV's detection is not always correct)
7. Ability to load and store the progress of face extraction & review so we can resume later with newer images
8. Implement GridSearchCV with an RBF kernel to find the best settings among multiple C and gamma values.
9. Add auto-scaling when displaying & processing large images.
10. Print the confusion matrix and the classification report in the result (using scikit-learn)
11. Store and Load trained model using Pickle
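The resume feature (item 7) can be illustrated with a small sketch: keep the names of already reviewed images in a .npy file inside the Raw folder and skip them on the next run. The file name `progress.npy`, the stored format, and the function names are assumptions; the actual script may store its progress differently.

```python
import os

import numpy as np

PROGRESS_FILE = "progress.npy"  # hypothetical file name


def load_progress(raw_dir):
    """Return the set of image names already reviewed in this Raw folder."""
    path = os.path.join(raw_dir, PROGRESS_FILE)
    if not os.path.exists(path):
        return set()
    return set(np.load(path).tolist())


def save_progress(raw_dir, done):
    """Persist the reviewed image names as a string array."""
    np.save(os.path.join(raw_dir, PROGRESS_FILE), np.array(sorted(done), dtype=str))


def pending_images(raw_dir, extensions=(".jpg", ".jpeg", ".png")):
    """List images in the folder that have not been reviewed yet."""
    done = load_progress(raw_dir)
    names = sorted(n for n in os.listdir(raw_dir) if n.lower().endswith(extensions))
    return [n for n in names if n not in done]
```

With this layout, adding newer images to a Raw folder and rerunning the script would only present the files that are not yet recorded.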
TODO List
1. Apply AdaBoost, either with SVM or another classifier, and compare the results. [experiment]
2. Build a UI function to show the name directly on the face in a TEST photo. [fancy feature]
3. Find a way to deal with face angle (when it’s not straight) to improve result further.

Sample Output

This is sample output from recognize_faces.py.

======TRAINING DATA=======
- Mode, id, name, total images: training, 01, Tracy, 162
- Mode, id, name, total images: training, 02, Trish, 78
- Mode, id, name, total images: training, 03, Steven, 64

======TESTING DATA=======
- Mode, id, name, total images: testing, 00, friends, 0
- Mode, id, name, total images: testing, 01, Tracy, 28
- Mode, id, name, total images: testing, 02, Trish, 30
- Mode, id, name, total images: testing, 03, Steven, 13

==================RESULT==================
Confusion Matrix: 
        Tracy  Trish  Steven
Tracy      27      1       0
Trish       8     22       0
Steven      1      0      12
Classification Report: 
             precision    recall  f1-score   support

      Tracy       0.75      0.96      0.84        28
      Trish       0.96      0.73      0.83        30
     Steven       1.00      0.92      0.96        13

avg / total       0.88      0.86      0.86        71


        How to comprehend the report:
        - Recall value: "Given a true face of person X, how likely does the classifier detect it is X?
        - Precision value: "If the classifier predicted a face person X, how likely is it to be correct?
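The precision and recall figures in the report follow directly from the confusion matrix, where rows are true classes and columns are predicted classes. The short check below recomputes them from the matrix printed above; the helper name `precision_recall` is only illustrative.

```python
labels = ["Tracy", "Trish", "Steven"]

# Rows: true person. Columns: predicted person (same order as labels).
confusion = [
    [27, 1, 0],
    [8, 22, 0],
    [1, 0, 12],
]


def precision_recall(confusion, k):
    """Return (precision, recall) for class index k."""
    true_positive = confusion[k][k]
    predicted_as_k = sum(row[k] for row in confusion)  # column sum
    actually_k = sum(confusion[k])                     # row sum
    return true_positive / predicted_as_k, true_positive / actually_k


for k, name in enumerate(labels):
    precision, recall = precision_recall(confusion, k)
    print(f"{name}: precision={precision:.2f} recall={recall:.2f}")
# Tracy: precision=0.75 recall=0.96
# Trish: precision=0.96 recall=0.73
# Steven: precision=1.00 recall=0.92
```

For example, 36 faces were predicted as Tracy (27 + 8 + 1) and 27 of them were correct, giving a precision of 0.75, while 27 of the 28 true Tracy faces were found, giving a recall of 0.96.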
