Face recognition can be done by extracting face features. However, face recognition can be spoofed by photo or video recording to mimic person. There are some type of attack of spoof:
-
Replay attack
Replay attack in face recognition spoofing occurs when an attacker uses a recorded video or series of images of the victim to deceive the face recognition system.
-
Print attack
Print attack in face recognition spoofing occurs when an attacker uses printed photos of the victim to deceive the face recognition system.
Therefore, we need to add face recognition with anti-spoof detection using deep learning model.
The system should handle variations in lighting conditions, facial expressions, and angles, and be robust against different spoofing attacks.
This model was trained using the CelebA Spoof Dataset (GitHub | Kaggle).
To achieve the requirements, we need to use image augmentation techniques so that the model can be robust to detect spoof attack.
Lighting Conditions
Lighting condition can be obtained by adding the following techniques:
- ISO Noise - Camera has a capability to adjust ISO number based on the environment. ISO number will be higher in the darker condition. However, the higher ISO number, the more noise will come
- Motion Blur - With darker condition, camera will slow the shutter speed so that it can absorb as many light as possible. However, slower shutter speed can give some blurriness to the image
Angles
Angles variation can be obtained by adding the following techniques:
- Horizontal and Vertical flip
- Affine
- Shift Scale Rotate
Others
Other image augmentation techniques can help model to train variation condition of the image:
- Image Compression - Correlate with the quality of image
- Gaussian Noise - Correlate with noise image
- Normalize - To make the image can be learned easily by the model
- Coarse Dropout - Bad image still can be used to spoof attack, so we need model to learn about it.
- Resizing
MobileNet v2 is used to show the training process in this notebook. MobileNet v2 is light model and can be used for general purpose. I want to use FaceNet since it is specialized for face recognition, but I couldn't find the way to import the pre-trained model.
However, because of the lack of resources, the training process is not completely done. In the real case, the adequate resources should be available for training.
Open the notebook here.
Because of the limit time to complete this project, I narrow down the scope into two steps:
- Show how I did the data preprocessing by using several image augmentation techniques in Jupyter notebook
- Show how the pre-trained model is built into API so that I can predict sample video to get the result through the API
Also, this model only tested for recognizing real person and fake person from a photo through webcam.
When I tried to download the smallest zip file of CelebA dataset from this Google Drive, I found the file was corrupted. Hence, need to download whole zip to solve the issue.
I am using several libraries to finish this project:
- Python 3.10
- ONNXRuntime
- FastAPI
Run stream prediction will use webcam to stream a video and do the real-time prediction.
stream-prediction.mp4
Run offline prediction will take a video from directory and store the result into directory. The video is trimmed because prediction takes several minutes to complete.
script-prediction.mp4
Run prediction through API is uploading the video to API (using Postman) and store the result into directory. The video is trimmed because prediction takes several minutes to complete.
api-prediction.mp4
Export environmental variable, BASE_PATH
export BASE_PATH=$(pwd)
Install the requirements first by this command:
pip install -r requirements.txt
Run this command to use webcam and do not store any result video:
python main.py
To predict specific video and store the result to another folder, run this command
python main.py --input /path/to/input_video --output /path/to/output_video
For more details about the argument, see below
--input (-i)
- Path to video for predictions, if not passed webcam stream will be processed
--output (-o)
- Path to save processed video, if not passed the processed video will not be saved
--model_path (-m)
- Path to pretrained ONNX model
--threshold (-t)
- Real face probability threshold above which the prediction is considered true, default 0.5
Export environmental variable, BASE_PATH
export BASE_PATH=$(pwd)
Steps:
- Install all requirements using this command:
pip install -r requirements.txt
- Run this command:
uvicorn main:app --port=4444 --reload
- Open Postman
- Create new request
- Change method to POST
- Put the endpoint link:
localhost:4444/video/detect-faces
- Go to
Body
tab - Select
form-data
- Put value in
Key
column withvideo_file
and click the dropdown and select File - Upload the recorded video
- Click Send button
- Run command:
uvicord main:app --port=4444 --reload
- Open in the browser:
localhost:4444/docs
There are two resources to complete this technical test:
- Fine tune on CelebA notebook. Visit this link.
- Face Recognition Anti Spoofing project. Visit this link.
I used some code from those resources with some modification