Development code on face semantic segmentation for Extended Labeled Faces In-The-Wild (ELFW). Dataset and further details at the Project Site.
Examples from the ELFW dataset. Original LFW categories background, skin, and hair, new categories beard-mustache, sunglasses, head-wearable, and exclusively synthetic mouth-mask. (left) Re-labeled faces with manual refinement compared to the original LFW labels in blue background, (right-top) faces previously unlabeled in LFW, and (right-bottom) synthetic object augmentation with sunglasses, mouth-masks, and occluding hands.
run_trainer.py
: main file to be run for training (see below).trainer.py
: the trainer, i.e. SGD, scheduler, epochs, loss, and all deep learning artillery.models.py
: the NN architectures, namely FCN, DeeplabV3, and GCN.elfw.py
: the dataloader and label conversion utilities for the ELFW dataset.transform.py
: image transformations (scaling, flips, relabeling,...) for data augmentation.metrics.py
: useful compendium of metrics including pixel accuracy, mean accuracy, mean IoU, frequency weighted, and Mean F1-Score.utils.py
: some utilities for console output, time metering, or early-stopper.visualize.py
: handy visdom class for performance visualization on web navigator.tester.py
: use this file for segmenting an image of your own once having a trained model.
Training settings are described in run_trainer.py
. Some arguments are called via console, while other hyperparameters are fixed. See list_experiments.sh
for an exhaustive list of experiments carried out during the project.
max_epochs = 301 # Maximum number of epochs
lr = 1E-3 # Learning rate
lr_decay = 0.2 # Learning rate decay factor
w_decay = 5E-4 # Weight decay, typically [5e-4]
momentum = 0.99 # Momentum, typically [0.9-0.99]
lr_milestones = [35,90,180] # lr milestones for a multistep lr scheduler
augment = True # random transformations for data augmentation
gcn_levels = 3 # Number of GCN levels, typically 3 for 256x256 and 4 for 512x512 image sizes
Trained models not at disposal for the moment.
Python 3.5
,PyTorch 1.1.0
,TorchVision 0.3.0
,Visdom 0.1.8
,PIL 6.2.1
.- Data augmentation:
OpenCV 3.1.0
andDlib
.
-
FCN: Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 3431–3440 (2015).
-
DeeplabV3: Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE transactions on pattern analysis and machine intelligence 40(4), 834–848 (2018).
-
GCN: Peng, C., Zhang, X., Yu, G., Luo, G., & Sun, J. (2017). Large kernel matters--improve semantic segmentation by global convolutional network. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 4353-4361). [ Not considered for the paper ]
The folder ./demos
contains code snippets to test the GCN model over the webcam. Have a look to Jupyter Notebooks or Colab.
The folder ./scripts
contains some useful code tools for labeling and processing the dataset. The main files are:
computeClassWeights.py
: computes weight for class balancing over the training loss.lfw-makeThemWearMasks.py
: overlays synthetic masks (must provide) on face images.lfw-makeThemWearSunglasses.py
: same for sunglasses.lfw-putYourHandsOnMeWithDlib.py
: same for hands based on Dlib.lfw-scribbleMe.py
: tool for label annotation by filling superpixels on mouse scribbling.lfw-refineMe.py
: tool for refining the annotated segments.
Rafael Redondo and Jaume Gibert (c) 2019-20 Eurecat