Visit our project page for accessing the paper, and the pre-computed results.
This is the implementation (in PyTorch) of the following paper:
Kevis-Kokitsi Maninis, Ilija Radosavovic, and Iasonas Kokkinos. "Attentive Single-Tasking of Multiple Tasks", in CVPR 2019.
@InProceedings{MRK19,
Author = {Kevis-Kokitsi Maninis and Ilija Radosavovic and Iasonas Kokkinos},
Title = {Attentive Single-Tasking of Multiple Tasks},
Booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
Year = {2019}
}
In this work we address task interference in universal networks by considering that a network is trained on multiple tasks, but performs one task at a time, an approach we refer to as "single-tasking multiple tasks". The network thus modifies its behaviour through task-dependent feature adaptation, or task attention. This gives the network the ability to accentuate the features that are adapted to a task, while shunning irrelevant ones. We further reduce task interference by forcing the task gradients to be statistically indistinguishable through adversarial training, ensuring that the common backbone architecture serving all tasks is not dominated by any of the task-specific gradients.
Results in three multi-task dense labelling problems consistently show: (i) a large reduction in the number of parameters while preserving, or even improving performance and (ii) a smooth trade-off between computation and multi-task accuracy.
Example performance on PASCAL Context, for ResNet-101 backbone (more results in the paper):
Experiment | Edge Detection (F) | Semantic Segmentation (mIoU) | Human Parts (mIoU) | Surface Normals (mErr) | Saliency (mIoU) | Average Drop (%) |
---|---|---|---|---|---|---|
Single Task | 72.7 | 68.30 | 60.70 | 14.61 | 65.40 | - |
MTL w/o ASTMT | 69.2 | 63.20 | 55.10 | 16.04 | 63.60 | 6.81 |
MTL w/ ASTMT | 72.4 | 68.00 | 61.12 | 14.68 | 65.71 | 0.04 |
This repository is released under the following LICENSE.
This code was tested with Python 3.6, PyTorch 0.4.1/1.0, and CUDA 9.0.
-
Install PyTorch
conda install pytorch=0.4.1 torchvision cuda90 -c pytorch
-
Install additional dependencies.
conda install scikit-image pillow pip install graphviz opencv-python easydict pycocotools pip install tensorboard tensorboardx tensorflow
-
Clone the multi-task repo.
git clone https://github.com/facebookresearch/astmt.git
-
Add the multi-task package directory, easiest if you initialize it into your ~/.bashrc.
echo "export PYTHONPATH=$PYTHONPATH:/path/to/this/repo/" >> ~/.bashrc
-
Move (copy)
mypath.py
underdutil/
. Complete the/path/to/something/
paths.
The experiments that result in the findings of the paper can be found under experiments/dense_predict/
.
For example, in order to test ASTMT on PASCAL (5 tasks) using modulation (squeeze and excitation, and residual adapters), and adversarial loss, run:
cd pascal_resnet
python main.py --resume_epoch 60
To train the model run:
python main.py
This will train a resnet26 version of ASTMT. To use different setup (deeper network, etc.) please check config.py
inside each experiment.
cd /path/to/experiment
tensorboard --logdir . --port XXXX
.If you are using port forwarding to your local machine, access throughlocalhost:XXXX
.
Evaluation scripts are run at the end of each experiment and the results are dumped into the experiment's folder. The evaluation part of boundary detection is disabled by default, since we used the MATLAB-based repo of seism.
Briefly, the following metrics has been implemented:
edge
: F-measures per dataset (odsF), F-measure per Instance (oisF), and Average Precision (AP).semseg
: mean Intersection over Union (mIoU) per class.human_part
: mean Intersection over Union (mIoU) per class.normals
: Difference in predicted and ground-truth angles (mean, median, RMSE, < 11.25, < 22.5, < 30).sal
: Maximal mean Intersection over Union (max mIoU) and maximal F (maxF).albedo
: Mean Squared Error (MSE).depth
: Mean Squared Error (MSE).
Enjoy :)