Weakly-Segmentation

List of useful codes and papers for weakly supervised Semantic/Instance/Panoptic/Few Shot Segmentation

Weakly-Segmentation
- Top Work
  - By Dataset
    - PASCAL VOC2012
  - By Years
    - ICCV2019
- Resources
  - Tutorial
- Implementation
Related Tasks
Reading List
- Under Review
- Published
  - context
  - graph
  - bbox-level
  - webly
  - Saliency
  - localization
  - spp
  - affinity
  - region
  - network
  - regularizer
  - evaluation measure
  - architecture
  - generative adversarial
  - scene understanding
  - other useful
  - application
- Others

Top Work

By Dataset

PASCAL VOC2012

method	val	test	notes
DSRG_CVPR2018	61.4	63.2	deep seeded region growing, resnet-lfov\|vgg-aspp
psa_CVPR2018	61.7	63.7	pixel affinity network, resnet38
MDC_CVPR2018	60.4	60.8	multi-dilated convolution, vgg-lfov
MCOF_CVPR2018	60.3	61.2	iterative, RegionNet(sppx), resnet-lfov
GAIN_CVPR2018	55.3	56.8
DCSP_BMVC2017	58.6	59.2	adversarial for saliency, and generate cues by cam+saliency(harmonic mean)
GuidedSeg_CVPR2017	55.7	56.7	saliency, TBD
BDSSW_CVPR2018	63.0	63.9	webly, filter+enhance
WegSeg_arxiv	63.1	63.3	webly(pure), Noise filter module
SeeNet_NIPS2018	63.1	62.8	based on DCSP
Graph_ECCV2018	63.6	64.5	graph partition
Graph_ECCV2018	64.5	65.6	use simple ImageNet dataset additionally
CIAN_CVPR2019	64.1	64.7	cross image affinity network
FickleNet_CVPR2019	64.9	65.3	use dropout (a generalization of dilated convolution)

By Years

ICCV2019

Joint Learning of Saliency Detection and Weakly Supervised Semantic Segmentation
Self-Supervised Difference Detection for Weakly-Supervised Semantic Segmentation

CVPR2019

FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stochastic Inference

Resources

see this for more weakly lists and resources.
see this for more semantic/instance/panoptic/video segmentation lists and resources. see this for more implementations
a good architecture summary paper:Learning a Discriminative Feature Network for Semantic Segmentation

Tutorial

Unsupervised Visual Learning Tutorial. CVPR 2018 [part 1] [part 2]
Weakly Supervised Learning for Computer Vision. CVPR 2018 [web] [part 1] [part 2]

Implementation

pytorch-segmentation-detection a library for dense inference and training of Convolutional Neural Networks, 68.0%

rdn Dilated Residual Networks, 75.6%, may be the best available semantic segmentation in PyTorch?

Detectron.pytorch A pytorch implementation of Detectron. Both training from scratch and inferring directly from pretrained Detectron weights are available. only for coco now

AdvSemiSeg Adversarial Learning for Semi-supervised Semantic Segmentation. heavily borrowed from a pytorch DeepLab implementation (Link)

PyTorch-ENet PyTorch implementation of ENet

tensorflow-deeplab-resnet Tensorflow implementation of deeplab-resnet(deeplabv2, resnet101-based): complete and detailed

tensorflow-deeplab-lfov Tensorflow implementation of deeplab-LargeFOV(deeplabv2, vgg16-based): complete and detailed

resnet38 Wider or Deeper: Revisiting the ResNet Model for Visual Recognition: implemented using MXNET

pytorch_deeplab_large_fov: deeplab v1

pytorch-deeplab-resnetDeepLab resnet v2 model in pytorch

DeepLab-ResNet-Pytorch Deeplab v3 model in pytorch,

BDWSS Bootstrapping the Performance of Webly Supervised Semantic Segmentation

psa Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation

DSRG: Caffe, CAM and DRFI provided

SEC

original: Caffe
BDSSW: MXNET
SEC-tensorflow: tensorflow

Related Tasks

Few-shot segmentation

One-shot learning for semantic segmentation, BMVC2017
Conditional networks for few-shot semantic segmentation, ICLR2018 Workshop
Few-Shot Segmentation Propagation with Guided Networks, preprint
Few-Shot Semantic Segmentation with Prototype Learning, BMVC2018
Attention-based Multi-Context Guiding for Few-Shot Semantic Segmentation, AAAI2019
CANet: Class-Agnostic Segmentation Networks with Iterative Refinement and Attentive Few-Shot Learning, CVPR2019
One-Shot Segmentation in Clutter, ICML 2018

Weakly-supervised Instance Segmentation

Weakly Supervised Instance Segmentation using Class Peak Response, CVPR2018
Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations, CVPR2019
Object Counting and Instance Segmentation with Image-level Supervision, CVPR2019
Cyclic Guidance for Weakly Supervised Joint Detection and Segmentation, CVPR2019
Where are the Masks: Instance Segmentation with Image-level Supervision, BMVC2019
Label-PEnet: Sequential Label Propagation and Enhancement Networks for Weakly Supervised Instance Segmentation, ICCV2019

Weakly-supervised Panoptic Segmentation

Weakly- and Semi-Supervised Panoptic Segmentation, ECCV2018

Reading List

Under Review

Published

context

Context Encoding for Semantic Segmentation: CVPR2018. use TEN
The Role of Context for Object Detection and Semantic Segmentation in the Wild: CVPR2014
Objects as Context for Detecting Their Semantic Parts: CVPR2018
Exploring context with deep structured models for semantic segmentation: TPAMI2017
dilated convolution
Deep TEN: Texture encoding network !!: CVPR2017. A global context vector, pooled from all spatial positions, can be concatenated to local features
Refinenet: Multi-path refinement networks for high-resolution semantic segmentation: CVPR2017. local features across different scales can be fused to encode global context
Non-local neural networks: CVPR2018. a densely connected graph with pairwise edges between all pixels

graph

Associating Inter-Image Salient Instances for Weakly Supervised Semantic Segmentation: ECCV2018

bbox-level

Box-driven Class-wise Region Masking and Filling Rate Guided Loss for Weakly Supervised Semantic Segmentation, CVPR2019

webly

Weakly Supervised Semantic Segmentation Based on Web Image Cosegmentation: BMVC2017, training model using masks of web images which are generated by cosegmentation
Webly Supervised Semantic Segmentation: CVPR2017
Weakly Supervised Semantic Segmentation using Web-Crawled Videos: CVPR2017, learns a class-agnostic decoder(attention map -> binary mask), pseudo masks are generated from video frames by solving a graph-based optimization problem.
Bootstrapping the Performance of Webly Supervised Semantic Segmentation: target + web domain, target model filters web images, refine mask by combine target and web masks.
Learning from Weak and Noisy Labels for Semantic Segmentation: TPAMI2017
WebSeg: Learning Semantic Segmentation from Web Searches: arxiv, directly learning from keywork retrievaled web images. using saliency and region(MCG with edge)
STC: A Simple to Complex Framework for Weakly-supervised Semantic Segmentation: TPAMI 2017, Initial, Enhanced, Powerful three DCNN model. inital mask(generated by saliency and label using simple images) -> initial model -> enhanced mask(generated using simple images) -> Enhanced model -> powerful mask(generated using complex images) -> powerful model
- saliency can not handle complex images, so BMVC2017 uses coseg instead

Saliency

Exploiting Saliency for Object Segmentation from Image Level Labels: CVPR2017
Discovering Class-Specific Pixels for Weakly-Supervised Semantic Segmentation: BMVC2017
- combine saliency(off-shelf) and CAM to get cues, use harmonic mean function
- adapt CAM from head of Segmentation Network
- use erasing to get multiple objects' saliency

localization

Adversarial Complementary Learning for Weakly Supervised Object Localization, CVPR2018. two branchs, remove high activations from feature map. code
Tell me where to look: Guided Attention Inference Network, CVPR2018. origin image soft erasing(CAM after sigmoid as attention) -> end2end training, force erased images have zero activation
Self-Erasing Network for Integral Object Attention， NIPS2018: prohibit attentions from spreading to unexpected background regions.
- cam -> tenary mask(attention, background, potential)
- self erasing only in attention + potential region(sign flip in background region instead of setting to 0 simply)
- self produced psedo label for background region(difference to SPG: 1.psedo label for background and attention 2.supervise low layer)
Self-produced Guidance for Weakly-supervised Object localization, ECCV2018:
- self supervised use top down framework, for single label classification prob. add pixel-wise supervision when only have image level label
- B1, B2 sharing
- bottom guide top inversely(B1+B2 -> C)

spp

Superpixel convolutional networks using bilateral inceptions
Learning Superpixels with Segmentation-Aware Affinity Loss: good intro for superpixel algs.

affinity

Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation: image-level, semantic affinity, learn a network to predict affinity
Adaptive Affinity Field for Semantic Segmentation: ECCV2018, semantic affinity. add a pairwise term in seg loss(similarity metric: KL divergence), use an adversarial method to determine optimal neighborhood size

region

Region-Based Convolutional Networks for Accurate Object Detection and Segmentation
Simultaneous Detection and Segmentation, 2014
Feedforward semantic segmentation with zoom-out features: 2015

network

regularizer

evaluation measure

architecture

The Devil is in the Decoders, BMVC2017
Dilated Residual Networks, CVPR2017. Dilated structure design for classification and localization.
Understanding Convolution for Semantic Segmentation, WACV2018. hybrid dilated convolution(2-2-2 -> 1-2-3)
Smoothed Dilated Convolutions for Improved Dense Prediction, KDD2018. separable and share conv(for smoothing) + dilated conv
Deeplab v1, v2, v3, v3+
Learning Fully Dense Neural Networks for Image Semantic Segmentation, AAAI2019

generative adversarial

Deep dual learning for semantic image segmentation:CVPR2017, image translation
Semantic Segmentation using Adversarial Networks, NIPS2016 workshop
- add gan loss branch, Segnet as generator, D: GT mask or predicted mask
Adversarial Learning for Semi-Supervised Semantic Segmentation: BMVC2018
- semi supervised: SegNet as G, FCN-type D(discriminate each location), use output of D as psedo label for unlabeled data
Semi and weakly Supervised Semantic Segmentation Using Generative Adversarial Network: ICCV2017, use SegNet as D, treat fake as new class
- weakly, use conditionalGan, pixel-level, image-level, generated data are included in loss. performance boosts less when increasing fully data
generative adversarial learning towards Fast weakly supervised detection: CVPR2018
Adaptive Affinity Field for Semantic Segmentation: ECCV2018, semantic affinity. add a pairwise term in seg loss(similarity metric: KL divergence), use an adversarial method to determine optimal neighborhood size

scene understanding

ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans
SeGAN: Segmenting and Generating the Invisible

other useful

Learning to Segment Every Thing: semi-supervised, weight transfer function (from bbox parameters to mask parameters)
Simple Does It: Weakly Supervised Instance and Semantic Segmentation: bbox-level, many methods, using graphcut, HED, MCG
Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based on Weakly Supervised Learning: tricky, curriculum learning: image level -> instance level -> pixel level
Combining Bottom-Up, Top-Down, and Smoothness Cues for Weakly Supervised Image Segmentation: CVPR2017
Improving Weakly-Supervised Object Localization By Micro-Annotation: BMVC2016, object classes always co-occur with same background elements(boat, train). propose a new annotation method. add human annotations to improve localization results of CAM, annotating based on clusters of dense features. each class uses a spectral clustering.(CAM has problem)
Co-attention CNNs for Unsupervised Object Co-segmentation: IJCAI 2018
Coarse-to-fine Image Co-segmentation with Intra and Inter Rank Constraints, IJCAI2018
Annotation-Free and One-Shot Learning for Instance Segmentation of Homogeneous Object Clusters, IJCAI2018
Image-level to Pixel-wise Labeling: From Theory to Practice: fully, analysis the effect of image labels on seg results. add a generator(recover original image). image label(binary, use a threshold small than 0.5, eg:0.25), IJCAI2018

application

SeGAN: Segmenting and Generating the Invisible: CVPR2018, generate occluded parts
Learning Hierarchical Semantic Image Manipulation through Structured Representations: NIPS2018, manipulate image on object-level by modify bbox

Others

priors

Superpixels: An Evaluation of the State-of-the-Art link
Learning Superpixels with Segmentation-Aware Affinity Losslink
Superpixel based Continuous Conditional Random Field Neural Network for Semantic Segmentation link

diffusion

Learning random-walk label propagation for weakly-supervised semantic segmentation: scribble

Convolutional Random Walk Networks for Semantic Image Segmetation: fully, affinity branch(low level)

Soft Proposal Networks for Weakly Supervised Object Localization: attention, semantic affinity

Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation: image-level, semantic affinity

analysis

image level to pixel wise labeling: from theory to practice: IJCAI 2018 analysis the effectiveness of class-level labels for segmentation(GT, predicted) Attention based Deep Multiple Instance Learning: ICML 2018. CAM from MIL perspective view

post processing

listed in : Co-attention CNNs for Unsupervised Object Co-segmentation

Otsu’s method
GrabCut
CRF

common methods

refine segmentation results using image-level labels
multi-label classification branch(BDWSS)
generative branch(to original image)
crf

X-Lai/Semantic-Segmentation