/SCNet

The official PyTorch implementation of CVPR 2020 paper "Improving Convolutional Networks with Self-Calibrated Convolutions"

Primary LanguagePythonMIT LicenseMIT

SCNet

The official PyTorch implementation of CVPR 2020 paper "Improving Convolutional Networks with Self-Calibrated Convolutions"

Update

  • 2020.5.15
    • Pretrained model of SCNet-50_v1d with more than 2% improvement on ImageNet top1 acc (80.47 v.s. 77.81). compared with original version of SCNet-50 is released!
    • SCNet-50_v1d achieves comparable performance on other applications such as object detection and instance segmentation to our original SCNet-101 version.
    • Because of limited GPU resources, the pretrained model of SCNet-101_v1d will be released later, as well as more applications' results.

Introduction

we present a novel self-calibrated convolution that explicitly expands fields-of-view of each convolutional layer through internal communications and hence enriches the output features. In particular, unlike the standard convolutions that fuse spatial and channel-wise information using small kernels (e.g., 3 × 3), our self-calibrated convolution adaptively builds long-range spatial and inter-channel dependencies around each spatial location through a novel self-calibration operation. Thus, it can help CNNs generate more discriminative representations by explicitly incorporating richer information. Our self-calibrated convolution design is simple and generic, and can be easily applied to augment standard convolutional layers without introducing extra parameters and complexity. Extensive experiments demonstrate that when applying our self-calibrated convolution into different backbones, the baseline models can be significantly improved in a variety of vision tasks, including image recognition, object detection, instance segmentation, and keypoint detection, with no need to change network architectures.

Figure 1: Diagram of self-calibrated convolution.

Useage

Requirement

PyTorch>=0.4.1

Examples

git clone https://github.com/backseason/SCNet.git

from scnet import scnet50
model = scnet50(pretrained=True)

Input image should be normalized as follows:

normalize = transforms.Normalize(mean=[0.485, 0.456, 0.406],
                                  std=[0.229, 0.224, 0.225])

(The pretrained model should be downloaded automatically by default. You may also choose to download them manually by the links listed below.)

Pretrained models

model #Params MAdds FLOPs top-1 error top-5 error Link 1 Link 2
SCNet-50 25.56M 4.0G 7.9G 22.19 6.08 GoogleDrive BaiduYun pwd: 95p5
SCNet-50_v1d 25.58M 4.7G 9.4G 19.53 4.68 GoogleDrive BaiduYun pwd: hmmt
SCNet-101 44.57M 7.2G 14.4G 21.06 5.75 GoogleDrive BaiduYun pwd: 38oh

Applications (more coming soon...)

Object detection

We use Faster R-CNN architecture with feature pyramid networks (FPNs) as baselines. We adopt the widely used mmdetection framework to run all our experiments. Performances are reported on the COCO minival set.

backbone AP AP.5 AP.75 APs APm APl
ResNet-50 37.6 59.4 40.4 21.9 41.2 48.4
SCNet-50 40.8 62.7 44.5 24.4 44.8 53.1
SCNet-50_v1d 41.8 62.9 45.5 24.8 45.3 54.8
ResNet-101 39.9 61.2 43.5 23.5 43.9 51.7
SCNet-101 42.0 63.7 45.5 24.4 46.3 54.6

Instance segmentation

We use Mask R-CNN architecture with feature pyramid networks (FPNs) as baselines. We adopt the widely used mmdetection framework to run all our experiments. Performances are reported on the COCO minival set.

backbone AP AP.5 AP.75 APs APm APl
esNet-50 35.0 56.5 37.4 18.3 38.2 48.3
SCNet-50 37.2 59.9 39.5 17.8 40.3 54.2
SCNet-50_v1d 38.5 60.6 41.3 20.8 42.0 52.6
ResNet-101 36.7 58.6 39.3 19.3 40.3 50.9
SCNet-101 38.4 61.0 41.0 18.2 41.6 56.6

Other applications such as Instance segmentation, Object detection, Semantic segmentation, and Human keypoint detection can be found on https://mmcheng.net/scconv/.

Citation

If you find this work or code is helpful in your research, please cite:

@inproceedings{liu2020scnet,
 title={Improving Convolutional Networks with Self-Calibrated Convolutions},
 author={Jiang-Jiang Liu and Qibin Hou and Ming-Ming Cheng and Changhu Wang and Jiashi Feng},
 booktitle={IEEE CVPR},
 year={2020},
}

Contact

If you have any questions, feel free to contact me via: j04.liu(at)gmail.com.