Deep-CSC-Networks-For-Image-Fusion

Deep Convolutional Sparse Coding Networks for Image Fusion. arxiv

Shuang Xu *, Zixiang Zhao *, Yicheng Wang, Kai Sun, Chunxia Zhang, Junmin Liu, Jiangshe Zhang. (* equal contributions)

Abstract

Image fusion is a significant problem in many fields including digital photography, computational imaging and remote sensing, to name but a few. Recently, deep learning has emerged as an important tool for image fusion. This paper presents three deep convolutional sparse coding (CSC) networks for three kinds of image fusion tasks (i.e., infrared and visible image fusion, multi-exposure image fusion, and multi-modal image fusion), where the CSC model and the iterative shrinkage and thresholding algorithm are generalized into dictionary convolution units. As a result, all hyper-parameters in the CSC model are learned from data. Our extensive experiments and comprehensive comparisons reveal the superiority of the proposed networks with regard to quantitative evaluation and visual inspection.

Software

CSC-IVFN: Coming Soon

CSC-MEFN: website

CSC-MMFN: website

CSC Unfolding

CSC solves the following optimization problem:

$\min_{\boldsymbol z}\frac{1}{2}\|\boldsymbol{x}-\boldsymbol{d}*\boldsymbol{z}\|_2^2+\lambda g(\boldsymbol{z})$

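where $\boldsymbol{x}$ is the input image, $\boldsymbol{d}$ is the convolutional dictionary (the filters), $\lambda$ is a hyperparameter, $*$ denotes the convolution operator, $\boldsymbol{z}\in R^{q\times h \times w}$ is the sparse feature map (also called the code), and $g(\cdot)$ is a sparse regularizer. This problem can be solved by the iterative shrinkage and thresholding algorithm (ISTA), whose update rule with step size $1/\rho$ is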

$\boldsymbol{z}^{(k+1)} \leftarrow \mathrm{prox}_{\lambda/\rho}\left(\boldsymbol{z}^{(k)}+\frac{1}{\rho}\boldsymbol{d}^T*(\boldsymbol{x}-\boldsymbol{d}*\boldsymbol{z}^{(k)})\right).$
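To make the update rule concrete, here is a minimal NumPy sketch of ISTA on a 1-D signal. It is an illustration under stated assumptions, not the code used in the paper: the regularizer $g$ is assumed to be the $\ell_1$ norm (so the proximal operator is soft thresholding), convolutions use "same" padding with odd-length filters, and the function names (`synthesize`, `adjoint`, `ista`) are hypothetical.

```python
import numpy as np

def soft_threshold(v, t):
    # Proximal operator of t * ||.||_1 (assumes g is the l1 norm).
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def synthesize(d, z):
    # d * z: sum over the q filters of d[j] convolved with code channel z[j].
    return sum(np.convolve(z[j], d[j], mode="same") for j in range(len(d)))

def adjoint(d, r):
    # d^T * r: correlation with each filter is the adjoint of "same"
    # convolution when the filter length is odd.
    return np.stack([np.correlate(r, d[j], mode="same") for j in range(len(d))])

def objective(x, d, z, lam):
    return 0.5 * np.sum((x - synthesize(d, z)) ** 2) + lam * np.sum(np.abs(z))

def ista(x, d, lam, n_iter=50):
    # rho upper-bounds the Lipschitz constant of the data-term gradient,
    # which keeps the objective non-increasing.
    rho = sum(np.sum(np.abs(dj)) ** 2 for dj in d)
    z = np.zeros((len(d), len(x)))
    history = [objective(x, d, z, lam)]
    for _ in range(n_iter):
        residual = x - synthesize(d, z)
        z = soft_threshold(z + adjoint(d, residual) / rho, lam / rho)
        history.append(objective(x, d, z, lam))
    return z, history

# Toy problem: a signal built from a sparse code and 3 random filters.
rng = np.random.default_rng(0)
d = rng.standard_normal((3, 5))
z_true = np.zeros((3, 64))
z_true[rng.integers(0, 3, 8), rng.integers(0, 64, 8)] = 1.0
x = synthesize(d, z_true)
z_hat, history = ista(x, d, lam=0.1)
```

In this sketch $\lambda$ and $\rho$ are set by hand, and these are the quantities that the unfolded network learns from data instead.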

We replace some of these operations with standard neural-network components and rewrite the update rule as

$\boldsymbol{z}^{(k+1)} = f\left( {\rm BN}\left( \boldsymbol{z}^{(k)}+\mathrm{Conv}_1(\boldsymbol{x}-\mathrm{Conv}_0(\boldsymbol{z}^{(k)})) \right) \right).$

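This update is called the dictionary convolutional unit (DCU). Comparing it with the ISTA rule term by term, the learnable convolutions $\mathrm{Conv}_0$ and $\mathrm{Conv}_1$ take the place of $\boldsymbol{d}*$ and $\frac{1}{\rho}\boldsymbol{d}^T*$, BN denotes batch normalization, and the activation function $f$ takes the place of the proximal operator.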
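The data flow of a single DCU can be sketched as a plain NumPy forward pass. This is a simplified 1-D, single-sample illustration rather than the released implementation: the activation $f$ is assumed to be soft thresholding, BN is approximated by normalizing each channel over its positions, the weights are random instead of trained, and the names `conv_bank`, `batch_norm`, and `dcu` are hypothetical.

```python
import numpy as np

def conv_bank(inp, weight):
    # inp: (c_in, n), weight: (c_out, c_in, m) -> (c_out, n), "same" padding.
    c_out, c_in, _ = weight.shape
    return np.stack([
        sum(np.convolve(inp[i], weight[o, i], mode="same") for i in range(c_in))
        for o in range(c_out)
    ])

def batch_norm(v, gamma, beta, eps=1e-5):
    # Single-sample stand-in for BN: normalize each channel over its positions.
    mean = v.mean(axis=1, keepdims=True)
    var = v.var(axis=1, keepdims=True)
    return gamma[:, None] * (v - mean) / np.sqrt(var + eps) + beta[:, None]

def soft_threshold(v, t):
    # Assumed choice for the activation f.
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def dcu(x, z, w0, w1, gamma, beta, threshold):
    residual = x - conv_bank(z, w0)      # x - Conv_0(z)
    pre = z + conv_bank(residual, w1)    # z + Conv_1(x - Conv_0(z))
    return soft_threshold(batch_norm(pre, gamma, beta), threshold)

# Stack three DCUs as hidden layers, starting from an all-zero code.
rng = np.random.default_rng(0)
q, n, m = 4, 32, 3
x = rng.standard_normal((1, n))          # single-channel input signal
z = np.zeros((q, n))
for _ in range(3):
    w0 = 0.1 * rng.standard_normal((1, q, m))   # Conv_0: code -> signal
    w1 = 0.1 * rng.standard_normal((q, 1, m))   # Conv_1: signal -> code
    z = dcu(x, z, w0, w1, np.ones(q), np.zeros(q), threshold=0.5)
```

Each layer has its own `w0`, `w1`, `gamma`, `beta`, and `threshold`; in a trained network these would be learned parameters, which is how the CSC hyper-parameters end up being fitted from data.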

Network Structure

In our paper, DCUs serve as the hidden layers of deep networks. We design three networks, one each for infrared and visible image fusion, multi-exposure image fusion, and multi-modal image fusion, as shown in the following figure.
