DoubleU-Net starts with a VGG19 as encoder sub-network, which is followed by decoder sub-network. In the network, the input image is fed to the modified UNet(UNet1), which generates predicted masks (i.e., output1). We then multiply the input image and the produced masks (i.e., output1), which acts as an input for the second modified U-Net(UNet2) that produces another the generated mask (output2). Finally, we concatenate both the masks (output1 and output2) to get the final predicted mask (output).
The following datasets are used in this experiment:
- MICCAI 2015 Segmentation challenge(CVC-ClinicDB for training and ETIS-Larib for Testing)
- CVC-ClinicDB
- Lesion Boundary segmentation challenge/li>
- 2018 Data Science Bowl challenge
- Batch size = 16
- Number of epoch = 300
Dataset Name | Loss | Optimizer | Learning Rate |
MICCAI 2015 Challenge Dataset | Binary crossentropy | Nadam | 1e-5 |
CVC-ClinicDB | Binary crossentropy | Nadam | 1e-5 |
Lesion Boundary segmentation challenge | Dice loss | Adam | 1e-4 |
2018 Data Science Bowl challenge | Dice loss | Nadam | 1e-4 |
The model is trained on CVC-ClinicDB and tested on the ETIS-Larib polyps dataset.