Authors: Deng-Ping Fan, Tao Zhou, Ge-Peng Ji, Yi Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, and Ling Shao.
-
This repository provides code for "Inf-Net: Automatic COVID-19 Lung Infection Segmentation from CT Images" TMI-2020. (arXiv Pre-print & medrXiv & 中译版)
-
If you have any questions about our paper, feel free to contact us. And if you are using COVID-SemiSeg Dataset, Inf-Net or evaluation toolbox for your research, please cite this paper (BibTeX).
-
We elaborately collect COVID-19 imaging-based AI research papers and datasets awesome-list.
-
[2020/10/25] Uploading 中文翻译版.
-
[2020/10/14] Updating the legend (1 * 1 -> 3 * 3; 3 * 3 -> 1 * 1) of Fig.3 in our manuscript.
-
[2020/08/15] Updating the equation (2) in our manuscript.
R_i = C(f_i, Dow(e_att)) * A_i -> R_i = C(f_i * A_i, Dow(e_{att})); -
[2020/08/15] Optimizing the testing code, now you can test the custom data without
gt_path
-
[2020/05/15] Our paper is accepted for publication in IEEE TMI
-
[2020/05/13] 💥 Upload pre-trained weights. (Uploaded by Ge-Peng Ji)
-
[2020/05/12] 💥 Release training/testing/evaluation code. (Updated by Ge-Peng Ji)
-
[2020/05/01] Create repository.
- Inf-Net: Automatic COVID-19 Lung Infection Segmentation from CT Images
Table of contents generated with markdown-toc
Figure 1. Example of COVID-19 infected regions in CT axial slice, where the red and green masks denote the
ground-glass opacity (GGO) and consolidation, respectively. The images are collected from [1].
[1] COVID-19 CT segmentation dataset, link: https://medicalsegmentation.com/covid19/, accessed: 2020-04-11.
-
Preview:
Our proposed methods consist of three individual components under three different settings:
-
Inf-Net (Supervised learning with segmentation).
-
Semi-Inf-Net (Semi-supervised learning with doctor label and pseudo label)
-
Semi-Inf-Net + Multi-Class UNet (Extended to Multi-class Segmentation, including Background, Ground-glass Opacities, and Consolidation).
-
-
Dataset Preparation:
Firstly, you should download the testing/training set (Google Drive Link) and put it into
./Dataset/
repository. -
Download the Pretrained Model:
ImageNet Pre-trained Models used in our paper ( VGGNet16, ResNet, and Res2Net), and put them into
./Snapshots/pre_trained/
repository. -
Configuring your environment (Prerequisites):
Note that Inf-Net series is only tested on Ubuntu OS 16.04 with the following environments (CUDA-10.0). It may work on other operating systems as well but we do not guarantee that it will.
-
Creating a virtual environment in terminal:
conda create -n SINet python=3.6
. -
Installing necessary packages:
pip install -r requirements.txt
.
- Installing THOP for counting the FLOPs/Params of model.
-
Figure 2. The architecture of our proposed Inf-Net model, which consists of three reverse attention
(RA) modules connected to the paralleled partial decoder (PPD).
-
Train
-
We provide multiple backbone versions (see this line) in the training phase, i.e., ResNet, Res2Net, and VGGNet, but we only provide the Res2Net version in the Semi-Inf-Net. Also, you can try other backbones you prefer to, but the pseudo labels should be RE-GENERATED with corresponding backbone.
-
Turn off the semi-supervised mode (
--is_semi=False
) turn off the flag of whether use pseudo labels (--is_pseudo=False
) in the parser ofMyTrain_LungInf.py
and just run it! (see this line)
-
-
Test
-
When training is completed, the weights will be saved in
./Snapshots/save_weights/Inf-Net/
. You can also directly download the pre-trained weights from Google Drive. -
Assign the path
--pth_path
of trained weights and--save_path
of results save and inMyTest_LungInf.py
. -
Just run it and results will be saved in
./Results/Lung infection segmentation/Inf-Net
-
Figure 3. Overview of the proposed Semi-supervised Inf-Net framework.
-
Data Preparation for pseudo-label generation. (Optional)
-
Dividing the 1600 unlabeled image into 320 groups (1600/K groups, we set K=5 in our implementation), in which images with
*.jpg
format can be found in./Dataset/TrainingSet/LungInfection-Train/Pseudo-label/Imgs/
. (I suppose you have downloaded all the train/test images following the instructions above) Then you only just run the code stored in./SrcCode/utils/split_1600.py
to split it into multiple sub-dataset, which are used in the training process of pseudo-label generation. The 1600/K sub-datasets will be saved in./Dataset/TrainingSet/LungInfection-Train/Pseudo-label/DataPrepare/Imgs_split/
-
You can also skip this process and download them from Google Drive that is used in our implementation.
-
-
Generating Pseudo Labels (Optional)
-
After preparing all the data, just run
PseudoGenerator.py
. It may take at least day and a half to finish the whole generation. -
You can also skip this process and download intermediate generated file from Google Drive that is used in our implementation.
-
When training is completed, the images with pseudo labels will be saved in
./Dataset/TrainingSet/LungInfection-Train/Pseudo-label/
.
-
-
Train
-
Firstly, turn off the semi-supervised mode (
--is_semi=False
) and turn on the flag of whether using pseudo labels (--is_pseudo=True
) in the parser ofMyTrain_LungInf.py
and modify the path of training data to the pseudo-label repository (--train_path='Dataset/TrainingSet/LungInfection-Train/Pseudo-label'
). Just run it! -
When training is completed, the weights (trained on pseudo-label) will be saved in
./Snapshots/save_weights/Inf-Net_Pseduo/Inf-Net_pseudo_100.pth
. Also, you can directly download the pre-trained weights from Google Drive. Now we have prepared the weights that is pre-trained on 1600 images with pseudo labels. Please note that these valuable images/labels can promote the performance and the stability of training process, because of ImageNet pre-trained models are just design for general object classification/detection/segmentation tasks initially. -
Secondly, turn on the semi-supervised mode (
--is_semi=True
) and turn off the flag of whether using pseudo labels (--is_pseudo=False
) in the parser ofMyTrain_LungInf.py
and modify the path of training data to the doctor-label (50 images) repository (--train_path='Dataset/TrainingSet/LungInfection-Train/Doctor-label'
). Just run it.
-
-
Test
-
When training is completed, the weights will be saved in
./Snapshots/save_weights/Semi-Inf-Net/
. You also can directly download the pre-trained weights from Google Drive. -
Assign the path
--pth_path
of trained weights and--save_path
of results save and inMyTest_LungInf.py
. -
Just run it! And results will be saved in
./Results/Lung infection segmentation/Semi-Inf-Net
.
-
Here, we provide a general and simple framework to address the multi-class segmentation problem. We modify the original design of UNet that is used for binary segmentation, and thus, we name it as Multi-class UNet. More details can be found in our paper.
Figure 3. Overview of the proposed Semi-supervised Inf-Net framework.
-
Train
-
Just run
MyTrain_MulClsLungInf_UNet.py
-
Note that
./Dataset/TrainingSet/MultiClassInfection-Train/Prior
is just borrowed from./Dataset/TestingSet/LungInfection-Test/GT/
, and thus, two repositories are equally.
-
-
Test
-
When training is completed, the weights will be saved in
./Snapshots/save_weights/Semi-Inf-Net_UNet/
. Also, you can directly download the pre-trained weights from Google Drive. -
Assigning the path of weights in parameters
snapshot_dir
and runMyTest_MulClsLungInf_UNet.py
. All the predictions will be saved in./Results/Multi-class lung infection segmentation/Consolidation
and./Results/Multi-class lung infection segmentation/Ground-glass opacities
.
-
We provide one-key evaluation toolbox for LungInfection Segmentation tasks, including Lung-Infection and Multi-Class-Infection. Please download the evaluation toolbox Google Drive.
-
Prerequisites: MATLAB Software (Windows/Linux OS is both works, however, we suggest you test it in the Linux OS for convenience.)
-
run
cd ./Evaluation/
andmatlab
open the Matlab software via terminal -
Just run
main.m
to get the overall evaluation results. -
Edit the parameters in the
main.m
to evaluate your custom methods. Please refer to the instructions in themain.m
.
We also build a semi-supervised COVID-19 infection segmentation (COVID-SemiSeg) dataset, with 100 labelled CT scans from the COVID-19 CT Segmentation dataset [1] and 1600 unlabeled images from the COVID-19 CT Collection dataset [2]. Our COVID-SemiSeg Dataset can be downloaded at Google Drive.
[1]“COVID-19 CT segmentation dataset,” https://medicalsegmentation.com/covid19/, accessed: 2020-04-11. [2]J. P. Cohen, P. Morrison, and L. Dao, “COVID-19 image data collection,” arXiv, 2020.
-
Lung infection which consists of 50 labels by doctors (Doctor-label) and 1600 pseudo labels generated (Pseudo-label) by our Semi-Inf-Net model. Download Link.
-
Multi-Class lung infection which also composed of 50 multi-class labels (GT) by doctors and 50 lung infection labels (Prior) generated by our Semi-Inf-Net model. Download Link.
-
The Lung infection segmentation set contains 48 images associate with 48 GT. Download Link.
-
The Multi-Class lung infection segmentation set has 48 images and 48 GT. Download Link.
== Note that ==: In our manuscript, we said that the total testing images are 50. However, we found there are two images with very small resolution and black ground-truth. Thus, we discard these two images in our testing set. The above link only contains 48 testing images.
To compare the infection regions segmentation performance, we consider the two state-of-the-art models U-Net and U-Net++. We also show the multi-class infection labelling results in Fig. 5. As can be observed, our model, Semi-Inf-Net & FCN8s, consistently performs the best among all methods. It is worth noting that both GGO and consolidation infections are accurately segmented by Semi-Inf-Net & FCN8s, which further demonstrates the advantage of our model. In contrast, the baseline methods, DeepLabV3+ with different strides and FCNs, all obtain unsatisfactory results, where neither GGO and consolidation infections can be accurately segmented.
Overall results can be downloaded from this link.
Lung infection segmentation results can be downloaded from this link
Multi-class lung infection segmentation can be downloaded from this link
Figure 4. Visual comparison of lung infection segmentation results.
Figure 5. Visual comparison of multi-class lung infection segmentation results, where the red and green labels
indicate the GGO and consolidation, respectively.
Ori GitHub Link: https://github.com/HzFu/COVID19_imaging_AI_paper_list
Figure 6. This is a collection of COVID-19 imaging-based AI research papers and datasets.
https://arxiv.org/pdf/2004.14133.pdf
Please cite our paper if you find the work useful:
@article{fan2020inf,
title={Inf-Net: Automatic COVID-19 Lung Infection Segmentation from CT Images},
author={Fan, Deng-Ping and Zhou, Tao and Ji, Ge-Peng and Zhou, Yi and Chen, Geng and Fu, Huazhu and Shen, Jianbing and Shao, Ling},
journal={IEEE TMI},
year={2020}
}
-
The COVID-SemiSeg Dataset is made available for non-commercial purposes only.
-
You will not, directly or indirectly, reproduce, use, or convey the COVID-SemiSeg Dataset or any Content, or any work product or data derived therefrom, for commercial purposes.
We would like to thank the whole organizing committee for considering the publication of our paper in this special issue (Special Issue on Imaging-Based Diagnosis of COVID-19) of IEEE Transactions on Medical Imaging. More papers refer to Link.
If you want to improve the usability of code or any other pieces of advice, please feel free to contact me directly (E-mail).
-
Support
NVIDIA APEX
training. -
Support different backbones ( VGGNet (done), ResNet, ResNeXt Res2Net (done), iResNet, and ResNeSt etc.)
-
Support distributed training.
-
Support lightweight architecture and faster inference, like MobileNet, SqueezeNet.
-
Support distributed training
-
Add more comprehensive competitors.
-
If the image cannot be loaded in the page (mostly in the domestic network situations).
-
I tested the U-Net, however, the Dice score is different from the score in TABLE II (Page 8 on our manuscript)?
Note that, the our Dice score is the mean dice score rather than the max Dice score. You can use our evaluation tool box Google Drive. The training set of each compared model (e.g., U-Net, Attention-UNet, Gated-UNet, Dense-UNet, U-Net++, Inf-Net (ours)) is the 48 images rather than 48 image+1600 images.