COVID-19 Few-Shot Segmentation

Few-shot learning has gained ample momentum in the past few years, thanks to its adaptability to small labeled datasets. This is especially relevant in medical imaging, where datasets often have limited high-quality annotations, like in CT scans of COVID-19 infection.

This project aims to provide an intuitive framework, with detailed documentation and easily usable code, to show the benefits of using few-shot learning for semantic segmentation of COVID-19 lungs infection. Also, by the very nature of few-shot learning, the framework is readily applicable to automatic detection/segmentation of other types of lung infection without need for re-training on more specific datasets.

The few-shot models used in this project are based on PANet, ADNet and ALPNet, which are popular in the medical image segmentation landscape. Have a look at their papers!

Thanks for your interest. 🙂

Getting Started

Clone/fork the repository.
Install the required dependencies, listed in requirements.txt
Change the paths in the script files ./experiments/<...>.sh according to your root directory.

Data Preparation

Download the COVID-19-CT-Seg dataset.
Put the CT scans under ./data/COVID-19-CT-Seg/Data and the segmentation masks under ./data/COVID-19-CT-Seg/<Infection_Mask/Lung_Mask/Lung_and_Infection_Mask>
Run the data pre-processing notebook

☝ This pre-processing pipeline can be used for other medical imaging datasets with minor adjustments, feel free to experiment!

Basic Usage

Adjust the experimental settings in ./configs/config_<panet/adnet/alpnet>.yaml
Each model is trained and evaluated in a five-fold cross-validation fashion. For each fold:
1. Train on training split by running ./experiments/train_<panet/adnet/alpnet>.sh
2. Evaluate on validation split by running ./experiments/validate_<panet/adnet/alpnet>.sh
3. Remember to change FOLD in the scripts after each cycle.

☝ If you have a computer cluster or a machine with multiple GPUs, you can try distributed training. Just change the last command in the train scripts, e.g.

torchrun --nproc_per_node=2  distributed_train.py     # Using 2 GPUs

to run an equivalent training schedule but with distributed settings.

Do note that this feature is still under development so it might not work on every machine.

Results

You can use this noteboook to visualise the results of your experiments
The table below shows a summary of the evaluation scores obtained by training for 50,000 iterations with a ResNet50 backbone.

PANet

ADNet

ALPNet

Left Lung

Dice

Precision

Recall

96.47 ± 0.99

97.03 ± 0.82

95.96 ± 2.18

92.82 ± 1.00

88.03 ± 2.12

98.20 ± 1.12

95.42 ± 1.81

93.56 ± 1.73

97.41 ± 2.76

Right Lung

Dice

Precision

Recall

96.92 ± 0.65

96.79 ± 0.82

97.07 ± 1.16

93.39 ± 0.93

89.45 ± 1.08

97.71 ± 1.47

95.58 ± 1.84

92.49 ± 3.52

98.97 ± 0.77

Infection

Dice

Precision

Recall

47.65 ± 16.36

45.95 ± 15.41

73.64 ± 9.41

34.13 ± 15.08

27.28 ± 13.74

84.57 ± 8.14

64.62 ± 14.98

61.91 ± 14.58

77.15 ± 7.17

fbono97/few-shot-covid19-seg

COVID-19 Few-Shot Segmentation

Getting Started

Data Preparation

Basic Usage

Results