PMFSNet: Polarized Multi-scale Feature Self-attention Network for Lightweight Medical Image Segmentation
This repository provides:
- Training and testing code
- Training weights of other models on our experiments
- Pre-training weights of PMFSNet on ImageNet2012
Prepare a conda environment with Python 3.8 and clone the repository:
conda create -n py38 python=3.8
conda activate py38
git clone git@github.com:yykzjh/PMFSNet.git
cd PMFSNet
Install the dependencies listed in requirements.txt:
pip install -r requirements.txt
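Before installing, you can sanity-check that the active interpreter matches the required version; this small helper is an illustrative sketch, not part of the repository:

```python
import sys

def check_python_version(required=(3, 8)):
    """Return True if the running interpreter's major.minor matches `required`."""
    return tuple(sys.version_info[:2]) == tuple(required)

# In the py38 environment created above, check_python_version() should be True.
```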
We use three public datasets (3D-CBCT-Tooth, MMOTU, and ISIC-2018) with some preprocessing. After unzipping, place all datasets in the ./datasets directory with the following structure:
./datasets/NC-release-data-checked/
    sub_volumes/160-160-96_2048.npz
    train/
        images/
            1000889125_20171009.nii.gz
            ......
            X2360674.nii.gz
        labels/
            1000889125_20171009.nii.gz
            ......
            X2360674.nii.gz
    valid/
        images/
            1000813648_20180116.nii.gz
            ......
            X2358714.nii.gz
        labels/
            1000813648_20180116.nii.gz
            ......
            X2358714.nii.gz
./datasets/MMOTU/
    train/
        images/
            1.JPG
            ......
            1465.JPG
        labels/
            1.PNG
            ......
            1465.PNG
    valid/
        images/
            3.JPG
            ......
            1469.JPG
        labels/
            3.PNG
            ......
            1469.PNG
./datasets/ISIC-2018/
    train/
        images/
            ISIC_0000000.jpg
            ......
            ISIC_0016072.jpg
        annotations/
            ISIC_0000000_segmentation.png
            ......
            ISIC_0016072_segmentation.png
    test/
        images/
            ISIC_0000003.jpg
            ......
            ISIC_0016060.jpg
        annotations/
            ISIC_0000003_segmentation.png
            ......
            ISIC_0016060_segmentation.png
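The layout can be verified before training; the helper below is an illustrative sketch (not part of the repository), with the subdirectory names taken from the trees above:

```python
import os

# Expected split/subfolder layout per dataset, taken from the directory trees above.
EXPECTED_LAYOUT = {
    "NC-release-data-checked": [("train", "images"), ("train", "labels"),
                                ("valid", "images"), ("valid", "labels")],
    "MMOTU": [("train", "images"), ("train", "labels"),
              ("valid", "images"), ("valid", "labels")],
    "ISIC-2018": [("train", "images"), ("train", "annotations"),
                  ("test", "images"), ("test", "annotations")],
}

def check_dataset_layout(root="./datasets"):
    """Return a list of expected subdirectories that are missing under `root`."""
    missing = []
    for name, subdirs in EXPECTED_LAYOUT.items():
        for split, sub in subdirs:
            path = os.path.join(root, name, split, sub)
            if not os.path.isdir(path):
                missing.append(path)
    return missing
```

An empty return value means all expected directories are in place.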
Run the train.py script to start training. Customize the run with the following arguments:
- --dataset: dataset name; one of 3D-CBCT-Tooth, MMOTU, ISIC-2018
- --model: model name; see the implemented architectures below for details
- --pretrain_weight: path to a pre-trained weight file
- --dimension: dimensionality (2d or 3d) of the dataset images and model; PMFSNet only
- --scaling_version: scaling version of PMFSNet (BASIC, SMALL, or TINY); PMFSNet only
- --epoch: number of training epochs
Training demos:
python ./train.py --dataset 3D-CBCT-Tooth --model PMFSNet --dimension 3d --scaling_version TINY --epoch 20
python ./train.py --dataset MMOTU --model PMFSNet --pretrain_weight ./pretrain/PMFSNet2D-basic_ILSVRC2012.pth --dimension 2d --scaling_version BASIC --epoch 2000
python ./train.py --dataset ISIC-2018 --model PMFSNet --dimension 2d --scaling_version BASIC --epoch 150
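The demo commands above can also be assembled programmatically, for example when sweeping several configurations. This is a hypothetical helper (the flag names come from the argument list above):

```python
import subprocess

def build_train_cmd(dataset, model="PMFSNet", dimension=None,
                    scaling_version=None, pretrain_weight=None, epoch=None):
    """Assemble a train.py command line from the documented flags."""
    cmd = ["python", "./train.py", "--dataset", dataset, "--model", model]
    if pretrain_weight:
        cmd += ["--pretrain_weight", pretrain_weight]
    if dimension:
        cmd += ["--dimension", dimension]
    if scaling_version:
        cmd += ["--scaling_version", scaling_version]
    if epoch is not None:
        cmd += ["--epoch", str(epoch)]
    return cmd

# Example: the ISIC-2018 demo above
# subprocess.run(build_train_cmd("ISIC-2018", dimension="2d",
#                                scaling_version="BASIC", epoch=150), check=True)
```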
Run the test.py script to start evaluation. The input arguments are the same as for the training script. Testing demos:
python ./test.py --dataset 3D-CBCT-Tooth --model PMFSNet --pretrain_weight ./pretrain/PMFSNet3D-TINY_Tooth.pth --dimension 3d --scaling_version TINY
python ./test.py --dataset MMOTU --model PMFSNet --pretrain_weight ./pretrain/PMFSNet2D-BASIC_MMOTU.pth --dimension 2d --scaling_version BASIC
python ./test.py --dataset ISIC-2018 --model PMFSNet --pretrain_weight ./pretrain/PMFSNet2D-BASIC_ISIC2018.pth --dimension 2d --scaling_version BASIC
Run the inference.py script to perform inference. The additional argument --image_path specifies the path of the image to segment. Inference demos:
python ./inference.py --dataset 3D-CBCT-Tooth --model PMFSNet --pretrain_weight ./pretrain/PMFSNet3D-TINY_Tooth.pth --dimension 3d --scaling_version TINY --image_path ./images/1001250407_20190923.nii.gz
python ./inference.py --dataset MMOTU --model PMFSNet --pretrain_weight ./pretrain/PMFSNet2D-BASIC_MMOTU.pth --dimension 2d --scaling_version BASIC --image_path ./images/453.JPG
python ./inference.py --dataset ISIC-2018 --model PMFSNet --pretrain_weight ./pretrain/PMFSNet2D-BASIC_ISIC2018.pth --dimension 2d --scaling_version BASIC --image_path ./images/ISIC_0000550.jpg
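Since inference.py takes a single --image_path, inference over a whole folder can be scripted by building one command per image. This loop is an illustrative sketch (defaults taken from the ISIC-2018 demo above):

```python
import os

def infer_commands(image_dir, dataset="ISIC-2018", dimension="2d",
                   scaling_version="BASIC",
                   pretrain_weight="./pretrain/PMFSNet2D-BASIC_ISIC2018.pth",
                   exts=(".jpg", ".png")):
    """Build one inference.py command per image file in image_dir (sketch)."""
    cmds = []
    for name in sorted(os.listdir(image_dir)):
        if not name.lower().endswith(exts):
            continue  # skip non-image files
        cmds.append(["python", "./inference.py",
                     "--dataset", dataset, "--model", "PMFSNet",
                     "--pretrain_weight", pretrain_weight,
                     "--dimension", dimension,
                     "--scaling_version", scaling_version,
                     "--image_path", os.path.join(image_dir, name)])
    return cmds

# Each command can then be executed, e.g.:
# import subprocess
# for cmd in infer_commands("./images"):
#     subprocess.run(cmd, check=True)
```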
The implemented architectures, grouped by dataset:
- 3D CBCT Tooth:
- UNet3D: 3D U-Net: learning dense volumetric segmentation from sparse annotation
- VNet: V-net: Fully convolutional neural networks for volumetric medical image segmentation
- DenseVNet: Automatic multi-organ segmentation on abdominal CT with dense V-networks
- AttentionUNet3D: Attention u-net: Learning where to look for the pancreas
- DenseVoxelNet: Automatic 3D cardiovascular MR segmentation with densely-connected volumetric convnets
- MultiResUNet3D: MultiResUNet: Rethinking the U-Net architecture for multimodal biomedical image segmentation
- UNETR: Unetr: Transformers for 3d medical image segmentation
- SwinUNETR: Swin unetr: Swin transformers for semantic segmentation of brain tumors in mri images
- TransBTS: Transbts: Multimodal brain tumor segmentation using transformer
- nnFormer: nnFormer: volumetric medical image segmentation via a 3D transformer
- 3DUXNet: 3d ux-net: A large kernel volumetric convnet modernizing hierarchical transformer for medical image segmentation
- MMOTU:
- MobileNetV2: Mobilenetv2: Inverted residuals and linear bottlenecks
- PSPNet: Pyramid scene parsing network
- DANet: Dual attention network for scene segmentation
- SegFormer: SegFormer: Simple and efficient design for semantic segmentation with transformers
- UNet: U-net: Convolutional networks for biomedical image segmentation
- TransUNet: Transunet: Transformers make strong encoders for medical image segmentation
- BiSeNetV2: Bisenet v2: Bilateral network with guided aggregation for real-time semantic segmentation
- MedT: Medical transformer: Gated axial-attention for medical image segmentation
- ISIC 2018:
- UNet: U-net: Convolutional networks for biomedical image segmentation
- AttU_Net: Attention u-net: Learning where to look for the pancreas
- CANet: CA-Net: Comprehensive attention convolutional neural networks for explainable medical image segmentation
- BCDUNet: Bi-directional ConvLSTM U-Net with densley connected convolutions
- CENet: Ce-net: Context encoder network for 2d medical image segmentation
- CPFNet: CPFNet: Context pyramid fusion network for medical image segmentation
- CKDNet: Cascade knowledge diffusion network for skin lesion diagnosis and segmentation
Comparison results of different methods on the 3D CBCT tooth dataset.
Method | FLOPs(G) | Params(M) | HD(mm) | ASSD(mm) | IoU(%) | SO(%) | DSC(%) | Weights |
---|---|---|---|---|---|---|---|---|
UNet3D | 2223.03 | 16.32 | 113.79 | 22.40 | 70.62 | 70.72 | 36.67 | UNet3D_Tooth |
DenseVNet | 23.73 | 0.87 | 8.21 | 1.14 | 84.57 | 94.88 | 91.15 | DenseVNet_Tooth |
AttentionUNet3D | 2720.79 | 94.48 | 147.10 | 61.10 | 52.52 | 42.49 | 64.08 | AttentionUNet3D_Tooth |
DenseVoxelNet | 402.32 | 1.78 | 41.18 | 3.88 | 81.51 | 92.50 | 89.58 | DenseVoxelNet_Tooth |
MultiResUNet3D | 1505.38 | 17.93 | 74.06 | 8.17 | 76.19 | 81.70 | 65.45 | MultiResUNet3D_Tooth |
UNETR | 229.19 | 93.08 | 107.89 | 17.95 | 74.30 | 73.14 | 81.84 | UNETR_Tooth |
SwinUNETR | 912.35 | 62.19 | 82.71 | 7.50 | 83.10 | 86.80 | 89.74 | SwinUNETR_Tooth |
TransBTS | 306.80 | 33.15 | 29.03 | 4.10 | 82.94 | 90.68 | 39.32 | TransBTS_Tooth |
nnFormer | 583.49 | 149.25 | 51.28 | 5.08 | 83.54 | 90.89 | 90.66 | nnFormer_Tooth |
3D UX-Net | 1754.79 | 53.01 | 108.52 | 19.69 | 75.40 | 73.48 | 84.89 | 3DUXNet_Tooth |
PMFSNet3D-TINY(Ours) | 15.14 | 0.63 | 5.57 | 0.79 | 84.68 | 95.10 | 91.30 | |
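As a reference for the IoU(%) and DSC(%) columns, the two overlap metrics can be computed from binary masks as below (HD and ASSD require surface-distance tooling and are omitted). This is an illustrative implementation, not the repository's evaluation code:

```python
def iou_and_dsc(pred, target):
    """Compute IoU and Dice coefficient (DSC) for flat binary masks (0/1 sequences)."""
    assert len(pred) == len(target)
    intersection = sum(p * t for p, t in zip(pred, target))
    pred_sum, target_sum = sum(pred), sum(target)
    union = pred_sum + target_sum - intersection
    iou = intersection / union if union else 1.0
    dsc = 2 * intersection / (pred_sum + target_sum) if (pred_sum + target_sum) else 1.0
    return iou, dsc
```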
Pre-trained weights of PMFSNet2D-BASIC, PMFSNet2D-SMALL, and PMFSNet2D-TINY on the ImageNet2012 dataset.
Method | Weights |
---|---|
PMFSNet2D-BASIC | PMFSNet2D-BASIC_ILSVRC2012 |
PMFSNet2D-SMALL | PMFSNet2D-SMALL_ILSVRC2012 |
PMFSNet2D-TINY | PMFSNet2D-TINY_ILSVRC2012 |
Comparison results of different methods on the MMOTU dataset.
Method | FLOPs(G) | Params(M) | IoU(%) | mIoU(%) | Weights |
---|---|---|---|---|---|
PSPNet | 38.71 | 53.32 | 82.01 | 89.41 | |
DANet | 10.95 | 47.44 | 82.20 | 89.53 | |
SegFormer | 2.52 | 7.72 | 82.46 | 89.88 | |
U-Net | 41.90 | 31.04 | 79.91 | 86.80 | |
TransUNet | 24.61 | 105.28 | 81.31 | 89.01 | |
BiseNetV2 | 3.40 | 5.19 | 79.37 | 86.13 | |
PMFSNet2D-BASIC(Ours) | 2.21 | 0.99 | 82.02 | 89.36 | |
Comparison results of different methods on the ISIC 2018 dataset.
Method | FLOPs(G) | Params(M) | IoU(%) | DSC(%) | ACC(%) | Weights |
---|---|---|---|---|---|---|
U-Net | 41.93 | 31.04 | 76.77 | 86.55 | 95.00 | UNet_ISIC2018 |
AttU-Net | 51.07 | 34.88 | 78.19 | 87.54 | 95.33 | AttU_Net_ISIC2018 |
CA-Net | 4.62 | 2.79 | 68.82 | 80.96 | 92.96 | CANet_ISIC2018 |
BCDU-Net | 31.96 | 18.45 | 76.46 | 86.26 | 95.19 | BCDUNet_ISIC2018 |
CE-Net | 6.83 | 29.00 | 78.05 | 87.47 | 95.40 | CENet_ISIC2018 |
CPF-Net | 6.18 | 43.27 | 78.47 | 87.70 | 95.52 | CPFNet_ISIC2018 |
CKDNet | 12.69 | 59.34 | 77.89 | 87.35 | 95.27 | CKDNet_ISIC2018 |
PMFSNet2D-BASIC(Ours) | 2.21 | 0.99 | 78.82 | 87.92 | 95.59 | |
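The FLOPs(G) and Params(M) columns are typically measured with a profiler; for intuition, the cost of a single 2D convolution can be derived by hand. These formulas are an illustration of how such counts arise, not the profiling code used for the tables:

```python
def conv2d_params(c_in, c_out, k, bias=True):
    """Parameter count of a Conv2d layer: one k x k kernel per (in, out) channel pair."""
    return c_out * (c_in * k * k + (1 if bias else 0))

def conv2d_flops(c_in, c_out, k, h_out, w_out):
    """Multiply-accumulate count of a Conv2d layer over an h_out x w_out output map."""
    return c_out * h_out * w_out * c_in * k * k

# e.g. a 3x3 convolution from 64 to 128 channels
n_params = conv2d_params(64, 128, 3)  # 128 * (64 * 9 + 1)
```

Summing these per-layer counts over a network (and dividing by 1e9 and 1e6) yields the FLOPs(G) and Params(M) figures reported above.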