/DA-TransUnet

DA-TransUNet: Combining Dual Attention of Position and Channel with Transformer U-net for Medical Image Segmentation

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

DA-TransUnet

DA-TransUNet: Integrating Positional and Channel Dual Attention with Transformer-Based U-Net for Enhanced Medical Image Segmentation (https://arxiv.org/abs/2310.12570)

1.Prepare pre-trained ViT models

  • Get models and training parameters in this link: R50-ViT-B_16,At the same time, the parameter file (.pth) in the paper is also stored.(You can download and compress it, put it into the model file and rename it TU_Synapse224, and then use the test code (python test.py --dataset Synapse --vit_name R50-ViT-B_16) to get the test results.)

2.Prepare data

Please use the preprocessed data for research purposes.

3.Environment

Please prepare an environment with python=3.7(conda create -n envir python=3.7.12), and then use the command "pip install -r requirements.txt" for the dependencies.

4.Train/Test

Run the train script on synapse dataset. The batch size can be reduced to 12 or 16 to save memory(please also decrease the base_lr linearly), and both can reach similar performance.

CUDA_VISIBLE_DEVICES=0 python train.py --dataset Synapse --vit_name R50-ViT-B_16
  • Run the test script on synapse dataset. It supports testing for both 2D images and 3D volumes.
python test.py --dataset Synapse --vit_name R50-ViT-B_16

Reference

Citation

@article{sun2024transunet, title={DA-TransUNet: integrating spatial and channel dual attention with transformer U-net for medical image segmentation}, author={Sun, Guanqun and Pan, Yizhi and Kong, Weikun and Xu, Zichang and Ma, Jianhua and Racharak, Teeradaj and Nguyen, Le-Minh and Xin, Junyi}, journal={Frontiers in Bioengineering and Biotechnology}, volume={12}, pages={1398237}, year={2024}, publisher={Frontiers Media SA} }