Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Feb 28;10(5):e26775.
doi: 10.1016/j.heliyon.2024.e26775. eCollection 2024 Mar 15.

SCANeXt: Enhancing 3D medical image segmentation with dual attention network and depth-wise convolution

Affiliations

SCANeXt: Enhancing 3D medical image segmentation with dual attention network and depth-wise convolution

Yajun Liu et al. Heliyon. .

Abstract

Existing approaches to 3D medical image segmentation can be generally categorized into convolution-based or transformer-based methods. While convolutional neural networks (CNNs) demonstrate proficiency in extracting local features, they encounter challenges in capturing global representations. In contrast, the consecutive self-attention modules present in vision transformers excel at capturing long-range dependencies and achieving an expanded receptive field. In this paper, we propose a novel approach, termed SCANeXt, for 3D medical image segmentation. Our method combines the strengths of dual attention (Spatial and Channel Attention) and ConvNeXt to enhance representation learning for 3D medical images. In particular, we propose a novel self-attention mechanism crafted to encompass spatial and channel relationships throughout the entire feature dimension. To further extract multiscale features, we introduce a depth-wise convolution block inspired by ConvNeXt after the dual attention block. Extensive evaluations on three benchmark datasets, namely Synapse, BraTS, and ACDC, demonstrate the effectiveness of our proposed method in terms of accuracy. Our SCANeXt model achieves a state-of-the-art result with a Dice Similarity Score of 95.18% on the ACDC dataset, significantly outperforming current methods.

Keywords: 3D medical image segmentation; Depth-wise convolution; Dual attention; InceptionNeXt; Swin transformer.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Figure 1
Figure 1
Overview of our SCANeXt structure.
Figure 2
Figure 2
Components of the spatial-wise transformer.
Figure 3
Figure 3
Components of the channel-wise transformer.
Figure 4
Figure 4
Components of the depthwise convolution module.
Figure 5
Figure 5
Qualitative comparison of the segmentation performance for the Synapse dataset.
Figure 6
Figure 6
Qualitative comparison of the segmentation performance for the BraTS dataset.
Figure 7
Figure 7
Qualitative comparison of the segmentation performance for the ACDC dataset.
Figure 8
Figure 8
The model size vs. DSC is shown in this plot. Circle size indicates computational complexity by FLOPs.

Similar articles

Cited by

References

    1. Dosovitskiy Alexey, Beyer Lucas, Kolesnikov Alexander, Weissenborn Dirk, Zhai Xiaohua, Unterthiner Thomas, Dehghani Mostafa, Minderer Matthias, Heigold Georg, Gelly Sylvain, et al. An image is worth 16×16 words: transformers for image recognition at scale. 2020. arXiv:2010.11929 arXiv preprint.
    1. Chen Jieneng, Lu Yongyi, Yu Qihang, Luo Xiangde, Adeli Ehsan, Wang Le Lu Yan, Yuille Alan L., Zhou Yuyin. Transunet: transformers make strong encoders for medical image segmentation. 2021. arXiv:2102.04306 arXiv preprint.
    1. Zhang Zhuangzhuang, Zhang Weixiong. Pyramid medical transformer for medical image segmentation. 2021. arXiv:2104.14702 arXiv preprint.
    1. Cao Hu, Wang Yueyue, Chen Joy, Jiang Dongsheng, Zhang Xiaopeng, Tian Qi, Wang Manning. European Conference on Computer Vision. Springer; 2022. Swin-Unet: Unet-like pure transformer for medical image segmentation; pp. 205–218.
    1. Lin Ailiang, Chen Bingzhi, Xu Jiayu, Zhang Zheng, Lu Guangming, Zhang David. DS- TransUNet: dual swin transformer U-Net for medical image segmentation. IEEE Trans. Instrum. Meas. 2022;71:1–15.

LinkOut - more resources