Enhancing medical image segmentation with a multi-transformer U-Net
- PMID: 38435997
- PMCID: PMC10909362
- DOI: 10.7717/peerj.17005
Abstract
Various segmentation networks based on the Swin Transformer have shown promise in medical segmentation tasks. Nonetheless, challenges such as lower accuracy and slower training convergence have persisted. To tackle these issues, we introduce a novel approach that combines the Swin Transformer and the Deformable Transformer to enhance overall model performance. We leverage the Swin Transformer's window attention mechanism to capture local feature information and employ the Deformable Transformer to adjust sampling positions dynamically, which accelerates convergence and aligns the model more closely with object shapes and sizes. By combining both Transformer modules and incorporating additional skip connections to minimize information loss, our proposed model segments CT or X-ray lung images rapidly and accurately. Experimental results demonstrate the effectiveness of our model. It outperforms Swin Unet, which relies on the Swin Transformer alone, and converges more rapidly under identical conditions, yielding accuracy improvements of 0.7% (resulting in 88.18%) and 2.7% (resulting in 98.01%) on the COVID-19 CT scan lesion segmentation dataset and the Chest X-ray Masks and Labels dataset, respectively. This advancement has the potential to aid medical practitioners in early diagnosis and treatment decision-making.
Keywords: CT or X-ray lung images; Medical image segmentation; Multi-transformer; Unet.
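The two mechanisms named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses plain Python lists on a single-channel grid, the function names (`window_partition`, `bilinear_sample`, `deformable_sample`) are hypothetical, and the offsets and weights are passed in by hand, whereas in a deformable attention layer they would typically be predicted from the query features. The first function shows the non-overlapping window split within which window attention operates; the other two show how a value can be read at a reference point shifted by fractional offsets.

```python
import math


def window_partition(feature_map, window_size):
    """Split an H x W grid (list of rows) into non-overlapping square windows."""
    height, width = len(feature_map), len(feature_map[0])
    if height % window_size or width % window_size:
        raise ValueError("H and W must be divisible by window_size")
    windows = []
    for top in range(0, height, window_size):
        for left in range(0, width, window_size):
            windows.append([row[left:left + window_size]
                            for row in feature_map[top:top + window_size]])
    return windows


def bilinear_sample(feature_map, y, x):
    """Read the grid at a fractional (y, x); points outside are clamped to the border."""
    height, width = len(feature_map), len(feature_map[0])
    y = min(max(y, 0.0), height - 1.0)
    x = min(max(x, 0.0), width - 1.0)
    y0, x0 = int(math.floor(y)), int(math.floor(x))
    y1, x1 = min(y0 + 1, height - 1), min(x0 + 1, width - 1)
    wy, wx = y - y0, x - x0
    top = (1 - wx) * feature_map[y0][x0] + wx * feature_map[y0][x1]
    bottom = (1 - wx) * feature_map[y1][x0] + wx * feature_map[y1][x1]
    return (1 - wy) * top + wy * bottom


def deformable_sample(feature_map, reference, offsets, weights):
    """Weighted sum of values sampled at reference + offset.

    Assumption: offsets and weights are given; a trained layer would predict them.
    """
    ry, rx = reference
    return sum(w * bilinear_sample(feature_map, ry + dy, rx + dx)
               for (dy, dx), w in zip(offsets, weights))


# Toy 4 x 4 feature map with values 0..15 in row-major order.
grid = [[float(r * 4 + c) for c in range(4)] for r in range(4)]
windows = window_partition(grid, 2)
sampled = deformable_sample(grid, (1, 1), [(0.0, 0.0), (0.5, 0.5)], [0.5, 0.5])
```

On the toy grid, `windows` holds four 2 x 2 windows, and `sampled` blends the value at the reference point with one read half a pixel away. Because the sampling locations are not tied to a fixed window, they can follow the outline of the object being segmented.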
©2024 Dan et al.
Conflict of interest statement
The authors declare there are no competing interests.