DSGA-Net: Deeply separable gated transformer and attention strategy for medical image segmentation network
- PMID: 38559323
- PMCID: PMC7615776
- DOI: 10.1016/j.jksuci.2023.04.006
DSGA-Net: Deeply separable gated transformer and attention strategy for medical image segmentation network
Abstract
To address the problems of under-segmentation and over-segmentation of small organs in medical image segmentation. We present a novel medical image segmentation network model with Depth Separable Gating Transformer and a Three-branch Attention module (DSGA-Net). Firstly, the model adds a Depth Separable Gated Visual Transformer (DSG-ViT) module into its Encoder to enhance (i) the contextual links among global, local, and channels and (ii) the sensitivity to location information. Secondly, a Mixed Three-branch Attention (MTA) module is proposed to increase the number of features in the up-sampling process. Meanwhile, the loss of feature information is reduced when restoring the feature image to the original image size. By validating Synapse, BraTs2020, and ACDC public datasets, the Dice Similarity Coefficient (DSC) of the results of DSGA-Net reached 81.24%,85.82%, and 91.34%, respectively. Moreover, the Hausdorff Score (HD) decreased to 20.91% and 5.27% on the Synapse and BraTs2020. There are 10.78% and 0.69% decreases compared to the Baseline TransUNet. The experimental results indicate that DSGA-Net achieves better segmentation than most advanced methods.
Keywords: Depth separable; Gated attention mechanism; Medical image segmentation; Transformer.
Conflict of interest statement
Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Figures









Similar articles
-
[Multi-scale medical image segmentation based on pixel encoding and spatial attention mechanism].Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2024 Jun 25;41(3):511-519. doi: 10.7507/1001-5515.202310001. Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2024. PMID: 38932537 Free PMC article. Chinese.
-
DAWTran: dynamic adaptive windowing transformer network for pneumothorax segmentation with implicit feature alignment.Phys Med Biol. 2023 Aug 18;68(17). doi: 10.1088/1361-6560/aced79. Phys Med Biol. 2023. PMID: 37541224
-
BiFTransNet: A unified and simultaneous segmentation network for gastrointestinal images of CT & MRI.Comput Biol Med. 2023 Oct;165:107326. doi: 10.1016/j.compbiomed.2023.107326. Epub 2023 Aug 8. Comput Biol Med. 2023. PMID: 37619324
-
Sparse Dynamic Volume TransUNet with multi-level edge fusion for brain tumor segmentation.Comput Biol Med. 2024 Apr;172:108284. doi: 10.1016/j.compbiomed.2024.108284. Epub 2024 Mar 15. Comput Biol Med. 2024. PMID: 38503086 Review.
-
A lightweight PCT-Net for segmenting neural fibers in low-quality CCM images.Comput Biol Med. 2025 May;190:110051. doi: 10.1016/j.compbiomed.2025.110051. Epub 2025 Mar 22. Comput Biol Med. 2025. PMID: 40121803 Review.
Cited by
-
An improved reversible watermarking scheme using embedding optimization and quaternion moments.Sci Rep. 2024 Aug 9;14(1):18485. doi: 10.1038/s41598-024-69511-3. Sci Rep. 2024. PMID: 39122777 Free PMC article.
-
Medical image segmentation model based on local enhancement driven global optimization.Sci Rep. 2025 May 25;15(1):18281. doi: 10.1038/s41598-025-02393-1. Sci Rep. 2025. PMID: 40414982 Free PMC article.
-
Application of visual transformer in renal image analysis.Biomed Eng Online. 2024 Mar 5;23(1):27. doi: 10.1186/s12938-024-01209-z. Biomed Eng Online. 2024. PMID: 38439100 Free PMC article. Review.
References
-
- Bitter C, Elizondo DA, Yang Y. Natural language processing: a prolog perspective. Artif Intell Rev. 2010;33(1-2):151.
-
- Cao H, Wang Y, Chen J, Jiang D, Zhang X, Tian Q, Wang M. Swin-unet: Unet-like pure transformer for medical image segmentation; Computer Vision–ECCV 2022 Workshops: Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part III; Cham. 2023. Feb, pp. 205–218.
-
- Chen J, Lu Y, Yu Q, Luo X, Adeli E, Wang Y, et al. Zhou Y. Transunet: Transformers make strong encoders for medical image segmentation. arXiv preprint. 2021:arXiv:2102.04306
-
- Chen B, Liu Y, Zhang Z, Lu G, Kong AWK. Transattunet: Multi-level attention-guided u-net with transformer for medical image segmentation. arXiv preprint. 2021:arXiv:2107.05274
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous