Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Dec 5;14(12):9620-9652.
doi: 10.21037/qims-24-723. Epub 2024 Nov 29.

A literature review of artificial intelligence (AI) for medical image segmentation: from AI and explainable AI to trustworthy AI

Affiliations
Review

A literature review of artificial intelligence (AI) for medical image segmentation: from AI and explainable AI to trustworthy AI

Zixuan Teng et al. Quant Imaging Med Surg. .

Abstract

Background and objective: Medical image segmentation is a vital aspect of medical image processing, allowing healthcare professionals to conduct precise and comprehensive lesion analyses. Traditional segmentation methods are often labor intensive and influenced by the subjectivity of individual physicians. The advent of artificial intelligence (AI) has transformed this field by reducing the workload of physicians, and improving the accuracy and efficiency of disease diagnosis. However, conventional AI techniques are not without challenges. Issues such as inexplicability, uncontrollable decision-making processes, and unpredictability can lead to confusion and uncertainty in clinical decision-making. This review explores the evolution of AI in medical image segmentation, focusing on the development and impact of explainable AI (XAI) and trustworthy AI (TAI).

Methods: This review synthesizes existing literature on traditional segmentation methods, AI-based approaches, and the transition from conventional AI to XAI and TAI. The review highlights the key principles and advancements in XAI that aim to address the shortcomings of conventional AI by enhancing transparency and interpretability. It further examines how TAI builds on XAI to improve the reliability, safety, and accountability of AI systems in medical image segmentation.

Key content and findings: XAI has emerged as a solution to the limitations of conventional AI by providing greater transparency and interpretability, allowing healthcare professionals to better understand and trust AI-driven decisions. However, XAI itself faces challenges, including those related to safety, robustness, and value alignment. TAI has been developed to overcome these challenges, offering a more reliable framework for AI applications in medical image segmentation. By integrating the principles of XAI with enhanced safety and dependability, TAI addresses the critical need for TAI systems in clinical settings.

Conclusions: TAI presents a promising future for medical image segmentation, combining the benefits of AI with improved reliability and safety. Thus, TAI is a more viable and dependable option for healthcare applications, and could ultimately lead to better clinical outcomes for patients, and advance the field of medical image processing.

Keywords: Medical image segmentation; artificial intelligence (AI); explainable AI (XAI); trustworthy AI (TAI).

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://qims.amegroups.com/article/view/10.21037/qims-24-723/coif). The authors have no conflicts of interest to declare.

Figures

Figure 1
Figure 1
Medical image segmentation based on AI. AI, artificial intelligence.
Figure 2
Figure 2
Some AI models: (A) an overall structure diagram of StillGAN; (B) an overview of TransUNet; the actual architecture is 3D. A 2D image is used here for the purpose of simplicity. AI, artificial intelligence; 3D, three-dimensional; 2D, two-dimensional; Conv, convolution; BN, batch normalization.
Figure 3
Figure 3
Some variations on existing structures: (A) Block of the Sharp U-Net; (B) testing pipeline of the SAM. W, width; M, mask; H, height; GT, ground truth; SAM, segmentation arbitrary model.
Figure 4
Figure 4
Medical image segmentation based on XAI. AI, artificial intelligence; XAI, explainable artificial intelligence.
Figure 5
Figure 5
Classification of XAI. XAI, explainable artificial intelligence.
Figure 6
Figure 6
LIME. (A) LIME interpretation of individual prediction process; (B) interpretations generated by LIME. LIME, local interpretable model-agnostic explanations.
Figure 7
Figure 7
Saliency maps. (A) Overall architecture of saliency maps. (B) Comparison of saliency maps represented as heat maps: (a) original image with polyp mask superimposed; (b) average expert; (c) average novice; (d) WM-DOVA energy map; (e) Bruce and Tsotsos model; (f) GBVS; (g) Itti-Koch model; (h) Seo model; (i) SIM; (j) SUN. High saliency areas correspond to hot regions in the image. WM-DOVA, Window Median Depth of Valleys; GBVS, Graph-Based Visual Saliency; SIM, Saliency Image Model; SUN, Saliency Understanding Network.
Figure 8
Figure 8
LRP. (A) Architecture of the model proposed by Aham et al. (B) The pipeline of the method, including the LRP module, the LRS module, and the classification module. LRP, layer-wise relevance propagation; LRS, lesion region segmentation.
Figure 9
Figure 9
IG. (A) Prediction of grade attribution of diabetic retinopathy from retinal fundus images. The original image is displayed on the left, and the properties (overlaid in grayscale on the original image) are displayed on the right. In the original images, the study annotated the lesions visible to humans and confirmed that the attributes pointed to them. (B) Comparison of IG and guided IG in the diagnosis of diabetic retinopathy. IG, integrated gradient; DR, diabetic retinopathy.
Figure 10
Figure 10
Counterfactual interpretation. (A) An overview of TraCE applied to the introspective analysis of CXR-based predictive models; (B) attribution framework. CXR, chest X-ray; H, height; W, width; D, depth; Conv., convolution.
Figure 11
Figure 11
Grad-CAM. (A) Architecture of Grad-CAM. (B) Architecture of C-CAM. Grad-CAM, gradient-weighted class activation mapping, C-CAM, causal class activation mapping; CNN, convolutional neural network; RNN, recurrent neural network; LSTM, long short-term memory; FC, fully connected; GS, global structure.
Figure 12
Figure 12
Benefits of XAI on medical image segmentation. XAI, explainable artificial intelligence; AI, artificial intelligence.
Figure 13
Figure 13
Medical image segmentation based on TAI. XAI, explainable artificial intelligence; TAI, trustworthy artificial intelligence.
Figure 14
Figure 14
Characteristics of TAI. TAI, trustworthy artificial intelligence.

Similar articles

Cited by

References

    1. Lee LK, Liew SC, Thong WJ. A review of image segmentation methodologies in medical image. Adv Comput Commun Eng Technol 2015;1069-80.
    1. Hesamian MH, Jia W, He X, Kennedy P. Deep Learning Techniques for Medical Image Segmentation: Achievements and Challenges. J Digit Imaging 2019;32:582-96. 10.1007/s10278-019-00227-x - DOI - PMC - PubMed
    1. Chen A, Zhu L, Zang H, Ding Z, Zhan S. Computer-aided diagnosis and decision-making system for medical data analysis: a case study on prostate MR images. J Manag Sci Eng 2019;4:266-78.
    1. Abdel-Basset M, Chang V, Mohamed R. A novel equilibrium optimization algorithm for multi-thresholding image segmentation problems. Neural Comput Appl 2021;33:10685-718.
    1. Yang D, Xu Z, Li W, Myronenko A, Roth HR, Harmon S, Xu S, Turkbey B, Turkbey E, Wang X, Zhu W, Carrafiello G, Patella F, Cariati M, Obinata H, Mori H, Tamura K, An P, Wood BJ, Xu D. Federated semi-supervised learning for COVID region segmentation in chest CT using multi-national data from China, Italy, Japan. Med Image Anal 2021;70:101992. 10.1016/j.media.2021.101992 - DOI - PMC - PubMed

LinkOut - more resources