Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2025 Aug 2;17(15):2558.
doi: 10.3390/cancers17152558.

Application of Artificial Intelligence in Pancreatic Cyst Management: A Systematic Review

Affiliations
Review

Application of Artificial Intelligence in Pancreatic Cyst Management: A Systematic Review

Donghyun Lee et al. Cancers (Basel). .

Abstract

Background: Pancreatic cystic lesions (PCLs), including intraductal papillary mucinous neoplasms (IPMNs) and mucinous cystic neoplasms (MCNs), pose a diagnostic challenge due to their variable malignant potential. Current guidelines, such as Fukuoka and American Gastroenterological Association (AGA), have moderate predictive accuracy and may lead to overtreatment or missed malignancies. Artificial intelligence (AI), incorporating machine learning (ML) and deep learning (DL), offers the potential to improve risk stratification, diagnosis, and management of PCLs by integrating clinical, radiological, and molecular data. This is the first systematic review to evaluate the application, performance, and clinical utility of AI models in the diagnosis, classification, prognosis, and management of pancreatic cysts.

Methods: A systematic review was conducted in accordance with PRISMA guidelines and registered on PROSPERO (CRD420251008593). Databases searched included PubMed, EMBASE, Scopus, and Cochrane Library up to March 2025. The inclusion criteria encompassed original studies employing AI, ML, or DL in human subjects with pancreatic cysts, evaluating diagnostic, classification, or prognostic outcomes. Data were extracted on the study design, imaging modality, model type, sample size, performance metrics (accuracy, sensitivity, specificity, and area under the curve (AUC)), and validation methods. Study quality and bias were assessed using the PROBAST and adherence to TRIPOD reporting guidelines.

Results: From 847 records, 31 studies met the inclusion criteria. Most were retrospective observational (n = 27, 87%) and focused on preoperative diagnostic applications (n = 30, 97%), with only one addressing prognosis. Imaging modalities included Computed Tomography (CT) (48%), endoscopic ultrasound (EUS) (26%), and Magnetic Resonance Imaging (MRI) (9.7%). Neural networks, particularly convolutional neural networks (CNNs), were the most common AI models (n = 16), followed by logistic regression (n = 4) and support vector machines (n = 3). The median reported AUC across studies was 0.912, with 55% of models achieving AUC ≥ 0.80. The models outperformed clinicians or existing guidelines in 11 studies. IPMN stratification and subtype classification were common focuses, with CNN-based EUS models achieving accuracies of up to 99.6%. Only 10 studies (32%) performed external validation. The risk of bias was high in 93.5% of studies, and TRIPOD adherence averaged 48%.

Conclusions: AI demonstrates strong potential in improving the diagnosis and risk stratification of pancreatic cysts, with several models outperforming current clinical guidelines and human readers. However, widespread clinical adoption is hindered by high risk of bias, lack of external validation, and limited interpretability of complex models. Future work should prioritise multicentre prospective studies, standardised model reporting, and development of interpretable, externally validated tools to support clinical integration.

Keywords: IPMN; MCN; artificial intelligence; benign; diagnosis; machine learning; malignant; pancreatic cyst; prognosis.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
PRISMA flow diagram illustrating the study selection process for this systematic review. The flow outlines the number of records identified through database searches (*), the number of duplicates removed, studies screened, full-text articles assessed for eligibility, and studies included in the final review. There were 503 studies excluded (**), mostly because the studies did not address the use of artificial intelligence or did not focus on pancreatic cysts specifically. * represent the total number of record identified. ** represents the studies excluded in the study.
Figure 2
Figure 2
(a) Sub-domains of included studies in the review, with majority of studies on diagnostic prediction models; (b) characteristics of included studies according to design; and (c) machine learning models employed/developed.
Figure 3
Figure 3
Range and median of AUC values across studies grouped by diagnostic domain (e.g., diagnosis, segmentation, risk stratification, management, and prognosis). Each bar displays the minimum and maximum AUC values reported in the studies within that domain, with the central marker indicating the median AUC.
Figure 4
Figure 4
Risk of bias of included studies using Prediction Model Risk of bias assessment tool (PROBAST).
Figure 5
Figure 5
Reporting adherence of included studies to transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) tool.

References

    1. Zhang H., Qie Y. Applying Deep Learning to Medical Imaging: A Review. Appl. Sci. 2023;13:10521. doi: 10.3390/app131810521. - DOI
    1. Khosravi M., Zare Z., Mojtabaeian S.M., Izadi R. Artificial Intelligence and Decision-Making in Healthcare: A Thematic Analysis of a Systematic Review of Reviews. Health Serv. Res. Manag. Epidemiol. 2024;11:23333928241234863. doi: 10.1177/23333928241234863. - DOI - PMC - PubMed
    1. Rajpurkar P., Chen E., Banerjee O., Topol E.J. AI in health and medicine. Nat. Med. 2022;28:31–38. doi: 10.1038/s41591-021-01614-0. - DOI - PubMed
    1. Javaheri H., Ghamarnejad O., Widyaningsih R., Bade R., Lukowicz P., Karolus J., Stavrou G.A. Enhancing Perioperative Outcomes of Pancreatic Surgery with Wearable Augmented Reality Assistance System: A Matched-Pair Analysis. Ann. Surg. Open. 2024;5:e516. doi: 10.1097/AS9.0000000000000516. - DOI - PMC - PubMed
    1. Schlanger D., Graur F., Popa C., Mois E., Al Hajjar N. The role of artificial intelligence in pancreatic surgery: A systematic review. Updates Surg. 2022;74:417–429. doi: 10.1007/s13304-022-01255-z. - DOI - PubMed

LinkOut - more resources