Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Meta-Analysis
. 2025 Jul 31:27:e72420.
doi: 10.2196/72420.

Effectiveness of Radiomics-Based Machine Learning Models in Differentiating Pancreatitis and Pancreatic Ductal Adenocarcinoma: Systematic Review and Meta-Analysis

Affiliations
Meta-Analysis

Effectiveness of Radiomics-Based Machine Learning Models in Differentiating Pancreatitis and Pancreatic Ductal Adenocarcinoma: Systematic Review and Meta-Analysis

Lechang Zhang et al. J Med Internet Res. .

Abstract

Background: Pancreatic ductal adenocarcinoma (PDAC) and mass-forming pancreatitis (MFP) share similar clinical, laboratory, and imaging features, making accurate diagnosis challenging. Nevertheless, PDAC is highly malignant with a poor prognosis, whereas MFP is an inflammatory condition typically responding well to medical or interventional therapies. Some investigators have explored radiomics-based machine learning (ML) models for distinguishing PDAC from MFP. However, systematic evidence supporting the feasibility of these models is insufficient, presenting a notable challenge for clinical application.

Objective: This study intended to review the diagnostic performance of radiomics-based ML models in differentiating PDAC from MFP, summarize the methodological quality of the included studies, and provide evidence-based guidance for optimizing radiomics-based ML models and advancing their clinical use.

Methods: PubMed, Embase, Cochrane, and Web of Science were searched for relevant studies up to June 29, 2024. Eligible studies comprised English cohort, case-control, or cross-sectional designs that applied fully developed radiomics-based ML models-including traditional and deep radiomics-to differentiate PDAC from MFP, while also reporting their diagnostic performance. Studies without full text, limited to image segmentation, or insufficient outcome metrics were excluded. Methodological quality was appraised by means of the radiomics quality score. Since the limited applicability of QUADAS-2 in radiomics-based ML studies, the risk of bias was not formally assessed. Pooled sensitivity, specificity, area under the curve of summary receiver operating characteristics (SROC), likelihood ratios, and diagnostic odds ratio were estimated through a bivariate mixed-effects model. Results were presented with forest plots, SROC curves, and Fagan's nomogram. Subgroup analysis was performed to appraise the diagnostic performance of radiomics-based ML models across various imaging modalities, including computed tomography (CT), magnetic resonance imaging, positron emission tomography-CT, and endoscopic ultrasound.

Results: This meta-analysis included 24 studies with 14,406 cases, including 7635 PDAC cases. All studies adopted a case-control design, with 5 conducted across multiple centers. Most studies used CT as the primary imaging modality. The radiomics quality score scores ranged from 5 points (14%) to 17 points (47%), with an average score of 9 (25%). The radiomics-based ML models demonstrated high diagnostic performance. Based on the independent validation sets, the pooled sensitivity, specificity, area under the curve of SROC, positive likelihood ratio, negative likelihood ratio, and diagnostic odds ratio were 0.92 (95% CI 0.91-0.94), 0.90 (95% CI 0.85-0.94), 0.94 (95% CI 0.74-0.99), 9.3 (95% CI 6.0-14.2), 0.08 (95% CI 0.07-0.11), and 110 (95% CI 62-194), respectively.

Conclusions: Radiomics-based ML models demonstrate high diagnostic accuracy in differentiating PDAC from MFP, underscoring their potential as noninvasive tools for clinical decision-making. Nonetheless, the overall methodological quality was moderate due to limitations in external validation, standardized protocols, and reproducibility. These findings support the promise of radiomics in clinical diagnostics while highlighting the need for more rigorous, multicenter research to enhance model generalizability and clinical applicability.

Keywords: artificial intelligence; differential diagnosis; imaging modalities; meta-analysis; systematic review.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

Figure 1.
Figure 1.. Literature screening process.
Figure 2.
Figure 2.. Meta-analysis results of radiomics-based machine learning models in differentiating pancreatic ductal adenocarcinoma from mass-forming pancreatitis within the training set. Data were derived from studies [25262830323540]. (A) Coupled forest plots of pooled sensitivity and specificity. (B) Summary receiver operating characteristic curve. (C) Deeks’ funnel plot. (D) Fagan nomogram. AUC: area under the curve; ESS: effective sample size; LR: logistic regression; Prob: probability; Sens: sensitivity; Spec: specificity; SROC: summary receiver operating characteristic.
Figure 3.
Figure 3.. Meta-analysis results of radiomics-based machine learning models in differentiating pancreatic ductal adenocarcinoma from mass-forming pancreatitis within the independent validation set. Data were derived from studies [252629323540-424546undefinedundefined]. (A) Coupled forest plots of pooled sensitivity and specificity. (B) Summary receiver operating characteristic curve. (C) Deeks’ funnel plot. (D) Fagan nomogram. AUC: area under the curve; ESS: effective sample size; LR: logistic regression; Prob: probability; Sens: sensitivity; Spec: specificity; SROC: summary receiver operating characteristic.
Figure 4.
Figure 4.. Meta-analysis results of radiomics-based machine learning models in differentiating pancreatic ductal adenocarcinoma from mass-forming pancreatitis within the cross-validation set. Data were derived from studies [232431333436-394344undefinedundefinedundefined]. (A) Coupled forest plots of pooled sensitivity and specificity. (B) Summary receiver operating characteristic curve. (C) Deeks’ funnel plot. (D) Fagan nomogram. AUC: area under the curve; ESS: effective sample size; LR: logistic regression; Prob: probability; Sens: sensitivity; Spec: specificity; SROC: summary receiver operating characteristic.

Similar articles

References

    1. Sung H, Ferlay J, Siegel RL, et al. Global Cancer Statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2021 May;71(3):209–249. doi: 10.3322/caac.21660. doi. Medline. - DOI - PubMed
    1. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2018. CA Cancer J Clin. 2018 Jan;68(1):7–30. doi: 10.3322/caac.21442. doi. Medline. - DOI - PubMed
    1. Kamisawa T, Wood LD, Itoi T, Takaori K. Pancreatic cancer. Lancet. 2016 Jul 2;388(10039):73–85. doi: 10.1016/S0140-6736(16)00141-0. doi. Medline. - DOI - PubMed
    1. Vasen H, Ibrahim I, Ponce CG, et al. Benefit of surveillance for pancreatic cancer in high-risk individuals: outcome of long-term prospective follow-up studies from three European expert centers. J Clin Oncol. 2016 Jun 10;34(17):2010–2019. doi: 10.1200/JCO.2015.64.0730. doi. Medline. - DOI - PubMed
    1. Gonda TA, Everett JN, Wallace M, Simeone DM, PRECEDE Consortium Recommendations for a more organized and effective approach to the early detection of pancreatic cancer from the PRECEDE (Pancreatic Cancer Early Detection) consortium. Gastroenterology. 2021 Dec;161(6):1751–1757. doi: 10.1053/j.gastro.2021.08.036. doi. Medline. - DOI - PubMed

MeSH terms

LinkOut - more resources