. 2025 Feb 24;22(2):e1004432.

doi: 10.1371/journal.pmed.1004432. eCollection 2025 Feb.

A systematic review of machine learning-based prognostic models for acute pancreatitis: Towards improving methods and reporting quality

Brian Critelli¹, Amier Hassan¹, Ila Lahooti², Lydia Noh³, Jun Sung Park², Kathleen Tong², Ali Lahooti¹, Nathan Matzko¹, Jan Niklas Adams⁴, Lukas Liss⁴, Justin Quion⁵, David Restrepo⁵, Melica Nikahd⁶, Stacey Culp⁶, Adam Lacy-Hulbert⁷, Cate Speake⁸, James Buxbaum⁹, Jason Bischof¹⁰, Cemal Yazici¹¹, Anna Evans-Phillips¹², Sophie Terp¹³, Alexandra Weissman¹⁴, Darwin Conwell¹⁵, Philip Hart², Mitchell Ramsey², Somashekar Krishna², Samuel Han², Erica Park², Raj Shah², Venkata Akshintala¹⁶, John A Windsor¹⁷, Nikhil K Mull¹⁸, Georgios Papachristou², Leo Anthony Celi^{5

19}, Peter Lee²

Affiliations

¹ Department of Gastroenterology and Hepatology, Weill Cornell Medical College, New York, New York, United States of America.
² Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America.
³ Northeast Ohio Medical School, Rootstown, Ohio, United States of America.
⁴ Department of Process and Data Science, Rheinisch-Westfälische Technische Hochschule Aachen University, Aachen, Germany.
⁵ Department of Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America.
⁶ Department of Bioinformatics, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America.
⁷ Department of Systems Immunology, Benaroya Research Institute at Virginia Mason, Seattle, Washington, United States of America.
⁸ Department of Interventional Immunology, Benaroya Research Institute at Virginia Mason, Seattle, Washington, United States of America.
⁹ Department of Gastroenterology, University of Southern California, Los Angeles, California, United States of America.
¹⁰ Department of Emergency Medicine, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America.
¹¹ Department of Gastroenterology, University of Illinois at Chicago, Chicago, Illinois, United States of America.
¹² Department of Gastroenterology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania, United States of America.
¹³ Department of Emergency Medicine, University of Southern California, Los Angeles, California, United States of America.
¹⁴ Department of Emergency Medicine, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania, United States of America.
¹⁵ Department of Medicine, University of Kentucky, Lexington, Kentucky, United States of America.
¹⁶ Department of Gastroenterology, Johns Hopkins Medical Center, Baltimore, Maryland, United States of America.
¹⁷ Department of Surgical and Translational Research Centre, University of Auckland, Auckland, New Zealand.
¹⁸ Department of Hospital Medicine and Penn Medicine Center for Evidence-based Practice, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.
¹⁹ Department of Critical Care, Beth Israel Medical Center, Boston, Massachusetts, United States of America.

PMID: 39992936
PMCID: PMC11870378
DOI: 10.1371/journal.pmed.1004432

A systematic review of machine learning-based prognostic models for acute pancreatitis: Towards improving methods and reporting quality

Brian Critelli et al. PLoS Med. 2025.

. 2025 Feb 24;22(2):e1004432.

doi: 10.1371/journal.pmed.1004432. eCollection 2025 Feb.

Authors

Affiliations

¹ Department of Gastroenterology and Hepatology, Weill Cornell Medical College, New York, New York, United States of America.
² Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America.
³ Northeast Ohio Medical School, Rootstown, Ohio, United States of America.
⁴ Department of Process and Data Science, Rheinisch-Westfälische Technische Hochschule Aachen University, Aachen, Germany.
⁵ Department of Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America.
⁶ Department of Bioinformatics, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America.
⁷ Department of Systems Immunology, Benaroya Research Institute at Virginia Mason, Seattle, Washington, United States of America.
⁸ Department of Interventional Immunology, Benaroya Research Institute at Virginia Mason, Seattle, Washington, United States of America.
⁹ Department of Gastroenterology, University of Southern California, Los Angeles, California, United States of America.
¹⁰ Department of Emergency Medicine, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America.
¹¹ Department of Gastroenterology, University of Illinois at Chicago, Chicago, Illinois, United States of America.
¹² Department of Gastroenterology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania, United States of America.
¹³ Department of Emergency Medicine, University of Southern California, Los Angeles, California, United States of America.
¹⁴ Department of Emergency Medicine, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania, United States of America.
¹⁵ Department of Medicine, University of Kentucky, Lexington, Kentucky, United States of America.
¹⁶ Department of Gastroenterology, Johns Hopkins Medical Center, Baltimore, Maryland, United States of America.
¹⁷ Department of Surgical and Translational Research Centre, University of Auckland, Auckland, New Zealand.
¹⁸ Department of Hospital Medicine and Penn Medicine Center for Evidence-based Practice, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.
¹⁹ Department of Critical Care, Beth Israel Medical Center, Boston, Massachusetts, United States of America.

PMID: 39992936
PMCID: PMC11870378
DOI: 10.1371/journal.pmed.1004432

Abstract

Background: An accurate prognostic tool is essential to aid clinical decision-making (e.g., patient triage) and to advance personalized medicine. However, such a prognostic tool is lacking for acute pancreatitis (AP). Increasingly machine learning (ML) techniques are being used to develop high-performing prognostic models in AP. However, methodologic and reporting quality has received little attention. High-quality reporting and study methodology are critical for model validity, reproducibility, and clinical implementation. In collaboration with content experts in ML methodology, we performed a systematic review critically appraising the quality of methodology and reporting of recently published ML AP prognostic models.

Methods/findings: Using a validated search strategy, we identified ML AP studies from the databases MEDLINE and EMBASE published between January 2021 and December 2023. We also searched pre-print servers medRxiv, bioRxiv, and arXiv for pre-prints registered between January 2021 and December 2023. Eligibility criteria included all retrospective or prospective studies that developed or validated new or existing ML models in patients with AP that predicted an outcome following an episode of AP. Meta-analysis was considered if there was homogeneity in the study design and in the type of outcome predicted. For risk of bias (ROB) assessment, we used the Prediction Model Risk of Bias Assessment Tool. Quality of reporting was assessed using the Transparent Reporting of a Multivariable Prediction Model of Individual Prognosis or Diagnosis-Artificial Intelligence (TRIPOD+AI) statement that defines standards for 27 items that should be reported in publications using ML prognostic models. The search strategy identified 6,480 publications of which 30 met the eligibility criteria. Studies originated from China (22), the United States (4), and other (4). All 30 studies developed a new ML model and none sought to validate an existing ML model, producing a total of 39 new ML models. AP severity (23/39) or mortality (6/39) were the most common outcomes predicted. The mean area under the curve for all models and endpoints was 0.91 (SD 0.08). The ROB was high for at least one domain in all 39 models, particularly for the analysis domain (37/39 models). Steps were not taken to minimize over-optimistic model performance in 27/39 models. Due to heterogeneity in the study design and in how the outcomes were defined and determined, meta-analysis was not performed. Studies reported on only 15/27 items from TRIPOD+AI standards, with only 7/30 justifying sample size and 13/30 assessing data quality. Other reporting deficiencies included omissions regarding human-AI interaction (28/30), handling low-quality or incomplete data in practice (27/30), sharing analytical codes (25/30), study protocols (25/30), and reporting source data (19/30).

Conclusions: There are significant deficiencies in the methodology and reporting of recently published ML based prognostic models in AP patients. These undermine the validity, reproducibility, and implementation of these prognostic models despite their promise of superior predictive accuracy.

Registration: Research Registry (reviewregistry1727).

Copyright: © 2025 Critelli et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

**Fig 1. Summary of risk of bias in four domains assessed by PROBAST.**

**Fig 2. Heatmap depicting common areas of deficiencies in reporting standards as assessed by TRIPOD+AI.**
* Publication has same first author and year as another paper listed; PMID of each * in ascending order: Yang and colleagues (2022): 35430680, 35607360 [58,59]. Luo and colleagues (2023): 36653317, 36773821 [65,66]. Zhang and colleagues (2023): 36902504, 36964219, 37196588 [–71].

See this image and copyright information in PMC

References

1. Xiao AY, Tan ML, Wu LM, Asrani VM, Windsor JA, Yadav D, et al.. Global incidence and mortality of pancreatic diseases: a systematic review, meta-analysis, and meta-regression of population-based cohort studies. Lancet Gastroenterol Hepatol. 2016;1(1):45–55. Epub 20160628. doi: 10.1016/S2468-1253(16)30004-8 - DOI - PubMed
1. Iannuzzi JP, King JA, Leong JH, Quan J, Windsor JW, Tanyingoh D, et al.. Global incidence of acute pancreatitis is increasing over time: a systematic review and meta-analysis. Gastroenterology. 2022;162(1):122–34. Epub 20210925. doi: 10.1053/j.gastro.2021.09.043 - DOI - PubMed
1. Lee PJ, Papachristou GI. New insights into acute pancreatitis. Nat Rev Gastroenterol Hepatol. 2019;16(8):479–96. doi: 10.1038/s41575-019-0158-2 - DOI - PubMed
1. Banks PA, Bollen TL, Dervenis C, Gooszen HG, Johnson CD, Sarr MG, et al.; Acute Pancreatitis Classification Working Group. Classification of acute pancreatitis—2012: revision of the Atlanta classification and definitions by international consensus. Gut. 2013;62(1):102–11. doi: 10.1136/gutjnl-2012-302779 - DOI - PubMed
1. Dellinger EP, Forsmark CE, Layer P, Levy P, Maravi-Poma E, Petrov MS, et al.; Pancreatitis Across Nations Clinical Research and Education Alliance (PANCREA). Determinant-based classification of acute pancreatitis severity: an international multidisciplinary consultation. Ann Surg. 2012;256(6):875–80. doi: 10.1097/SLA.0b013e318256f778 - DOI - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- PubMed Central
- Public Library of Science
Medical
- MedlinePlus Health Information
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A systematic review of machine learning-based prognostic models for acute pancreatitis: Towards improving methods and reporting quality

Affiliations

A systematic review of machine learning-based prognostic models for acute pancreatitis: Towards improving methods and reporting quality

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Medical

Miscellaneous