Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Feb 24;3(2):100200.
doi: 10.1016/j.mcpdig.2025.100200. eCollection 2025 Jun.

External Validation of Persistent Severe Acute Kidney Injury Prediction With Machine Learning Model

Affiliations

External Validation of Persistent Severe Acute Kidney Injury Prediction With Machine Learning Model

Simone Zappalà et al. Mayo Clin Proc Digit Health. .

Abstract

Objective: To externally validate the persistent electronic alert (PersEA) machine learning model for predicting persistent severe acute kidney injury (psAKI), addressing the scarcity of validated prediction models.

Patients and methods: We included adult patients (18 years or older) admitted to intensive care unit with at least stage 2 acute kidney injury (AKI) at a tertiary medical center, using retrospective data collected between January 1st, 2017 and December 31st, 2022. The data were accessed and analyzed during the period from March 1st, 2023, through July 28th, 2023. The psAKI was defined as AKI stage 3 lasting for ≥72 hours or AKI leading to death in 48 hours or kidney replacement therapy in 1 day. The performance of the PersEA model, a boosted tree algorithm fed by hourly patient data via electronic health records to provide real-time psAKI predictions, was evaluated using specific metrics that penalize late alarms. We measured the area under the receiver operating characteristic and the area under the precision-recall curves.

Results: After screening, 4479 patients from the Mayo Clinic cohort were included in the current external validation study, with 234 (5.22%) having psAKI. Results from the Amsterdam UMCdb (531 patients, 59 [11.11%] positive) and MIMIC-III (495 patients, 57 [11.52%] positive) cohorts were obtained in a prior development study. The model demonstrated an area under the receiver operating characteristic curve of 0.98 (95% CI, 0.97-0.98) and an area under the precision-recall curve of 0.67 (95% CI, 0.60-0.73), and when applying the threshold that reached 0.80 sensitivity on the internal cohort, PersEA achieved 0.88 sensitivity, 0.94 specificity, and 0.47 precision, all based on Mayo Clinic data.

Conclusion: The PersEA model performed excellently on an external cohort, showing that it is scalable on high-quality data with little to no tuning once a noisy training set is chosen.

PubMed Disclaimer

Conflict of interest statement

Drs Zappalà, Alfieri, and Ancona, hold the European patent EP23180428 entitled “System and method for detection and prediction of kidney disease events.” Additionally, Drs Alfieri and Ancona are shareholders of U-Care Medical s.r.l., Drs Zappalà and Alfieri are employees of U-Care Medical s.r.l., and Dr Ancona is the CEO of U-Care Medical s.r.l. The other authors report no competing interests.

Figures

Figure 1
Figure 1
Risk rate analysis for a positive patient with time of persistent event (Tp) at the 47th intensive care unit hour. The medium threshold triggers an alert at the 44th hour (−3 hours lead time), whereas the low threshold does so at the 39th hour (−8 hours lead time). However, the high threshold prompts an alert at the 65th hour (18 hours lead time), potentially too late clinically. Notably, the low and medium thresholds result in true-positive results, whereas the high threshold leads to a false-negative result according to our alarm definition.
Figure 2
Figure 2
Performance curves of persistent electronic alert on Mayo Clinic for each baseline serum creatinine definition. Dots represent 80% of the sensitivity threshold working point. ADM, creatinine at admission used as the baseline; FPR, false positive rate; MDRD, modification of diet in renal disease; OPAL, Optimization Presumption Adjustment Limitation Consideration (Definition from the OPAL study [NCT01846884]); PR, precision-recall curve; ROC, receiver operating characteristic; UCM, U-Care Medical definition.

Similar articles

References

    1. Lewington A.J., Cerdá J., Mehta R.L. Raising awareness of acute kidney injury: a global perspective of a silent killer. Kidney Int. 2013;84(3):457–467. doi: 10.1038/ki.2013.153. - DOI - PMC - PubMed
    1. Hoste E.A., Bagshaw S.M., Bellomo R., et al. Epidemiology of acute kidney injury in critically ill patients: the multinational AKI-EPI study. Intensive Care Med. 2015;41(8):1411–1423. doi: 10.1007/s00134-015-3934-7. - DOI - PubMed
    1. Li Q., Hu P., Kang H., Zhou F. Clinical characteristics and short-term outcomes of acute kidney injury missed diagnosis in older patients with severe COVID-19 in intensive care unit. J Nutr Health Aging. 2021;25(4):492–500. doi: 10.1007/s12603-020-1550-x. - DOI - PMC - PubMed
    1. Kurzhagen J.T., Dellepiane S., Cantaluppi V., Rabb H. AKI: an increasingly recognized risk factor for CKD development and progression. J Nephrol. 2020;33(6):1171–1187. doi: 10.1007/s40620-020-00793-2. - DOI - PubMed
    1. Chawla L.S., Bellomo R., Bihorac A., et al. Acute kidney disease and renal recovery: consensus report of the Acute Disease Quality Initiative (ADQI) 16 Workgroup. Nat Rev Nephrol. 2017;13(4):241–257. doi: 10.1038/nrneph.2017.2. - DOI - PubMed

LinkOut - more resources