Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 May 1;23(5):344-352.
doi: 10.1097/PCC.0000000000002910. Epub 2022 May 5.

Dynamic Mortality Risk Predictions for Children in ICUs: Development and Validation of Machine Learning Models

Affiliations

Dynamic Mortality Risk Predictions for Children in ICUs: Development and Validation of Machine Learning Models

Eduardo A Trujillo Rivera et al. Pediatr Crit Care Med. .

Abstract

Objectives: Assess a machine learning method of serially updated mortality risk.

Design: Retrospective analysis of a national database (Health Facts; Cerner Corporation, Kansas City, MO).

Setting: Hospitals caring for children in ICUs.

Patients: A total of 27,354 admissions cared for in ICUs from 2009 to 2018.

Interventions: None.

Main outcome: Hospital mortality risk estimates determined at 6-hour time periods during care in the ICU. Models were truncated at 180 hours due to decreased sample size secondary to discharges and deaths.

Measurements and main results: The Criticality Index, based on physiology, therapy, and care intensity, was computed for each admission for each time period and calibrated to hospital mortality risk (Criticality Index-Mortality [CI-M]) at each of 29 time periods (initial assessment: 6 hr; last assessment: 180 hr). Performance metrics and clinical validity were determined from the held-out test sample (n = 3,453, 13%). Discrimination assessed with the area under the receiver operating characteristic curve was 0.852 (95% CI, 0.843-0.861) overall and greater than or equal to 0.80 for all individual time periods. Calibration assessed by the Hosmer-Lemeshow goodness-of-fit test showed good fit overall (p = 0.196) and was statistically not significant for 28 of the 29 time periods. Calibration plots for all models revealed the intercept ranged from--0.002 to 0.009, the slope ranged from 0.867 to 1.415, and the R2 ranged from 0.862 to 0.989. Clinical validity assessed using population trajectories and changes in the risk status of admissions (clinical volatility) revealed clinical trajectories consistent with clinical expectations and greater clinical volatility in deaths than survivors (p < 0.001).

Conclusions: Machine learning models incorporating physiology, therapy, and care intensity can track changes in hospital mortality risk during intensive care. The CI-M's framework and modeling method are potentially applicable to monitoring clinical improvement and deterioration in real time.

PubMed Disclaimer

Conflict of interest statement

Dr. Trujillo Rivera’s institution received funding from Children's National Medical Center. Drs. Chamberlain’s, Patel’s, Morizona’s, and Pollack’s institutions received funding from Mallinckrodt. Drs. Patel’s and Morizona’s institutions received funding from Awards Ul1TR001876 and KL2TR001877 from the National Institutes of Health (NIH), National Center for Advancing Translational Sciences. Drs. Patel, Morizona, and Pollack received support for article research from the NIH. Dr. Morizona disclosed having a 16% share as a founder of Cogthera LLC, a company that will develop drugs for cognitive impairment. Dr. Pollack’s institution received funding from NIH. Dr. Heneghan has disclosed that she does not have any potential conflicts of interest.

Figures

Figure 1.
Figure 1.
Area under the receiver operating characteristic curves (AUROC) for all Criticality Index – Mortality models and for all models combined. a. Separate models were developed for 29 6-hour time periods from hour 6 to hour 180. The dots are AUROCs for the individual models and the shaded areas are the 95% confidence intervals b. Composite AUROC. All models have been combined.
Figure 2.
Figure 2.
Calibration plot performance metrics for all Criticality Index – Mortality models and for all models combined a. Separate models were developed for 29 6-hour time periods from hour 6 to hour 180. Calibration plot metrics include the regression line intercepts (black), slopes (tan) and R2’s (blue). The shaded areas are the 95% confidence intervals. b. Composite calibration plot. The dashed line is the line of identity and the solid line is the regression line. The plot includes 294 risk intervals from all models. The comparison of observed proportions to expected proportions of outcome for each interval were not statistically different (p > 0.05) for 290 (98.64%) of the risk intervals. Each interval had ≥ 170 admissions. The Hosmer-Lemeshow goodness-of-fit test for the overall calibration was 0.195.
Figure 3.
Figure 3.
Trajectories for high risk deaths and survivors and low risk deaths and survivors. Risk was computed with the Criticality Index – Mortality. High risk indicates the highest mortality risk decile determined from the initial time period. The shaded areas are the 95% confidence intervals. The mortality risks cutpoints for the high risk cohorts were 15.1% for the deaths and 11.3% for the survivors. The trajectories were constructed from the total sample with the following sample sizes: low risk survivors: 24,168, admissions, high risk survivors: 2,685 admissions, low risk deaths: 451 admissions, high risk deaths: 50 patients.
Figure 4.
Figure 4.
Change in mortality risk for individual admissions in consecutive time periods. Risk was computed with the Criticality Index – Mortality. Data are shown for survivors (tan) and deaths (black). The frequencies (vertical axis) are standardized such that the added areas under the bars are 1 for each outcome group in each figure. a. Change in mortality risk for all consecutive time intervals. Deaths demonstrate increased clinical volatility compared to survivors. The average increase in mortality was 0.021 for deaths and 0.006 for survivors (p<0.001) and the average decrease was 0.022 for deaths and 0.008 for survivors (p<0.001). b. Maximum mortality risk increase (clinical deterioration). The average maximum deterioration for deaths was 0.050 and 0.015 for survivors (p<0.001) c. Maximum mortality risk decrease (clinical improvement). The average maximum decrease was 0.063 for deaths and 0.022 for survivors (p<0.001).

Comment in

References

    1. Pollack MM, Holubkov R, Funai T, et al. The Pediatric Risk of Mortality Score: Update 2015. Pediatr Crit Care Med. 2016;17(1):2–9. doi: 10.1097/PCC.0000000000000558 - DOI - PMC - PubMed
    1. Straney L, Clements A, Parslow RC, et al. Paediatric index of mortality 3: an updated model for predicting mortality in pediatric intensive care. Pediatr Crit Care Med. 2013;14(7):673–681. doi: 10.1097/PCC.0b013e31829760cf - DOI - PubMed
    1. Leteurtre S, Duhamel A, Salleron J, Grandbastien B, Lacroix J, Leclerc F; Groupe Francophone de Réanimation et d’Urgences Pédiatriques (GFRUP). PELOD-2: an update of the PEdiatric logistic organ dysfunction score. Crit Care Med. 2013. Jul;41(7):1761–73. doi: 10.1097/CCM.0b013e31828a2bbd. PMID: 23685639.doi:10.1016/S0140–6736(03)13908–6 - DOI - PubMed
    1. Aczon MD, Ledbetter DR, Laksana E, Ho LV, Wetzel RC. Continuous Prediction of Mortality in the PICU: A Recurrent Neural Network Model in a Single-Center Dataset. Pediatr Crit Care Med. 2021;22(6):519–529. doi: 10.1097/PCC.0000000000002682 - DOI - PMC - PubMed
    1. Rivera EAT, Patel AK, Chamberlain JM, et al. Criticality: A New Concept of Severity of Illness for Hospitalized Children. Pediatr Crit Care Med. 2021;22(1):e33–e43. doi: 10.1097/PCC.0000000000002560 - DOI - PMC - PubMed

Publication types