Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Aug 15:2:76.
doi: 10.1038/s41746-019-0153-6. eCollection 2019.

Developing well-calibrated illness severity scores for decision support in the critically ill

Affiliations

Developing well-calibrated illness severity scores for decision support in the critically ill

Christopher V Cosgriff et al. NPJ Digit Med. .

Abstract

Illness severity scores are regularly employed for quality improvement and benchmarking in the intensive care unit, but poor generalization performance, particularly with respect to probability calibration, has limited their use for decision support. These models tend to perform worse in patients at a high risk for mortality. We hypothesized that a sequential modeling approach wherein an initial regression model assigns risk and all patients deemed high risk then have their risk quantified by a second, high-risk-specific, regression model would result in a model with superior calibration across the risk spectrum. We compared this approach to a logistic regression model and a sophisticated machine learning approach, the gradient boosting machine. The sequential approach did not have an effect on the receiver operating characteristic curve or the precision-recall curve but resulted in improved reliability curves. The gradient boosting machine achieved a small improvement in discrimination performance and was similarly calibrated to the sequential models.

Keywords: Health care; Medical research; Prognosis.

PubMed Disclaimer

Conflict of interest statement

Competing interestsO.B. is employed by Philips Healthcare. The other authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Fluxogram. Cohort selection process
Fig. 2
Fig. 2
Reliability curves for the APACHE IVa and Logit models
Fig. 3
Fig. 3
Receiver operating characteristic and precision-recall curves. Receiver operating characteristic and precision-recall curves for all the models. Area under the receiver operating characteristic curve (AUC) and average precision (AP) are provided for each model along with 95% confidence intervals obtained from bootstrapping
Fig. 4
Fig. 4
Reliability curves for sequential models
Fig. 5
Fig. 5
Reliability curves for the extreme gradient boosting model

References

    1. Breslow MJ, Badawi O. Severity scoring in the critically ill: Part 2: Maximizing value from outcome prediction scoring systems. Chest. 2012;141:518–527. doi: 10.1378/chest.11-0331. - DOI - PubMed
    1. Breslow MJ, Badawi O. Severity scoring in the critically ill: part 1–interpretation and accuracy of outcome prediction scoring systems. Chest. 2012;141:245–252. doi: 10.1378/chest.11-0330. - DOI - PubMed
    1. Zimmerman JE, Kramer AA, McNair DS, Malila FM. Acute Physiology and Chronic Health Evaluation (APACHE) IV: hospital mortality assessment for today’s critically ill patients. Crit. Care Med. 2006;34:1297–1310. doi: 10.1097/01.CCM.0000215112.84523.F0. - DOI - PubMed
    1. Moreno RP, et al. SAPS 3–From evaluation of the patient to evaluation of the intensive care unit. Part 2: Development of a prognostic model for hospital mortality at ICU admission. Intensive Care Med. 2005;31:1345–1355. doi: 10.1007/s00134-005-2763-5. - DOI - PMC - PubMed
    1. Vincent JL, et al. The SOFA (Sepsis-related Organ Failure Assessment) score to describe organ dysfunction/failure. On behalf of the Working Group on Sepsis-Related Problems of the European Society of Intensive Care Medicine. Intensive Care Med. 1996;22:707–710. doi: 10.1007/BF01709751. - DOI - PubMed