Performance of intensive care unit severity scoring systems across different ethnicities in the USA: a retrospective observational study
- PMID: 33766288
- PMCID: PMC8063502
- DOI: 10.1016/S2589-7500(21)00022-4
Performance of intensive care unit severity scoring systems across different ethnicities in the USA: a retrospective observational study
Abstract
Background: Despite wide use of severity scoring systems for case-mix determination and benchmarking in the intensive care unit (ICU), the possibility of scoring bias across ethnicities has not been examined. Guidelines on the use of illness severity scores to inform triage decisions for allocation of scarce resources, such as mechanical ventilation, during the current COVID-19 pandemic warrant examination for possible bias in these models. We investigated the performance of the severity scoring systems Acute Physiology and Chronic Health Evaluation IVa (APACHE IVa), Oxford Acute Severity of Illness Score (OASIS), and Sequential Organ Failure Assessment (SOFA) across four ethnicities in two large ICU databases to identify possible ethnicity-based bias.
Methods: Data from the electronic ICU Collaborative Research Database (eICU-CRD) and the Medical Information Mart for Intensive Care III (MIMIC-III) database, built from patient episodes in the USA from 2014-15 and 2001-12, respectively, were analysed for score performance in Asian, Black, Hispanic, and White people after appropriate exclusions. Hospital mortality was the outcome of interest. Discrimination and calibration were determined for all three scoring systems in all four groups, using area under receiver operating characteristic (AUROC) curve for different ethnicities to assess discrimination, and standardised mortality ratio (SMR) or proxy measures to assess calibration.
Findings: We analysed 166 751 participants (122 919 eICU-CRD and 43 832 MIMIC-III). Although measurements of discrimination were significantly different among the groups (AUROC ranging from 0·86 to 0·89 [p=0·016] with APACHE IVa and from 0·75 to 0·77 [p=0·85] with OASIS), they did not display any discernible systematic patterns of bias. However, measurements of calibration indicated persistent, and in some cases statistically significant, patterns of difference between Hispanic people (SMR 0·73 with APACHE IVa and 0·64 with OASIS) and Black people (0·67 and 0·68) versus Asian people (0·77 and 0·95) and White people (0·76 and 0·81). Although calibrations were imperfect for all groups, the scores consistently showed a pattern of overpredicting mortality for Black people and Hispanic people. Similar results were seen using SOFA scores across the two databases.
Interpretation: The systematic differences in calibration across ethnicities suggest that illness severity scores reflect statistical bias in their predictions of mortality.
Funding: There was no specific funding for this study.
Copyright © 2021 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY-NC-ND 4.0 license. Published by Elsevier Ltd.. All rights reserved.
Conflict of interest statement
Declaration of interests
RS received writing fees for health-care reports from Crystallise UK.
CM is a director for Crystallise UK. HM, JWG, DJS, and LAC declare no competing interests.
Figures




Update of
-
Performance of intensive care unit severity scoring systems across different ethnicities.medRxiv [Preprint]. 2021 Jan 20:2021.01.19.21249222. doi: 10.1101/2021.01.19.21249222. medRxiv. 2021. Update in: Lancet Digit Health. 2021 Apr;3(4):e241-e249. doi: 10.1016/S2589-7500(21)00022-4. PMID: 33501459 Free PMC article. Updated. Preprint.
Comment in
-
Ethnicity-based bias in clinical severity scores.Lancet Digit Health. 2021 Apr;3(4):e209-e210. doi: 10.1016/S2589-7500(21)00044-3. Lancet Digit Health. 2021. PMID: 33766286 No abstract available.
Similar articles
-
Performance of intensive care unit severity scoring systems across different ethnicities.medRxiv [Preprint]. 2021 Jan 20:2021.01.19.21249222. doi: 10.1101/2021.01.19.21249222. medRxiv. 2021. Update in: Lancet Digit Health. 2021 Apr;3(4):e241-e249. doi: 10.1016/S2589-7500(21)00022-4. PMID: 33501459 Free PMC article. Updated. Preprint.
-
Evaluating Prognostic Bias of Critical Illness Severity Scores Based on Age, Sex, and Primary Language in the United States: A Retrospective Multicenter Study.Crit Care Explor. 2024 Jan 17;6(1):e1033. doi: 10.1097/CCE.0000000000001033. eCollection 2024 Jan. Crit Care Explor. 2024. PMID: 38239408 Free PMC article.
-
Predictive Value of Sequential Organ Failure Assessment Score across Patients with and without COVID-19 Infection.Ann Am Thorac Soc. 2022 May;19(5):790-798. doi: 10.1513/AnnalsATS.202106-680OC. Ann Am Thorac Soc. 2022. PMID: 34784497 Free PMC article.
-
Severity scoring in the critically ill: part 1--interpretation and accuracy of outcome prediction scoring systems.Chest. 2012 Jan;141(1):245-252. doi: 10.1378/chest.11-0330. Chest. 2012. PMID: 22215834 Review.
-
Comparison of Severity of Illness Scores and Artificial Intelligence Models That Are Predictive of Intensive Care Unit Mortality: Meta-analysis and Review of the Literature.JMIR Med Inform. 2022 May 31;10(5):e35293. doi: 10.2196/35293. JMIR Med Inform. 2022. PMID: 35639445 Free PMC article. Review.
Cited by
-
Contraindications to the Initiation of Veno-Venous ECMO for Severe Acute Respiratory Failure in Adults: A Systematic Review and Practical Approach Based on the Current Literature.Membranes (Basel). 2021 Jul 30;11(8):584. doi: 10.3390/membranes11080584. Membranes (Basel). 2021. PMID: 34436348 Free PMC article. Review.
-
Origin of Sepsis Associated with the Short-Term Mortality of Patients: A Retrospective Study Using the eICU Collaborative Research Database.Int J Gen Med. 2021 Dec 24;14:10293-10301. doi: 10.2147/IJGM.S345050. eCollection 2021. Int J Gen Med. 2021. PMID: 34992444 Free PMC article.
-
Bias in vital signs? Machine learning models can learn patients' race or ethnicity from the values of vital signs alone.BMJ Health Care Inform. 2025 Jul 10;32(1):e101098. doi: 10.1136/bmjhci-2024-101098. BMJ Health Care Inform. 2025. PMID: 40639839 Free PMC article.
-
Association between lactate/albumin ratio and 28-day all-cause mortality in critically ill patients with acute myocardial infarction.Sci Rep. 2024 Oct 10;14(1):23677. doi: 10.1038/s41598-024-73788-9. Sci Rep. 2024. PMID: 39389996 Free PMC article.
-
Association of body temperature and mortality in critically ill patients: an observational study using two large databases.Eur J Med Res. 2024 Jan 6;29(1):33. doi: 10.1186/s40001-023-01616-3. Eur J Med Res. 2024. PMID: 38184625 Free PMC article.
References
-
- Wiens J, Price WN 2nd, Sjoding MW. Diagnosing bias in data-driven algorithms for healthcare. Nat Med 2020; 26: 25–26. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources