The path toward equal performance in medical machine learning
- PMID: 37521051
- PMCID: PMC10382979
- DOI: 10.1016/j.patter.2023.100790
The path toward equal performance in medical machine learning
Abstract
To ensure equitable quality of care, differences in machine learning model performance between patient groups must be addressed. Here, we argue that two separate mechanisms can cause performance differences between groups. First, model performance may be worse than theoretically achievable in a given group. This can occur due to a combination of group underrepresentation, modeling choices, and the characteristics of the prediction task at hand. We examine scenarios in which underrepresentation leads to underperformance, scenarios in which it does not, and the differences between them. Second, the optimal achievable performance may also differ between groups due to differences in the intrinsic difficulty of the prediction task. We discuss several possible causes of such differences in task difficulty. In addition, challenges such as label biases and selection biases may confound both learning and performance evaluation. We highlight consequences for the path toward equal performance, and we emphasize that leveling up model performance may require gathering not only more data from underperforming groups but also better data. Throughout, we ground our discussion in real-world medical phenomena and case studies while also referencing relevant statistical theory.
© 2023 The Author(s).
Conflict of interest statement
The authors declare no competing interests.
Figures


Similar articles
-
Algorithm Recommendation and Performance Prediction Using Meta-Learning.Int J Neural Syst. 2023 Mar;33(3):2350011. doi: 10.1142/S0129065723500119. Epub 2023 Feb 1. Int J Neural Syst. 2023. PMID: 36722692
-
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26. Artif Intell Med. 2019. PMID: 31383477 Review.
-
Prediction and Evaluation of Machine Learning Algorithm for Prediction of Blood Transfusion during Cesarean Section and Analysis of Risk Factors of Hypothermia during Anesthesia Recovery.Comput Math Methods Med. 2022 Apr 13;2022:8661324. doi: 10.1155/2022/8661324. eCollection 2022. Comput Math Methods Med. 2022. Retraction in: Comput Math Methods Med. 2023 Jun 28;2023:9863486. doi: 10.1155/2023/9863486. PMID: 35465016 Free PMC article. Retracted.
-
Identifying Patients at Risk of Acute Kidney Injury among Patients Receiving Immune Checkpoint Inhibitors: A Machine Learning Approach.Diagnostics (Basel). 2022 Dec 14;12(12):3157. doi: 10.3390/diagnostics12123157. Diagnostics (Basel). 2022. PMID: 36553164 Free PMC article.
-
Failing Is Derailing: The Underperformance as a Stressor Model.Front Psychol. 2020 Jul 16;11:1617. doi: 10.3389/fpsyg.2020.01617. eCollection 2020. Front Psychol. 2020. PMID: 32765368 Free PMC article. Review.
Cited by
-
The limits of fair medical imaging AI in real-world generalization.Nat Med. 2024 Oct;30(10):2838-2848. doi: 10.1038/s41591-024-03113-4. Epub 2024 Jun 28. Nat Med. 2024. PMID: 38942996 Free PMC article.
-
Health equity assessment of machine learning performance (HEAL): a framework and dermatology AI model case study.EClinicalMedicine. 2024 Mar 14;70:102479. doi: 10.1016/j.eclinm.2024.102479. eCollection 2024 Apr. EClinicalMedicine. 2024. PMID: 38685924 Free PMC article.
-
An Investigation into Race Bias in Random Forest Models Based on Breast DCE-MRI Derived Radiomics Features.Clin Image Based Proced Fairness AI Med Imaging Ethical Philos Issues Med Imaging (2023). 2023;14242:225-234. doi: 10.1007/978-3-031-45249-9_22. Epub 2023 Oct 9. Clin Image Based Proced Fairness AI Med Imaging Ethical Philos Issues Med Imaging (2023). 2023. PMID: 39404661 Free PMC article.
-
The Permissibility of Biased AI in a Biased World: An Ethical Analysis of AI for Screening and Referrals for Diabetic Retinopathy in Singapore.Asian Bioeth Rev. 2024 Oct 31;17(1):167-185. doi: 10.1007/s41649-024-00315-3. eCollection 2025 Jan. Asian Bioeth Rev. 2024. PMID: 39896078 Free PMC article.
-
A scoping review and evidence gap analysis of clinical AI fairness.NPJ Digit Med. 2025 Jun 14;8(1):360. doi: 10.1038/s41746-025-01667-2. NPJ Digit Med. 2025. PMID: 40517148 Free PMC article.
References
-
- Buolamwini J., Gebru T. In: Proceedings of the 1st Conference on Fairness, Accountability and Transparency. Friedler S.A., Wilson C., editors. Vol. 81. PMLR; 2018. Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification; pp. 77–91.http://proceedings.mlr.press/v81/buolamwini18a.html
Publication types
LinkOut - more resources
Full Text Sources