Mitigating Bias in Radiology Machine Learning: 1. Data Handling
- PMID: 36204544
- PMCID: PMC9533091
- DOI: 10.1148/ryai.210290
Mitigating Bias in Radiology Machine Learning: 1. Data Handling
Abstract
Minimizing bias is critical to adoption and implementation of machine learning (ML) in clinical practice. Systematic mathematical biases produce consistent and reproducible differences between the observed and expected performance of ML systems, resulting in suboptimal performance. Such biases can be traced back to various phases of ML development: data handling, model development, and performance evaluation. This report presents 12 suboptimal practices during data handling of an ML study, explains how those practices can lead to biases, and describes what may be done to mitigate them. Authors employ an arbitrary and simplified framework that splits ML data handling into four steps: data collection, data investigation, data splitting, and feature engineering. Examples from the available research literature are provided. A Google Colaboratory Jupyter notebook includes code examples to demonstrate the suboptimal practices and steps to prevent them. Keywords: Data Handling, Bias, Machine Learning, Deep Learning, Convolutional Neural Network (CNN), Computer-aided Diagnosis (CAD) © RSNA, 2022.
Keywords: Bias; Computer-aided Diagnosis (CAD); Convolutional Neural Network (CNN); Data Handling; Deep Learning; Machine Learning.
© 2022 by the Radiological Society of North America, Inc.
Conflict of interest statement
Disclosures of conflicts of interest: P.R. No relevant relationships. B.K. No relevant relationships. S.F. No relevant relationships. M.M. No relevant relationships. D.V.V.G. No relevant relationships. Y.S. No relevant relationships. K.Z. No relevant relationships. G.M.C. Member of the Radiology: Artificial Intelligence trainee editorial board. B.J.E. Grant from NCI; stock/stock options in FlowSIGMA, VoiceIT, and Yunu; consultant to the editor for Radiology: Artificial Intelligence.
Figures
References
-
- West E , Mutasa S , Zhu Z , Ha R . Global Trend in Artificial Intelligence-Based Publications in Radiology From 2000 to 2018 . AJR Am J Roentgenol 2019. ; 213 ( 6 ): 1204 – 1206 . - PubMed
-
- Tariq A , Purkayastha S , Padmanaban GP , et al . Current Clinical Applications of Artificial Intelligence in Radiology and Their Best Supporting Evidence . J Am Coll Radiol 2020. ; 17 ( 11 ): 1371 – 1381 . - PubMed
-
- Liew C . The future of radiology augmented with Artificial Intelligence: A strategy for success . Eur J Radiol 2018. ; 102 : 152 – 156 . - PubMed
-
- Krishna R , Maithreyi R , Surapaneni KM . Research bias: a review for medical students . J Clin Diagn Res 2010. ; 4 ( 2 ): 2320 – 2324 . https://www.jcdr.net/article_abstract.aspx?issn=0973-709x&year=2010&volu... .
-
- Kocak B , Kus EA , Kilickesmez O . How to read and review papers on machine learning and artificial intelligence in radiology: a survival guide to key methodological concepts . Eur Radiol 2021. ; 31 ( 4 ): 1819 – 1830 . - PubMed
LinkOut - more resources
Full Text Sources
Miscellaneous
