Pitfalls and Best Practices in Evaluation of AI Algorithmic Biases in Radiology
- PMID: 40392092
- PMCID: PMC12127964
- DOI: 10.1148/radiol.241674
Pitfalls and Best Practices in Evaluation of AI Algorithmic Biases in Radiology
Abstract
Despite growing awareness of problems with fairness in artificial intelligence (AI) models in radiology, evaluation of algorithmic biases, or AI biases, remains challenging due to various complexities. These include incomplete reporting of demographic information in medical imaging datasets, variability in definitions of demographic categories, and inconsistent statistical definitions of bias. To guide the appropriate evaluation of AI biases in radiology, this article summarizes the pitfalls in the evaluation and measurement of algorithmic biases. These pitfalls span the spectrum from the technical (eg, how different statistical definitions of bias impact conclusions about whether an AI model is biased) to those associated with social context (eg, how different conventions of race and ethnicity impact identification or masking of biases). Actionable best practices and future directions to avoid these pitfalls are summarized across three key areas: (a) medical imaging datasets, (b) demographic definitions, and (c) statistical evaluations of bias. Although AI bias in radiology has been broadly reviewed in the recent literature, this article focuses specifically on underrecognized potential pitfalls related to the three key areas. By providing awareness of these pitfalls along with actionable practices to avoid them, exciting AI technologies can be used in radiology for the good of all people.
© RSNA, 2025 See also the editorial by Davis in this issue.
Conflict of interest statement
Similar articles
-
Sociodemographic Variables Reporting in Human Radiology Artificial Intelligence Research.J Am Coll Radiol. 2023 Jun;20(6):554-560. doi: 10.1016/j.jacr.2023.03.014. Epub 2023 May 5. J Am Coll Radiol. 2023. PMID: 37148953
-
AI pitfalls and what not to do: mitigating bias in AI.Br J Radiol. 2023 Oct;96(1150):20230023. doi: 10.1259/bjr.20230023. Epub 2023 Sep 12. Br J Radiol. 2023. PMID: 37698583 Free PMC article. Review.
-
Pitfalls in Interpretive Applications of Artificial Intelligence in Radiology.AJR Am J Roentgenol. 2024 Oct;223(4):e2431493. doi: 10.2214/AJR.24.31493. Epub 2024 Jul 24. AJR Am J Roentgenol. 2024. PMID: 39046137 Review.
-
Bias in artificial intelligence for medical imaging: fundamentals, detection, avoidance, mitigation, challenges, ethics, and prospects.Diagn Interv Radiol. 2025 Mar 3;31(2):75-88. doi: 10.4274/dir.2024.242854. Epub 2024 Jul 2. Diagn Interv Radiol. 2025. PMID: 38953330 Free PMC article. Review.
-
Fairness of artificial intelligence in healthcare: review and recommendations.Jpn J Radiol. 2024 Jan;42(1):3-15. doi: 10.1007/s11604-023-01474-3. Epub 2023 Aug 4. Jpn J Radiol. 2024. PMID: 37540463 Free PMC article. Review.
Cited by
-
The ethics of data mining in healthcare: challenges, frameworks, and future directions.BioData Min. 2025 Jul 11;18(1):47. doi: 10.1186/s13040-025-00461-w. BioData Min. 2025. PMID: 40646553 Free PMC article. Review.
References
-
- Lakhani P , Sundaram B . Deep Learning at Chest Radiography: Automated Classification of Pulmonary Tuberculosis by Using Convolutional Neural Networks . Radiology 2017. ; 284 ( 2 ): 574 – 582 . - PubMed
-
- Rothenberg SA , Savage CH , Abou Elkassem A , et al. . Prospective Evaluation of AI Triage of Pulmonary Emboli on CT Pulmonary Angiograms . Radiology 2023. ; 309 ( 1 ): e230702 . - PubMed
-
- Titano JJ , Badgeley M , Schefflein J , et al. . Automated deep-neural-network surveillance of cranial images for acute neurologic events . Nat Med 2018. ; 24 ( 9 ): 1337 – 1341 . - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical