Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2025 May;315(2):e241674.
doi: 10.1148/radiol.241674.

Pitfalls and Best Practices in Evaluation of AI Algorithmic Biases in Radiology

Affiliations
Review

Pitfalls and Best Practices in Evaluation of AI Algorithmic Biases in Radiology

Paul H Yi et al. Radiology. 2025 May.

Abstract

Despite growing awareness of problems with fairness in artificial intelligence (AI) models in radiology, evaluation of algorithmic biases, or AI biases, remains challenging due to various complexities. These include incomplete reporting of demographic information in medical imaging datasets, variability in definitions of demographic categories, and inconsistent statistical definitions of bias. To guide the appropriate evaluation of AI biases in radiology, this article summarizes the pitfalls in the evaluation and measurement of algorithmic biases. These pitfalls span the spectrum from the technical (eg, how different statistical definitions of bias impact conclusions about whether an AI model is biased) to those associated with social context (eg, how different conventions of race and ethnicity impact identification or masking of biases). Actionable best practices and future directions to avoid these pitfalls are summarized across three key areas: (a) medical imaging datasets, (b) demographic definitions, and (c) statistical evaluations of bias. Although AI bias in radiology has been broadly reviewed in the recent literature, this article focuses specifically on underrecognized potential pitfalls related to the three key areas. By providing awareness of these pitfalls along with actionable practices to avoid them, exciting AI technologies can be used in radiology for the good of all people.

PubMed Disclaimer

Conflict of interest statement

Disclosures of conflicts of interest: P.H.Y. Grants or contracts from National Cancer Institute, National Institutes of Health, American College of Radiology, and RSNA; consulting fees from Bunkerhill Health; associate editor of Radiology: Artificial Intelligence; vice chair of Society for Imaging Informatics in Medicine Program Planning Committee; and stock or stock options from Bunkerhill Health. P.B. No relevant relationships. B.B. No relevant relationships. S.P.G. No relevant relationships. A.K. No relevant relationships. P.K. No relevant relationships. D.L. No relevant relationships. V.S.P. No relevant relationships. S.M.S. No relevant relationships. L.M. Editor of Radiology; grant from Siemens, Gordon and Betty Moore Foundation, Mary Kay Foundation, and Google; consulting fees from Lunit, iCAD, and Guerbet; payment or honoraria for lectures, presentations, speakers bureaus, manuscript writing, or educational events from iCAD and Guerbet; board member of International Society for Magnetic Resonance in Medicine and Society of Breast Imaging; stock or stock options in Lunit. J.S. Grant from National Institutes of Health (R01CA287422).

Similar articles

Cited by

References

    1. Lakhani P , Sundaram B . Deep Learning at Chest Radiography: Automated Classification of Pulmonary Tuberculosis by Using Convolutional Neural Networks . Radiology 2017. ; 284 ( 2 ): 574 – 582 . - PubMed
    1. Buda M , Wildman-Tobriner B , Hoang JK , et al. . Management of Thyroid Nodules Seen on US Images: Deep Learning May Match Performance of Radiologists . Radiology 2019. ; 292 ( 3 ): 695 – 701 . - PMC - PubMed
    1. Ding Y , Sohn JH , Kawczynski MG , et al. . A Deep Learning Model to Predict a Diagnosis of Alzheimer Disease by Using 18F-FDG PET of the Brain . Radiology 2019. ; 290 ( 2 ): 456 – 464 . - PMC - PubMed
    1. Rothenberg SA , Savage CH , Abou Elkassem A , et al. . Prospective Evaluation of AI Triage of Pulmonary Emboli on CT Pulmonary Angiograms . Radiology 2023. ; 309 ( 1 ): e230702 . - PubMed
    1. Titano JJ , Badgeley M , Schefflein J , et al. . Automated deep-neural-network surveillance of cranial images for acute neurologic events . Nat Med 2018. ; 24 ( 9 ): 1337 – 1341 . - PubMed

LinkOut - more resources