Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar 31;1(3):e0000022.
doi: 10.1371/journal.pdig.0000022. eCollection 2022 Mar.

Sources of bias in artificial intelligence that perpetuate healthcare disparities-A global review

Affiliations

Sources of bias in artificial intelligence that perpetuate healthcare disparities-A global review

Leo Anthony Celi et al. PLOS Digit Health. .

Abstract

Background: While artificial intelligence (AI) offers possibilities of advanced clinical prediction and decision-making in healthcare, models trained on relatively homogeneous datasets, and populations poorly-representative of underlying diversity, limits generalisability and risks biased AI-based decisions. Here, we describe the landscape of AI in clinical medicine to delineate population and data-source disparities.

Methods: We performed a scoping review of clinical papers published in PubMed in 2019 using AI techniques. We assessed differences in dataset country source, clinical specialty, and author nationality, sex, and expertise. A manually tagged subsample of PubMed articles was used to train a model, leveraging transfer-learning techniques (building upon an existing BioBERT model) to predict eligibility for inclusion (original, human, clinical AI literature). Of all eligible articles, database country source and clinical specialty were manually labelled. A BioBERT-based model predicted first/last author expertise. Author nationality was determined using corresponding affiliated institution information using Entrez Direct. And first/last author sex was evaluated using the Gendarize.io API.

Results: Our search yielded 30,576 articles, of which 7,314 (23.9%) were eligible for further analysis. Most databases came from the US (40.8%) and China (13.7%). Radiology was the most represented clinical specialty (40.4%), followed by pathology (9.1%). Authors were primarily from either China (24.0%) or the US (18.4%). First and last authors were predominately data experts (i.e., statisticians) (59.6% and 53.9% respectively) rather than clinicians. And the majority of first/last authors were male (74.1%).

Interpretation: U.S. and Chinese datasets and authors were disproportionately overrepresented in clinical AI, and almost all of the top 10 databases and author nationalities were from high income countries (HICs). AI techniques were most commonly employed for image-rich specialties, and authors were predominantly male, with non-clinical backgrounds. Development of technological infrastructure in data-poor regions, and diligence in external validation and model re-calibration prior to clinical implementation in the short-term, are crucial in ensuring clinical AI is meaningful for broader populations, and to avoid perpetuating global health inequity.

PubMed Disclaimer

Conflict of interest statement

Leo Anthony Celi is the Editor-in Chief of PLOS Digital Health and Judy Gichoya Wawira is a Section Editor for PLOS Digital Health.

Figures

Fig 1
Fig 1
(a) Confusion matrix and ROC curve. (b) Confusion matrix and ROC curve.
Fig 2
Fig 2
(a) Distribution of overall database nationality in AI in medicine. (b) Heatmap of distribution of overall database nationality in AI in medicine (reference #35)
Fig 3
Fig 3. Distribution of overall paper specialty in AI in medicine.
Fig 4
Fig 4. Distribution of overall author nationality in AI in medicine.
Fig 5
Fig 5. Distribution of first and last author expertise in AI in medicine.
Fig 6
Fig 6. Distribution of overall author gender in AI in medicine.

References

    1. Bur A, Shew M, New J. Artificial intelligence for the otolaryngologist: A state of the art review. Otolaryngol Head Neck Surg. 2019;160(4):603–611. doi: 10.1177/0194599819827507 - DOI - PubMed
    1. Oxford english dictionary. oxford, UK: Oxford University Press; 2018.
    1. Muehlematter UJ, Daniore P, Vokinger KN. Approval of artificial intelligence and machine learning-based medical devices in the USA and europe (2015–20): A comparative analysis. The Lancet Digital Health. 2021;3(3):e195–e203. doi: 10.1016/S2589-7500(20)30292-2 - DOI - PubMed
    1. Hinton G. Deep learning—a technology with the potential to transform health care. JAMA. 2018;30(11):1101–1102. doi: 10.1001/jama.2018.11100 - DOI - PubMed
    1. Ching T, Himmelstein DS, Beaulieu-Jones BK, et al. Opportunities and obstacles for deep learning in biology and medicine. Journal of The Royal Society Interface. 2018;15(141):20170387. doi: 10.1098/rsif.2017.0387 - DOI - PMC - PubMed

LinkOut - more resources