BMC Public Health. 2024 Dec 28;24(1):3599.
doi: 10.1186/s12889-024-21081-9.

Bias in machine learning applications to address non-communicable diseases at a population-level: a scoping review

Sharon Birdi et al. BMC Public Health.

Abstract

Background: Machine learning (ML) is increasingly used in population and public health to support epidemiological studies, surveillance, and evaluation. Our objective was to conduct a scoping review to identify studies that use ML in population health, with a focus on its use in non-communicable diseases (NCDs). We also examine potential algorithmic biases in model design, training, and implementation, as well as efforts to mitigate these biases.

Methods: We searched the peer-reviewed, indexed literature using Medline, Embase, Cochrane Central Register of Controlled Trials and Cochrane Database of Systematic Reviews, CINAHL, Scopus, ACM Digital Library, Inspec, Web of Science's Science Citation Index, Social Sciences Citation Index, and the Emerging Sources Citation Index, up to March 2022.

Results: The search identified 27 310 studies, of which 65 were included. Study aims were classified as either algorithm comparison (n = 13, 20%) or disease modelling for population-health-related outputs (n = 52, 80%). We extracted data on NCD type, data sources, technical approach, possible algorithmic bias, and jurisdiction. Type 2 diabetes was the most studied NCD. The most common use of ML was for risk modelling. Bias mitigation was not extensively addressed, and most of the methods reported focused on mitigating sex-related bias.
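
As a quick arithmetic check of the proportions reported in the Results, the following minimal Python sketch recomputes the shares from the stated counts. The variable names are illustrative assumptions and not part of the review; the inclusion rate is derived here from the reported counts and is not itself stated in the abstract.

```python
# Counts copied from the Results paragraph; names are hypothetical.
identified = 27_310  # records returned by the search
included = 65        # studies retained in the review
aims = {"algorithm comparison": 13, "disease modelling": 52}

# The two aim categories should account for every included study.
assert sum(aims.values()) == included

# Share of included studies per aim, rounded to whole percentages.
shares = {aim: round(100 * n / included) for aim, n in aims.items()}

# Percentage of identified records that were ultimately included.
inclusion_rate = 100 * included / identified

print(shares)
print(f"{inclusion_rate:.2f}% of identified records were included")
```

The recomputed shares agree with the 20% and 80% reported for the two study aims.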

Conclusion: This review examines current applications of ML in NCDs, highlighting potential biases and strategies for mitigation. Future research should focus on communicable diseases and the transferability of ML models in low and middle-income settings. Our findings can guide the development of guidelines for the equitable use of ML to improve population health outcomes.

Keywords: Artificial intelligence; Machine learning; Non-communicable disease; Population health.

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.

Figures

Fig. 1. PRISMA-ScR flow diagram
Fig. 2. Distribution of included studies by year of publication


