Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2025 Sep;26(9):635-649.
doi: 10.1038/s41576-025-00839-w. Epub 2025 May 15.

Methodological opportunities in genomic data analysis to advance health equity

Affiliations
Review

Methodological opportunities in genomic data analysis to advance health equity

Brieuc Lehmann et al. Nat Rev Genet. 2025 Sep.

Abstract

The causes and consequences of inequities in genomic research and medicine are complex and widespread. However, it is widely acknowledged that underrepresentation of diverse populations in human genetics research risks exacerbating existing health disparities. Efforts to improve diversity are ongoing, but an often-overlooked source of inequity is the choice of analytical methods used to process, analyse and interpret genomic data. This choice can influence all areas of genomic research, from genome-wide association studies and polygenic score development to variant prioritization and functional genomics. New statistical and machine learning techniques to understand, quantify and correct for the impact of biases in genomic data are emerging within the wider genomic research and genomic medicine ecosystems. At this crucial time point, it is important to clarify where improvements in methods and practices can, or cannot, have a role in improving equity in genomics. Here, we review existing approaches to promote equity and fairness in statistical analysis for genomics, and propose future methodological developments that are likely to yield the most impact for equity.

PubMed Disclaimer

Conflict of interest statement

Competing interests: This manuscript was informed by a project commissioned by the Diverse Data (DD) initiative at Genomics England (GEL) in December 2022 to explore the use of statistical and machine learning methods to improve fairness and equity in genomics. K.K. is the Scientific Lead for DD. S.T., T.N. and Y.C. are Genomic Data Scientists at GEL. M.S. was the Lead Genomic Data Scientist for DD, and M.M. was the Programme Lead for DD. B.L. and L.B. were paid consultants to GEL for the project. M.M. is Director of One HealthTech, which provides the secretariat for the Data Science for Health Equity community, which B.L. is also the co-founder of. B.L. and L.B. have acted as consultants for Google DeepMind in relation to other research in this field; however, Google DeepMind was not involved in this project or this publication. F.F. is an employee and shareholder of Microsoft Corporation.

Similar articles

Cited by

References

    1. Fatumo, S. et al. A roadmap to increase diversity in genomic studies. Nat. Med. 28, 243–250 (2022). This paper reports the persistent lack of diversity across genetic ancestry for participants in genome-wide association studies and discusses strategies to enhance inclusion. - PubMed - PMC
    1. Popejoy, A. B. & Fullerton, S. M. Genomics is failing on diversity. Nature 538, 161–164 (2016). - PubMed - PMC
    1. Need, A. C. & Goldstein, D. B. Next generation disparities in human genomics: concerns and remedies. Trends Genet. 25, 489–494 (2009). - PubMed
    1. Bustamante, C. D., De La Vega, F. M. & Burchard, E. G. Genomics for the world. Nature 475, 163–165 (2011). - PubMed - PMC
    1. Carrot-Zhang, J. et al. Comprehensive analysis of genetic ancestry and its molecular correlates in cancer. Cancer Cell 37, 639–654.e6 (2020). - PubMed - PMC

LinkOut - more resources