Rare coding variant analysis for human diseases across biobanks and ancestries
- PMID: 39210047
- PMCID: PMC7618415
- DOI: 10.1038/s41588-024-01894-5
Rare coding variant analysis for human diseases across biobanks and ancestries
Abstract
Large-scale sequencing has enabled unparalleled opportunities to investigate the role of rare coding variation in human phenotypic variability. Here, we present a pan-ancestry analysis of sequencing data from three large biobanks, including the All of Us research program. Using mixed-effects models, we performed gene-based rare variant testing for 601 diseases across 748,879 individuals, including 155,236 with ancestry dissimilar to European. We identified 363 significant associations, which highlighted core genes for the human disease phenome and identified potential novel associations, including UBR3 for cardiometabolic disease and YLPM1 for psychiatric disease. Pan-ancestry burden testing represented an inclusive and useful approach for discovery in diverse datasets, although we also highlight the importance of ancestry-specific sensitivity analyses in this setting. Finally, we found that effect sizes for rare protein-disrupting variants were concordant between samples similar to European ancestry and other genetic ancestries (βDeming = 0.7-1.0). Our results have implications for multi-ancestry and cross-biobank approaches in sequencing association studies for human disease.
© 2024. The Author(s), under exclusive licence to Springer Nature America, Inc.
Conflict of interest statement
P.T.E. has received sponsored research support from IBM Health, Bayer AG, Bristol Myers Squibb, and Pfizer; he has consulted for Bayer AG. S.A.L. is a full-time employee of Novartis Institutes of BioMedical Research as of July 18, 2022. S.A.L. previously received sponsored research support from Bristol Myers Squibb, Pfizer, Boehringer Ingelheim, Fitbit, Medtronic, Premier, and IBM, and has consulted for Bristol Myers Squibb, Pfizer, Blackstone Life Sciences, and Invitae. P.N. has received sponsored research support from Amgen, Apple, Boston Scientific, Novartis, and AstraZeneca, personal fees from Apple, AstraZeneca, Genentech / Roche, Novartis, Allelica, Foresite Labs, Blackstone Life Sciences, and HeartFlow, is a scientific advisory board member of Esperion Therapeutics, geneXwell, and TenSixteen Bio, is a scientific co-founder of TenSixteen Bio, and spousal employment at Vertex, all unrelated to the present work. B.M.P. serves on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson. The remaining authors declare no competing interests.
Figures
References
Methods-only references
-
- Szustakowski JD, et al. Advancing human genetics research and drug discovery through exome sequencing of the UK Biobank. Nat Genet. 2021;53:942–948. - PubMed
MeSH terms
Grants and funding
- K08 HL159346/HL/NHLBI NIH HHS/United States
- 91815610/ZONMW_/ZonMw/Netherlands
- R01 HL092577/HL/NHLBI NIH HHS/United States
- 18SFRN34110082/American Heart Association (American Heart Association, Inc.)
- 1RO1HL092577/Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- 862032/American Heart Association (American Heart Association, Inc.)
- K08HL159346/Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- 03-007-2022-0035/Hartstichting (Dutch Heart Foundation)
- 1K08HL153937/Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- 961045/American Heart Association (American Heart Association, Inc.)
- 1R01HL157635/Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- R01 HL157635/HL/NHLBI NIH HHS/United States
- K08 HL153937/HL/NHLBI NIH HHS/United States
LinkOut - more resources
Full Text Sources
