. 2023 Jul 7;14(1):4023.

doi: 10.1038/s41467-023-38930-7.

Optimal strategies for learning multi-ancestry polygenic scores vary across traits

Brieuc Lehmann¹, Maxine Mackintosh^{2

3}, Gil McVean^#⁴, Chris Holmes^#^{3

4

5}

Affiliations

¹ Department of Statistical Science, University College London, London, UK. b.lehmann@ucl.ac.uk.
² Genomics England, London, UK.
³ The Alan Turing Institute, London, UK.
⁴ Big Data Institute, University of Oxford, Oxford, UK.
⁵ Department of Statistics, University of Oxford, Oxford, UK.

^# Contributed equally.

PMID: 37419925
PMCID: PMC10328935
DOI: 10.1038/s41467-023-38930-7

Optimal strategies for learning multi-ancestry polygenic scores vary across traits

Brieuc Lehmann et al. Nat Commun. 2023.

. 2023 Jul 7;14(1):4023.

doi: 10.1038/s41467-023-38930-7.

Authors

Brieuc Lehmann¹, Maxine Mackintosh^{2

3}, Gil McVean^#⁴, Chris Holmes^#^{3

4

5}

Affiliations

¹ Department of Statistical Science, University College London, London, UK. b.lehmann@ucl.ac.uk.
² Genomics England, London, UK.
³ The Alan Turing Institute, London, UK.
⁴ Big Data Institute, University of Oxford, Oxford, UK.
⁵ Department of Statistics, University of Oxford, Oxford, UK.

^# Contributed equally.

PMID: 37419925
PMCID: PMC10328935
DOI: 10.1038/s41467-023-38930-7

Abstract

Polygenic scores (PGSs) are individual-level measures that aggregate the genome-wide genetic predisposition to a given trait. As PGS have predominantly been developed using European-ancestry samples, trait prediction using such European ancestry-derived PGS is less accurate in non-European ancestry individuals. Although there has been recent progress in combining multiple PGS trained on distinct populations, the problem of how to maximize performance given a multiple-ancestry cohort is largely unexplored. Here, we investigate the effect of sample size and ancestry composition on PGS performance for fifteen traits in UK Biobank. For some traits, PGS estimated using a relatively small African-ancestry training set outperformed, on an African-ancestry test set, PGS estimated using a much larger European-ancestry only training set. We observe similar, but not identical, results when considering other minority-ancestry groups within UK Biobank. Our results emphasise the importance of targeted data collection from underrepresented groups in order to address existing disparities in PGS performance.

PubMed Disclaimer

Conflict of interest statement

G.M. is a director of and shareholder in Genomics PLC, and is a partner in Peptide Groove LLP. M.M. is a Programme Lead for the Diverse Data initiative at Genomics England Ltd. B.L. and C.H. declare no competing interests.

Figures

**Fig. 1. Overview of methods.**
A To evaluate the different PGSs, we performed various splits of the available data. Firstly, we held out test sets of 20% of individuals in each ancestry group. From the remaining 80%, we constructed three types of training sets: a single-ancestry set consisting only of European-ancestry individuals (purple block), a single-ancestry set consisting of non-European-ancestry individuals (yellow block), and a dual-ancestry set consisting of both European-ancestry and non-European-ancestry individuals (blue block). For each training set, we used another 20% of the data to select the regularisation parameter in the LASSO. B For the dual-ancestry training set, we used an importance weighted LASSO, assigning higher weights to individuals in the minority-ancestry group. See Methods for full details.

**Fig. 2. Simulation study: predictive gap against number of African-ancestry individuals in training set.**
Each panel corresponds to a different number of African-ancestry training set individuals from n_AFR = 2000 to n_AFR = 18,000. The training sets for PGS_dual (blue lines) consisted of the corresponding African-ancestry training set for PGS_AFR (yellow lines), along with n_EUR = 18,000 European-ancestry individuals. Each line represents the mean predictive gap across 50 repetitions. The horizontal dashed lines correspond to the predictive gap for European-ancestry (EUR) test sets based on an unweighted LASSO, while the solid lines correspond to the predictive gap for African-ancestry (AFR) test sets. The parameter γ corresponds to the degree of reweighting used in the reweighted LASSO for PGS_dual. The correlation of genetic effects between ancestries ρ was varied from 0.5 (lighter lines) to 1 (darker lines).

**Fig. 3. Predictive performance for African-ancestry individuals against sample size for 15 traits in UK Biobank.**
a We fixed the number of European-ancestry (EUR) individuals in the training set at ~50,000 (26,388 for female genital prolapse (FGP)) and varied the number of African-ancestry (AFR) individuals from 0 to ~4700 (2900). The predictive performance, evaluated in terms of partial r², on African-ancestry individuals increased markedly for mean corpuscular volume (MCV) and platelet crit; and stayed largely stable (or increased slightly) for the remainder. b Here, we instead fixed the number of African-ancestry individuals in the training set at ~4700 (2900 for FGP) for each trait and varied the number of European-ancestry individuals so that the proportion of European-ancestry individuals in the training set ranged from 0% to 90%. The effect on performance on African-ancestry individuals again varied by trait, showing a clear improvement for MPV and height, and a moderate decrease for MCV. Error bars correspond to the range across five cross-validation rounds of training set construction and PGS estimation. Phenotype acronyms: mean platelet volume (MPV), mean corpuscular volume (MCV), body mass index (BMI), atrial fibrillation (AFib), diverticular disease of the intestine (DDI), female genital prolapse (FGP).

**Fig. 4. Partial r² for PGS_EUR, PGS_dual, and PGS_AFR on 15 traits in UK Biobank.**
Predictive performance on an African-ancestry (AFR) test set is shown by the solid lines. The dashed lines correspond to predictive performance on a European-ancestry (EUR) test set using PGS_EUR. The single-ancestry scores were estimated using a standard, unweighted LASSO. The dual-ancestry scores were constructed using an importance weighted LASSO with various degrees of reweighting γ. Traits are ordered according to partial r² of PGS_EUR on the European-ancestry test set (note the varying y-axes). Error bars correspond to the range across five cross-validation rounds of training set construction and PGS estimation. Phenotype acronyms: mean platelet volume (MPV), mean corpuscular volume (MCV), body mass index (BMI), atrial fibrillation (AFib), diverticular disease of the intestine (DDI), female genital prolapse (FGP).

**Fig. 5. Partial r² for PGS_EUR, PGS_dual, and PGS_min on four traits in UK Biobank for five minority-ancestry groups.**
The single-ancestry scores were estimated using a standard, unweighted LASSO. The dual-ancestry scores were constructed using an importance weighted LASSO with various degrees of reweighting γ. Error bars correspond to the range across five cross-validation rounds of training set construction and PGS estimation. The four traits considered are height, MCV, asthma, and erythrocyte distribution width. We used inferred genetic ancestry labels from Pan-UKBB, with participants divided into six groups: European ancestry (EUR), African ancestry (AFR), Admixed American ancestry (AMR), Central/South Asian ancestry (CSA), East Asian ancestry (EAS), and Middle Eastern ancestry (MID).

**Fig. 6. Allele frequency composition of variance explained by single- and dual-ancestry PGS.**
Results shown for mean corpuscular volume (left) and height (right) in a African-ancestry test set (AFR; top) and a European-ancestry test set (EUR; bottom). The black dots represent partial r² for all the variants, i.e. the entire polygenic score. Variants were grouped according to their minor allele frequency (MAF) in African-ancestry individuals (blue palette) or in European-ancestry individuals (green palette). Each bar represents the sum of the partial r² values for each subset of variants in a given polygenic score. Note that the bars are stacked, and the height of the bar is generally higher than corresponding dot due to LD between variants. The parameter γ corresponds to the degree of reweighting used in the reweighted LASSO for PGS_dual.

See this image and copyright information in PMC

Cited by

Clinical use of polygenic scores in type 2 diabetes: challenges and possibilities.
Prasad RB, Hakaste L, Tuomi T. Prasad RB, et al. Diabetologia. 2025 Jul;68(7):1361-1374. doi: 10.1007/s00125-025-06419-1. Epub 2025 Apr 5. Diabetologia. 2025. PMID: 40186687 Free PMC article. Review.
Unsupervised Ensemble Learning for Efficient Integration of Pre-trained Polygenic Risk Scores.
Duan R, Gao C, Tubbs J, Han Y, Guo M, Li S, Ma E, Luo D, Smoller J, Lee P. Duan R, et al. Res Sq [Preprint]. 2025 Apr 1:rs.3.rs-5976048. doi: 10.21203/rs.3.rs-5976048/v1. Res Sq. 2025. PMID: 40235488 Free PMC article. Preprint.
Benchmarking multi-ancestry prostate cancer polygenic risk scores in a real-world cohort.
Shah Y, Kulm S, Nauseef JT, Chen Z, Elemento O, Kensler KH, Sharaf RN. Shah Y, et al. PLoS Comput Biol. 2024 Apr 10;20(4):e1011990. doi: 10.1371/journal.pcbi.1011990. eCollection 2024 Apr. PLoS Comput Biol. 2024. PMID: 38598551 Free PMC article.
Polygenic prediction across populations is influenced by ancestry, genetic architecture, and methodology.
Wang Y, Kanai M, Tan T, Kamariza M, Tsuo K, Yuan K, Zhou W, Okada Y; BioBank Japan Project; Huang H, Turley P, Atkinson EG, Martin AR. Wang Y, et al. Cell Genom. 2023 Sep 14;3(10):100408. doi: 10.1016/j.xgen.2023.100408. eCollection 2023 Oct 11. Cell Genom. 2023. PMID: 37868036 Free PMC article.
All of Us diversity and scale improve polygenic prediction contextually with greatest improvements for under-represented populations.
Tsuo K, Shi Z, Ge T, Mandla R, Hou K, Ding Y, Pasaniuc B, Wang Y, Martin AR. Tsuo K, et al. bioRxiv [Preprint]. 2025 May 8:2024.08.06.606846. doi: 10.1101/2024.08.06.606846. bioRxiv. 2025. PMID: 39149254 Free PMC article. Preprint.

See all "Cited by" articles

References

1. Chatterjee N, Shi J, García-Closas M. Developing and evaluating polygenic risk prediction models for stratified disease prevention. Nat. Rev. Genet. 2016;17:392–406. doi: 10.1038/nrg.2016.27. - DOI - PMC - PubMed
1. Torkamani Ali, Wineinger NE, Topol EJ. The personal and clinical utility of polygenic risk scores. Nat. Rev. Genet. 2018;19:581–590. doi: 10.1038/s41576-018-0018-x. - DOI - PubMed
1. Khera AV, et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. 2018;50:1219–1224. doi: 10.1038/s41588-018-0183-z. - DOI - PMC - PubMed
1. Knowles JW, Ashley EA. Cardiovascular disease: The rise of the genetic risk score. PLoS Med. 2018;15:1–7. doi: 10.1371/journal.pmed.1002546. - DOI - PMC - PubMed
1. Maas P, et al. Breast cancer risk from modifiable and nonmodifiable risk factors among white women in the United States. JAMA Oncol. 2016;2:1295–1302. doi: 10.1001/jamaoncol.2016.1025. - DOI - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

WT_/Wellcome Trust/United Kingdom

LinkOut - more resources

Full Text Sources
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Optimal strategies for learning multi-ancestry polygenic scores vary across traits

Affiliations

Optimal strategies for learning multi-ancestry polygenic scores vary across traits

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Research Materials

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Research Materials