Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 Feb 3:2024.01.31.24302103.
doi: 10.1101/2024.01.31.24302103.

A Stacking Framework for Polygenic Risk Prediction in Admixed Individuals

Affiliations

A Stacking Framework for Polygenic Risk Prediction in Admixed Individuals

Kevin Liao et al. medRxiv. .

Abstract

Polygenic risk scores (PRS) are summaries of an individual's personalized genetic risk for a trait or disease. However, PRS often perform poorly for phenotype prediction when the ancestry of the target population does not match the population in which GWAS effect sizes were estimated. For many populations this can be addressed by performing GWAS in the target population. However, admixed individuals (whose genomes can be traced to multiple ancestral populations) lie on an ancestry continuum and are not easily represented as a discrete population. Here, we propose slaPRS (stacking local ancestry PRS), which incorporates multiple ancestry GWAS to alleviate the ancestry dependence of PRS in admixed samples. slaPRS uses ensemble learning (stacking) to combine local population specific PRS in regions across the genome. We compare slaPRS to single population PRS and a method that combines single population PRS globally. In simulations, slaPRS outperformed existing approaches and reduced the ancestry dependence of PRS in African Americans. In lipid traits from African British individuals (UK Biobank), slaPRS again improved on single population PRS while performing comparably to the globally combined PRS. slaPRS provides a data-driven and flexible framework to incorporate multiple population-specific GWAS and local ancestry in samples of admixed ancestry.

PubMed Disclaimer

Conflict of interest statement

Conflict of interest disclosure The authors report no conflicts of interest regarding commercial or financial interests involved with the study.

Figures

Figure 1.
Figure 1.
Diagram of local window and level 0 population specific PRS model predictions. Admixed genomes split into 5Mb windows and in each window a local population A and B PRS are computed using population-specific effect sizes. Local ancestry further computed to form covariate vector for level 1 stacking model.
Figure 2.
Figure 2.
Boxplots comparing performance of slaPRS (differing in choice of level 0 predictors from each block), PRSMarquez, and single population PRS: PRSEUR & PRSAFR (see methods) quantified through adjusted R2. Testing samples stratified by overall % of European ancestry.
Figure 3.
Figure 3.
Line graph comparing PRS performance across methods (quantified by median adjusted R2 between estimated PRS and phenotype value) as the African GWAS sample size changes (n=2000, 5000, 10,000). and Testing admixed samples stratified by European ancestry quantile.
Figure 4.
Figure 4.
Line graph comparing PRS performance as quantified through median adjusted R2 between the estimated PRS and phenotype value. Transethnic genetic correlation varies from ρ={0.2,05,0.8} and testing admixed samples stratified by European ancestry quantile.
Figure 5.
Figure 5.
Line graph comparing PRS Performance for UKB lipid phenotypes. Performance quantified through median adjusted R2 from model PHENO ~ PRS + PC1 + PC2 + PC3 + PC4. Testing admixed samples are stratified by European ancestry quantile.

Similar articles

References

    1. Loos R. J. F. 15 years of genome-wide association studies and no signs of slowing down. Nat. Commun. 11, 5900 (2020). - PMC - PubMed
    1. Dudbridge F. Power and Predictive Accuracy of Polygenic Risk Scores. PLoS Genetics vol. 9 e1003348 Preprint at 10.1371/journal.pgen.1003348 (2013). - DOI - PMC - PubMed
    1. Polygenic Risk Score Task Force of the International Common Disease Alliance. Responsible use of polygenic risk scores in the clinic: potential benefits, risks and gaps. Nat. Med. 27, 1876–1884 (2021). - PubMed
    1. Khera A. V. et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. 50, 1219–1224 (2018). - PMC - PubMed
    1. Inouye M. et al. Genomic risk prediction of coronary artery disease in 480,000 adults: Implications for primary prevention. J. Am. Coll. Cardiol. 72, 1883–1893 (2018). - PMC - PubMed

Publication types