Leveraging both individual-level genetic data and GWAS summary statistics increases polygenic prediction
- PMID: 33964208
- PMCID: PMC8206385
- DOI: 10.1016/j.ajhg.2021.04.014
Leveraging both individual-level genetic data and GWAS summary statistics increases polygenic prediction
Abstract
The accuracy of polygenic risk scores (PRSs) to predict complex diseases increases with the training sample size. PRSs are generally derived based on summary statistics from large meta-analyses of multiple genome-wide association studies (GWASs). However, it is now common for researchers to have access to large individual-level data as well, such as the UK Biobank data. To the best of our knowledge, it has not yet been explored how best to combine both types of data (summary statistics and individual-level data) to optimize polygenic prediction. The most widely used approach to combine data is the meta-analysis of GWAS summary statistics (meta-GWAS), but we show that it does not always provide the most accurate PRS. Through simulations and using 12 real case-control and quantitative traits from both iPSYCH and UK Biobank along with external GWAS summary statistics, we compare meta-GWAS with two alternative data-combining approaches, stacked clumping and thresholding (SCT) and meta-PRS. We find that, when large individual-level data are available, the linear combination of PRSs (meta-PRS) is both a simple alternative to meta-GWAS and often more accurate.
Keywords: PRS; complex traits; genetic prediction; meta-analysis; polygenic risk scores; psychiatric disorders.
Copyright © 2021 The Authors. Published by Elsevier Inc. All rights reserved.
Conflict of interest statement
C.M.B. reports: Shire (grant recipient, Scientific Advisory Board member); Idorsia (consultant); Lundbeckfonden (grant recipient); Pearson (author, royalty recipient). The other authors declare no competing interests.
Figures


Similar articles
-
Efficient Implementation of Penalized Regression for Genetic Risk Prediction.Genetics. 2019 May;212(1):65-74. doi: 10.1534/genetics.119.302019. Epub 2019 Feb 26. Genetics. 2019. PMID: 30808621 Free PMC article.
-
Comparison of Methods Utilizing Sex-Specific PRSs Derived From GWAS Summary Statistics.Front Genet. 2022 Jul 8;13:892950. doi: 10.3389/fgene.2022.892950. eCollection 2022. Front Genet. 2022. PMID: 35873490 Free PMC article.
-
Incorporating European GWAS findings improve polygenic risk prediction accuracy of breast cancer among East Asians.Genet Epidemiol. 2021 Jul;45(5):471-484. doi: 10.1002/gepi.22382. Epub 2021 Mar 19. Genet Epidemiol. 2021. PMID: 33739539 Free PMC article.
-
Implementation and implications for polygenic risk scores in healthcare.Hum Genomics. 2021 Jul 20;15(1):46. doi: 10.1186/s40246-021-00339-y. Hum Genomics. 2021. PMID: 34284826 Free PMC article. Review.
-
Polygenic Risk Score in African populations: progress and challenges.F1000Res. 2023 Apr 11;11:175. doi: 10.12688/f1000research.76218.2. eCollection 2022. F1000Res. 2023. PMID: 37273966 Free PMC article. Review.
Cited by
-
Portability of 245 polygenic scores when derived from the UK Biobank and applied to 9 ancestry groups from the same cohort.Am J Hum Genet. 2022 Jan 6;109(1):12-23. doi: 10.1016/j.ajhg.2021.11.008. Am J Hum Genet. 2022. PMID: 34995502 Free PMC article.
-
Optimal strategies for learning multi-ancestry polygenic scores vary across traits.Nat Commun. 2023 Jul 7;14(1):4023. doi: 10.1038/s41467-023-38930-7. Nat Commun. 2023. PMID: 37419925 Free PMC article.
-
Postpartum and non-postpartum depression: a population-based matched case-control study comparing polygenic risk scores for severe mental disorders.Transl Psychiatry. 2023 Nov 13;13(1):346. doi: 10.1038/s41398-023-02649-2. Transl Psychiatry. 2023. PMID: 37953300 Free PMC article.
-
Multi-PGS enhances polygenic prediction by combining 937 polygenic scores.Nat Commun. 2023 Aug 5;14(1):4702. doi: 10.1038/s41467-023-40330-w. Nat Commun. 2023. PMID: 37543680 Free PMC article.
-
Advancements and limitations in polygenic risk score methods for genomic prediction: a scoping review.Hum Genet. 2024 Dec;143(12):1401-1431. doi: 10.1007/s00439-024-02716-8. Epub 2024 Nov 14. Hum Genet. 2024. PMID: 39542907
References
-
- Wray N.R., Lee S.H., Mehta D., Vinkhuyzen A.A., Dudbridge F., Middeldorp C.M. Research review: Polygenic methods and their application to psychiatric traits. J. Child Psychol. Psychiatry. 2014;55:1068–1087. - PubMed
-
- Buniello A., MacArthur J.A.L., Cerezo M., Harris L.W., Hayhurst J., Malangone C., McMahon A., Morales J., Mountjoy E., Sollis E. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 2019;47(D1):D1005–D1012. - PMC - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources