Multi-PGS enhances polygenic prediction by combining 937 polygenic scores
- PMID: 37543680
- PMCID: PMC10404269
- DOI: 10.1038/s41467-023-40330-w
Multi-PGS enhances polygenic prediction by combining 937 polygenic scores
Abstract
The predictive performance of polygenic scores (PGS) is largely dependent on the number of samples available to train the PGS. Increasing the sample size for a specific phenotype is expensive and takes time, but this sample size can be effectively increased by using genetically correlated phenotypes. We propose a framework to generate multi-PGS from thousands of publicly available genome-wide association studies (GWAS) with no need to individually select the most relevant ones. In this study, the multi-PGS framework increases prediction accuracy over single PGS for all included psychiatric disorders and other available outcomes, with prediction R2 increases of up to 9-fold for attention-deficit/hyperactivity disorder compared to a single PGS. We also generate multi-PGS for phenotypes without an existing GWAS and for case-case predictions. We benchmark the multi-PGS framework against other methods and highlight its potential application to new emerging biobanks.
© 2023. Springer Nature Limited.
Conflict of interest statement
C.M.B. reports: Lundbeckfonden (grant recipient); Pearson (author, royalty recipient); Equip Health Inc. (Stakeholder Advisory Board). B.M.N. is a member of the scientific advisory board at Deep Genomics and Neumora, consultant of the scientific advisory board for Camp4 Therapeutics and consultant for Merck. B.J.V. is on Allelica’s international advisory board. The remaining authors declare no competing interests.
Figures
References
Publication types
MeSH terms
Associated data
LinkOut - more resources
Full Text Sources
Medical
