Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar;30(3):339-348.
doi: 10.1038/s41431-021-01028-z. Epub 2022 Jan 4.

A tool for translating polygenic scores onto the absolute scale using summary statistics

Affiliations

A tool for translating polygenic scores onto the absolute scale using summary statistics

Oliver Pain et al. Eur J Hum Genet. 2022 Mar.

Abstract

There is growing interest in the clinical application of polygenic scores as their predictive utility increases for a range of health-related phenotypes. However, providing polygenic score predictions on the absolute scale is an important step for their safe interpretation. We have developed a method to convert polygenic scores to the absolute scale for binary and normally distributed phenotypes. This method uses summary statistics, requiring only the area-under-the-ROC curve (AUC) or variance explained (R2) by the polygenic score, and the prevalence of binary phenotypes, or mean and standard deviation of normally distributed phenotypes. Polygenic scores are converted using normal distribution theory. We also evaluate methods for estimating polygenic score AUC/R2 from genome-wide association study (GWAS) summary statistics alone. We validate the absolute risk conversion and AUC/R2 estimation using data for eight binary and three continuous phenotypes in the UK Biobank sample. When the AUC/R2 of the polygenic score is known, the observed and estimated absolute values were highly concordant. Estimates of AUC/R2 from the lassosum pseudovalidation method were most similar to the observed AUC/R2 values, though estimated values deviated substantially from the observed for autoimmune disorders. This study enables accurate interpretation of polygenic scores using only summary statistics, providing a useful tool for educational and clinical purposes. Furthermore, we have created interactive webtools implementing the conversion to the absolute ( https://opain.github.io/GenoPred/PRS_to_Abs_tool.html ). Several further barriers must be addressed before clinical implementation of polygenic scores, such as ensuring target individuals are well represented by the GWAS sample.

PubMed Disclaimer

Conflict of interest statement

CML sits on the Myriad Neuroscience Scientific Advisory Board. The other authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Comparison of observed and estimated probability of being a case across 20 DBSLMM polygenic score quantiles.
Estimated values are based on either the observed polygenic score AUC, or the lassosum estimated AUC. Figures are available in colour online.
Fig. 2
Fig. 2. Comparison of observed and estimated phenotype mean and standard deviation across 20 DBSLMM polygenic score quantiles.
Estimated values are either based on the observed polygenic score R2, or the lassosum estimated R2. Figures are available in colour online.
Fig. 3
Fig. 3. Shiny app implementing absolute scale conversion for binary phenotypes.
Parameters reflect prevalence of schizophrenia and AUC of the schizophrenia polygenic score [36]. Figures are available in colour online.
Fig. 4
Fig. 4. Shiny app implementing absolute scale conversion for normally distributed phenotypes.
Parameters reflect mean and SD of IQ, and R2 of educational attainment polygenic score for IQ [37]. Figures are available in colour online.

References

    1. Polderman TJC, Benyamin B, de Leeuw CA, Sullivan PF, van Bochoven A, Visscher PM, et al. Meta-analysis of the heritability of human traits based on fifty years of twin studies. Nat Genet. 2015;47:702–9. doi: 10.1038/ng.3285. - DOI - PubMed
    1. Wood AR, Esko T, Yang J, Vedantam S, Pers TH, Gustafsson S, et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat Genet. 2014;46:1173. doi: 10.1038/ng.3097. - DOI - PMC - PubMed
    1. Howard DM, Adams MJ, Clarke T-K, Hafferty JD, Gibson J, Shirali M, et al. Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions. Nat Neurosci. 2019;22:343–52. doi: 10.1038/s41593-018-0326-7. - DOI - PMC - PubMed
    1. Choi SW, Mak TS-H, O’Reilly PF. Tutorial: a guide to performing polygenic risk score analyses. Nat Protoc. 2020;15:1–14. - PMC - PubMed
    1. Pain O, Glanville KP, Hagenaars SP, Selzam S, Fürtjes AE, Gaspar HA, et al. Evaluation of polygenic prediction methodology within a reference-standardized framework. PLoS Genet. 2021;17:e1009021. doi: 10.1371/journal.pgen.1009021. - DOI - PMC - PubMed

Publication types