Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 May 21;14(1):11632.
doi: 10.1038/s41598-024-62513-1.

Stacked neural network for predicting polygenic risk score

Affiliations

Stacked neural network for predicting polygenic risk score

Sun Bin Kim et al. Sci Rep. .

Abstract

In recent years, the utility of polygenic risk scores (PRS) in forecasting disease susceptibility from genome-wide association studies (GWAS) results has been widely recognised. Yet, these models face limitations due to overfitting and the potential overestimation of effect sizes in correlated variants. To surmount these obstacles, we devised the Stacked Neural Network Polygenic Risk Score (SNPRS). This novel approach synthesises outputs from multiple neural network models, each calibrated using genetic variants chosen based on diverse p-value thresholds. By doing so, SNPRS captures a broader array of genetic variants, enabling a more nuanced interpretation of the combined effects of these variants. We assessed the efficacy of SNPRS using the UK Biobank data, focusing on the genetic risks associated with breast and prostate cancers, as well as quantitative traits like height and BMI. We also extended our analysis to the Korea Genome and Epidemiology Study (KoGES) dataset. Impressively, our results indicate that SNPRS surpasses traditional PRS models and an isolated deep neural network in terms of accuracy, highlighting its promise in refining the efficacy and relevance of PRS in genetic studies.

Keywords: Deep learning; Ensemble learning; Polygenic risk score.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
The overall architecture of SNPRS model.
Figure 2
Figure 2
Nagelkerke R-square and R-square plot comparison for UKBB dataset with single models and stacked one.
Figure 3
Figure 3
Nagelkerke R-square and R-square plot comparison for UKBB dataset with other models.
Figure 4
Figure 4
Nagelkerke R-square and R-square plot comparison for KoGES dataset with single models and stacked one.
Figure 5
Figure 5
Nagelkerke R-square and R-square plot comparison for KoGES dataset with other models.
Algorithm 1
Algorithm 1
Stacked neural network.

Similar articles

Cited by

References

    1. Visscher PM, Brown MA, McCarthy MI, Yang J. Five years of GWAS discovery. Am. J. Hum. Genet. 2012;90:7–24. doi: 10.1016/j.ajhg.2011.11.029. - DOI - PMC - PubMed
    1. Torkamani A, Wineinger NE, Topol EJ. The personal and clinical utility of polygenic risk scores. Nat. Rev. Genet. 2018;19:581–590. doi: 10.1038/s41576-018-0018-x. - DOI - PubMed
    1. Sugrue LP, Desikan RS. What are polygenic scores and why are they important? JAMA. 2019;321:1820–1821. doi: 10.1001/jama.2019.3893. - DOI - PubMed
    1. Lewis CM, Vassos E. Polygenic risk scores: From research tools to clinical instruments. Genome Med. 2020;12:44. doi: 10.1186/s13073-020-00742-5. - DOI - PMC - PubMed
    1. Khera AV, et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. 2018;50:1219–1224. doi: 10.1038/s41588-018-0183-z. - DOI - PMC - PubMed

MeSH terms