Bayesian methods for multivariate modeling of pleiotropic SNP associations and genetic risk prediction
- PMID: 22973300
- PMCID: PMC3438684
- DOI: 10.3389/fgene.2012.00176
Bayesian methods for multivariate modeling of pleiotropic SNP associations and genetic risk prediction
Abstract
Genome-wide association studies (GWAS) have identified numerous associations between genetic loci and individual phenotypes; however, relatively few GWAS have attempted to detect pleiotropic associations, in which loci are simultaneously associated with multiple distinct phenotypes. We show that pleiotropic associations can be directly modeled via the construction of simple Bayesian networks, and that these models can be applied to produce single or ensembles of Bayesian classifiers that leverage pleiotropy to improve genetic risk prediction. The proposed method includes two phases: (1) Bayesian model comparison, to identify Single-Nucleotide Polymorphisms (SNPs) associated with one or more traits; and (2) cross-validation feature selection, in which a final set of SNPs is selected to optimize prediction. To demonstrate the capabilities and limitations of the method, a total of 1600 case-control GWAS datasets with two dichotomous phenotypes were simulated under 16 scenarios, varying the association strengths of causal SNPs, the size of the discovery sets, the balance between cases and controls, and the number of pleiotropic causal SNPs. Across the 16 scenarios, prediction accuracy varied from 90 to 50%. In the 14 scenarios that included pleiotropically associated SNPs, the pleiotropic model search and prediction methods consistently outperformed the naive model search and prediction. In the two scenarios in which there were no true pleiotropic SNPs, the differences between the pleiotropic and naive model searches were minimal. To further evaluate the method on real data, a discovery set of 1071 sickle cell disease (SCD) patients was used to search for pleiotropic associations between cerebral vascular accidents and fetal hemoglobin level. Classification was performed on a smaller validation set of 352 SCD patients, and showed that the inclusion of pleiotropic SNPs may slightly improve prediction, although the difference was not statistically significant. The proposed method is robust, computationally efficient, and provides a powerful new approach for detecting and modeling pleiotropic disease loci.
Keywords: Bayesian; GWAS; SNP; pleiotropy; prediction.
Figures











Similar articles
-
PleioGRiP: genetic risk prediction with pleiotropy.Bioinformatics. 2013 Apr 15;29(8):1086-8. doi: 10.1093/bioinformatics/btt081. Epub 2013 Feb 17. Bioinformatics. 2013. PMID: 23419378 Free PMC article.
-
A multi-trait Bayesian method for mapping QTL and genomic prediction.Genet Sel Evol. 2018 Mar 24;50(1):10. doi: 10.1186/s12711-018-0377-y. Genet Sel Evol. 2018. PMID: 29571285 Free PMC article.
-
An efficient unified model for genome-wide association studies and genomic selection.Genet Sel Evol. 2017 Aug 24;49(1):64. doi: 10.1186/s12711-017-0338-x. Genet Sel Evol. 2017. PMID: 28836943 Free PMC article.
-
Comprehensive identification of pleiotropic loci for body fat distribution using the NHGRI-EBI Catalog of published genome-wide association studies.Obes Rev. 2019 Mar;20(3):385-406. doi: 10.1111/obr.12806. Epub 2018 Nov 22. Obes Rev. 2019. PMID: 30565845 Review.
-
Multivariate analysis of genome-wide data to identify potential pleiotropic genes for type 2 diabetes, obesity and coronary artery disease using MetaCCA.Int J Cardiol. 2019 May 15;283:144-150. doi: 10.1016/j.ijcard.2018.10.102. Epub 2018 Oct 31. Int J Cardiol. 2019. PMID: 30459114 Review.
Cited by
-
Regularized machine learning in the genetic prediction of complex traits.PLoS Genet. 2014 Nov 13;10(11):e1004754. doi: 10.1371/journal.pgen.1004754. eCollection 2014 Nov. PLoS Genet. 2014. PMID: 25393026 Free PMC article. No abstract available.
-
Statistical methods to detect pleiotropy in human complex traits.Open Biol. 2017 Nov;7(11):170125. doi: 10.1098/rsob.170125. Open Biol. 2017. PMID: 29093210 Free PMC article. Review.
-
The genetics of extreme longevity: lessons from the new England centenarian study.Front Genet. 2012 Nov 30;3:277. doi: 10.3389/fgene.2012.00277. eCollection 2012. Front Genet. 2012. PMID: 23226160 Free PMC article.
-
A regression framework to uncover pleiotropy in large-scale electronic health record data.J Am Med Inform Assoc. 2019 Oct 1;26(10):1083-1090. doi: 10.1093/jamia/ocz084. J Am Med Inform Assoc. 2019. PMID: 31529123 Free PMC article.
-
PleioGRiP: genetic risk prediction with pleiotropy.Bioinformatics. 2013 Apr 15;29(8):1086-8. doi: 10.1093/bioinformatics/btt081. Epub 2013 Feb 17. Bioinformatics. 2013. PMID: 23419378 Free PMC article.
References
-
- Gupta M., Cheung C. L., Hsu Y. H., Demissie S., Cupples L. A., Kiel D. P., Karasik D. (2011). Identification of homogenous genetic architecture of multiple genetically correlated traits by block clustering of genome-wide associations. J. Bone Miner. Res. 26, 1261–127110.1002/jbmr.333 - DOI - PMC - PubMed
-
- Hand D. J. (2009). “Naive Bayes,” in The Top Ten Algorithms in Data Mining, eds Wu X., Kumar V. (London: Chapman and Hall; ), 163–178
Grants and funding
LinkOut - more resources
Full Text Sources