High performance computing enabling exhaustive analysis of higher order single nucleotide polymorphism interaction in Genome Wide Association Studies
- PMID: 25870758
- PMCID: PMC4383059
- DOI: 10.1186/2047-2501-3-S1-S3
High performance computing enabling exhaustive analysis of higher order single nucleotide polymorphism interaction in Genome Wide Association Studies
Abstract
Genome-wide association studies (GWAS) are a common approach for systematic discovery of single nucleotide polymorphisms (SNPs) which are associated with a given disease. Univariate analysis approaches commonly employed may miss important SNP associations that only appear through multivariate analysis in complex diseases. However, multivariate SNP analysis is currently limited by its inherent computational complexity. In this work, we present a computational framework that harnesses supercomputers. Based on our results, we estimate a three-way interaction analysis on 1.1 million SNP GWAS data requiring over 5.8 years on the full "Avoca" IBM Blue Gene/Q installation at the Victorian Life Sciences Computation Initiative. This is hundreds of times faster than estimates for other CPU based methods and four times faster than runtimes estimated for GPU methods, indicating how the improvement in the level of hardware applied to interaction analysis may alter the types of analysis that can be performed. Furthermore, the same analysis would take under 3 months on the currently largest IBM Blue Gene/Q supercomputer "Sequoia" at the Lawrence Livermore National Laboratory assuming linear scaling is maintained as our results suggest. Given that the implementation used in this study can be further optimised, this runtime means it is becoming feasible to carry out exhaustive analysis of higher order interaction studies on large modern GWAS.
Figures
References
-
- Visscher PM, Brown Ma, McCarthy MI, Yang J. Five years of GWAS discovery. American Journal of Human Genetics. 2012;90(7):24. http://www.ncbi.nlm.nih.gov/pubmed/22243964 - PMC - PubMed
-
- Cantor RM, Lange K, Sinsheimer JS. Prioritizing GWAS Results: A Review of Statistical Methods and Recommendations for Their Application. American Journal of Human Genetics. 2010;86:6–22. doi: 10.1016/j.ajhg.2009.11.017. http://www.ncbi.nlm.nih.gov/pubmed/20074509 - DOI - PMC - PubMed
-
- Zuk O, Hechter E, Sunyaev SR, Lander ES. The mystery of missing heritability: Genetic interactions create phantom heritability. Proceedings of the National Academy of Sciences. 2012;109:1193–1198. doi: 10.1073/pnas.1119675109. http://www.ncbi.nlm.nih.gov/pubmed/22223662 - DOI - PMC - PubMed
-
- Culverhouse R, Suarez BK, Lin J, Reich T. A perspective on epistasis: limits of models displaying no main effect. American Journal of Human Genetics. 2002;70:461–471. doi: 10.1086/338759. http://www.ncbi.nlm.nih.gov/pubmed/11791213 - DOI - PMC - PubMed
-
- Gilbert-Diamond D, Moore JH. Analysis of gene-gene interactions. Current Protocols in Human Genetics. 2011;Chapter 1(July):Unit1.14. http://www.ncbi.nlm.nih.gov/pubmed/21735376 - PMC - PubMed
LinkOut - more resources
Full Text Sources
