Powerful SNP-set analysis for case-control genome-wide association studies
- PMID: 20560208
- PMCID: PMC3032061
- DOI: 10.1016/j.ajhg.2010.05.002
Powerful SNP-set analysis for case-control genome-wide association studies
Abstract
GWAS have emerged as popular tools for identifying genetic variants that are associated with disease risk. Standard analysis of a case-control GWAS involves assessing the association between each individual genotyped SNP and disease risk. However, this approach suffers from limited reproducibility and difficulties in detecting multi-SNP and epistatic effects. As an alternative analytical strategy, we propose grouping SNPs together into SNP sets on the basis of proximity to genomic features such as genes or haplotype blocks, then testing the joint effect of each SNP set. Testing of each SNP set proceeds via the logistic kernel-machine-based test, which is based on a statistical framework that allows for flexible modeling of epistatic and nonlinear SNP effects. This flexibility and the ability to naturally adjust for covariate effects are important features of our test that make it appealing in comparison to individual SNP tests and existing multimarker tests. Using simulated data based on the International HapMap Project, we show that SNP-set testing can have improved power over standard individual-SNP analysis under a wide range of settings. In particular, we find that our approach has higher power than individual-SNP analysis when the median correlation between the disease-susceptibility variant and the genotyped SNPs is moderate to high. When the correlation is low, both individual-SNP analysis and the SNP-set analysis tend to have low power. We apply SNP-set analysis to analyze the Cancer Genetic Markers of Susceptibility (CGEMS) breast cancer GWAS discovery-phase data.
Figures




Similar articles
-
Association test based on SNP set: logistic kernel machine based test vs. principal component analysis.PLoS One. 2012;7(9):e44978. doi: 10.1371/journal.pone.0044978. Epub 2012 Sep 13. PLoS One. 2012. PMID: 23028716 Free PMC article.
-
An efficient weighted tag SNP-set analytical method in genome-wide association studies.BMC Genet. 2015 Mar 13;16:25. doi: 10.1186/s12863-015-0182-3. BMC Genet. 2015. PMID: 25879733 Free PMC article.
-
SNP set association analysis for familial data.Genet Epidemiol. 2012 Dec;36(8):797-810. doi: 10.1002/gepi.21676. Epub 2012 Sep 11. Genet Epidemiol. 2012. PMID: 22968922 Free PMC article.
-
Kernel machine SNP-set analysis for censored survival outcomes in genome-wide association studies.Genet Epidemiol. 2011 Nov;35(7):620-31. doi: 10.1002/gepi.20610. Epub 2011 Aug 4. Genet Epidemiol. 2011. PMID: 21818772 Free PMC article.
-
Tag SNP selection for association studies.Genet Epidemiol. 2004 Dec;27(4):365-74. doi: 10.1002/gepi.20028. Genet Epidemiol. 2004. PMID: 15372618 Review.
Cited by
-
Views on GWAS statistical analysis.Bioinformation. 2020 May 31;16(5):393-397. doi: 10.6026/97320630016393. eCollection 2020. Bioinformation. 2020. PMID: 32831520 Free PMC article.
-
HYST: a hybrid set-based test for genome-wide association studies, with application to protein-protein interaction-based association analysis.Am J Hum Genet. 2012 Sep 7;91(3):478-88. doi: 10.1016/j.ajhg.2012.08.004. Am J Hum Genet. 2012. PMID: 22958900 Free PMC article.
-
Gastrointestinal stromal tumors, somatic mutations and candidate genetic risk variants.PLoS One. 2013 Apr 18;8(4):e62119. doi: 10.1371/journal.pone.0062119. Print 2013. PLoS One. 2013. PMID: 23637977 Free PMC article. Clinical Trial.
-
Polymorphisms in miRNA Genes Targeting the AMPK Signaling Pathway are Associated with Cervical Cancer Susceptibility in a Han Chinese Population.Int J Gen Med. 2024 Sep 16;17:4171-4188. doi: 10.2147/IJGM.S473133. eCollection 2024. Int J Gen Med. 2024. PMID: 39308972 Free PMC article.
-
Test for interactions between a genetic marker set and environment in generalized linear models.Biostatistics. 2013 Sep;14(4):667-81. doi: 10.1093/biostatistics/kxt006. Epub 2013 Mar 5. Biostatistics. 2013. PMID: 23462021 Free PMC article.
References
-
- Easton D.F., Pooley K.A., Dunning A.M., Pharoah P.D., Thompson D., Ballinger D.G., Struewing J.P., Morrison J., Field H., Luben R., SEARCH collaborators. kConFab. AOCS Management Group Genome-wide association study identifies novel breast cancer susceptibility loci. Nature. 2007;447:1087–1093. - PMC - PubMed
-
- Yeager M., Orr N., Hayes R.B., Jacobs K.B., Kraft P., Wacholder S., Minichiello M.J., Fearnhead P., Yu K., Chatterjee N. Genome-wide association study of prostate cancer identifies a second risk locus at 8q24. Nat. Genet. 2007;39:645–649. - PubMed
-
- Gudmundsson J., Sulem P., Manolescu A., Amundadottir L.T., Gudbjartsson D., Helgason A., Rafnar T., Bergthorsson J.T., Agnarsson B.A., Baker A. Genome-wide association study identifies a second prostate cancer susceptibility variant at 8q24. Nat. Genet. 2007;39:631–637. - PubMed
-
- Thomas G., Jacobs K.B., Yeager M., Kraft P., Wacholder S., Orr N., Yu K., Chatterjee N., Welch R., Hutchinson A. Multiple loci identified in a genome-wide association study of prostate cancer. Nat. Genet. 2008;40:310–315. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials