Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jun 11;86(6):929-42.
doi: 10.1016/j.ajhg.2010.05.002.

Powerful SNP-set analysis for case-control genome-wide association studies

Affiliations

Powerful SNP-set analysis for case-control genome-wide association studies

Michael C Wu et al. Am J Hum Genet. .

Abstract

GWAS have emerged as popular tools for identifying genetic variants that are associated with disease risk. Standard analysis of a case-control GWAS involves assessing the association between each individual genotyped SNP and disease risk. However, this approach suffers from limited reproducibility and difficulties in detecting multi-SNP and epistatic effects. As an alternative analytical strategy, we propose grouping SNPs together into SNP sets on the basis of proximity to genomic features such as genes or haplotype blocks, then testing the joint effect of each SNP set. Testing of each SNP set proceeds via the logistic kernel-machine-based test, which is based on a statistical framework that allows for flexible modeling of epistatic and nonlinear SNP effects. This flexibility and the ability to naturally adjust for covariate effects are important features of our test that make it appealing in comparison to individual SNP tests and existing multimarker tests. Using simulated data based on the International HapMap Project, we show that SNP-set testing can have improved power over standard individual-SNP analysis under a wide range of settings. In particular, we find that our approach has higher power than individual-SNP analysis when the median correlation between the disease-susceptibility variant and the genotyped SNPs is moderate to high. When the correlation is low, both individual-SNP analysis and the SNP-set analysis tend to have low power. We apply SNP-set analysis to analyze the Cancer Genetic Markers of Susceptibility (CGEMS) breast cancer GWAS discovery-phase data.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Empirical Power for SNP Sets Based on ASAH1 and LD Plot for the 86 SNPs in the ASAH1 Gene Based on the CEU Sample from the International HapMap Project The typed SNPs are denoted with a triangle, and the bottom panel shows the LD structure of the SNPs in the ASAH1 gene.
Figure 2
Figure 2
Empirical Power for SNP Sets Based on ASAH1 The SNPs on the x axis are sorted by median R2 with the 14 typed SNPs, which are shown in the bottom plot.
Figure 3
Figure 3
Smoothed Empirical Power Curves as a Function of Median R2 between the Causal SNP and the Typed SNP for SNP Sets Based on a Range of Genes
Figure 4
Figure 4
Comparison of the Power and Type I Error of the Logistic Kernel-Machine Test, the Wessel and Schork Method, and Mukhopadhyay et al.'s Approach Abbreviations are as follows: K, logistic kernel-machine test; W, Wessel and Schork method; M, Mukhopadhyay et al.'s approach. Power and size estimates are based on 500 and 1000 simulations, respectively. The blue line shows the expected type I error rate.

Similar articles

Cited by

References

    1. Hunter D.J., Kraft P., Jacobs K.B., Cox D.G., Yeager M., Hankinson S.E., Wacholder S., Wang Z., Welch R., Hutchinson A. A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer. Nat. Genet. 2007;39:870–874. - PMC - PubMed
    1. Easton D.F., Pooley K.A., Dunning A.M., Pharoah P.D., Thompson D., Ballinger D.G., Struewing J.P., Morrison J., Field H., Luben R., SEARCH collaborators. kConFab. AOCS Management Group Genome-wide association study identifies novel breast cancer susceptibility loci. Nature. 2007;447:1087–1093. - PMC - PubMed
    1. Yeager M., Orr N., Hayes R.B., Jacobs K.B., Kraft P., Wacholder S., Minichiello M.J., Fearnhead P., Yu K., Chatterjee N. Genome-wide association study of prostate cancer identifies a second risk locus at 8q24. Nat. Genet. 2007;39:645–649. - PubMed
    1. Gudmundsson J., Sulem P., Manolescu A., Amundadottir L.T., Gudbjartsson D., Helgason A., Rafnar T., Bergthorsson J.T., Agnarsson B.A., Baker A. Genome-wide association study identifies a second prostate cancer susceptibility variant at 8q24. Nat. Genet. 2007;39:631–637. - PubMed
    1. Thomas G., Jacobs K.B., Yeager M., Kraft P., Wacholder S., Orr N., Yu K., Chatterjee N., Welch R., Hutchinson A. Multiple loci identified in a genome-wide association study of prostate cancer. Nat. Genet. 2008;40:310–315. - PubMed

Publication types

Substances