Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jul 8;44(12):e115.
doi: 10.1093/nar/gkw347. Epub 2016 Apr 25.

A fast and powerful W-test for pairwise epistasis testing

Affiliations

A fast and powerful W-test for pairwise epistasis testing

Maggie Haitian Wang et al. Nucleic Acids Res. .

Erratum in

  • A fast and powerful W-test for pairwise epistasis testing.
    Wang MH, Sun R, Guo J, Weng H, Lee J, Hu I, Sham PC, Zee BC. Wang MH, et al. Nucleic Acids Res. 2016 Dec 1;44(21):10526. doi: 10.1093/nar/gkw866. Epub 2016 Sep 26. Nucleic Acids Res. 2016. PMID: 27672040 Free PMC article. No abstract available.

Abstract

Epistasis plays an essential role in the development of complex diseases. Interaction methods face common challenge of seeking a balance between persistent power, model complexity, computation efficiency, and validity of identified bio-markers. We introduce a novel W-test to identify pairwise epistasis effect, which measures the distributional difference between cases and controls through a combined log odds ratio. The test is model-free, fast, and inherits a Chi-squared distribution with data adaptive degrees of freedom. No permutation is needed to obtain the P-values. Simulation studies demonstrated that the W-test is more powerful in low frequency variants environment than alternative methods, which are the Chi-squared test, logistic regression and multifactor-dimensionality reduction (MDR). In two independent real bipolar disorder genome-wide associations (GWAS) datasets, the W-test identified significant interactions pairs that can be replicated, including SLIT3-CENPN, SLIT3-TMEM132D, CNTNAP2-NDST4 and CNTCAP2-RTN4R The genes in the pairs play central roles in neurotransmission and synapse formation. A majority of the identified loci are undiscoverable by main effect and are low frequency variants. The proposed method offers a powerful alternative tool for mapping the genetic puzzle underlying complex disorders.

PubMed Disclaimer

Figures

Diagram 1.
Diagram 1.
Decomposition of the W-test. The W-test measures the distributional differences between cases and controls using a combined log odds ratio. The dependency among the cells is handled by the data-dependent scalars h and f, estimated from the null hypothesis. The overall test statistic follows a Chi-squared distribution with f degrees of freedom.
Figure 1.
Figure 1.
Power of alternative methods in low frequency variant environment. In the low frequency variant environment (1% < MAF < 5%),the W-test outperforms alternative methods for both linear and non-linear models.
Figure 2.
Figure 2.
Power comparison of alternative methods at different sample sizes. As the sample size reduces, the W-test shows a robust power compared to alternative methods. The power is calculated under the genetic environment of 1% < MAF < 5% and LD 20% < r2< 80%, using a non-linear model.
Figure 3.
Figure 3.
QQ Plot of W-test on real genome-wide data. The W-test is computed on real genome-wide data with permuted phenotype for SNP–SNP interactions. No inflation of spurious association is observed.
Figure 4.
Figure 4.
Gene-gene Interaction Networks. The solid lines represent significant epistasis effect. Blue color indicates pairs found in the GAIN dataset and red color indicates that they are identified in the WTCCC dataset. Purple circles represent genes replicated by the two independent data; all of which play important roles in brain and neuronal function.

References

    1. Cantor R.M., Lange K., Sinsheimer J.S. Prioritizing GWAS results: a review of statistical methods and recommendations for their application. Am. J. Hum. Genet. 2010;86:6–22. - PMC - PubMed
    1. Kraft P., Hunter D.J. Genetic risk prediction - are we there yet? N. Engl. J. Med. 2009;360:1701–1703. - PubMed
    1. Witte J.S., Visscher P.M., Wray N.R. The contribution of genetic variants to disease depends on the ruler. Nat. Rev. Genet. 2014;15:765–776. - PMC - PubMed
    1. Phillips P.C. Epistasis - the essential role of gene interactions in the structure and evolution of genetic systems. Nat. Rev. Genet. 2008;9:855–867. - PMC - PubMed
    1. Cordell H.J. Detecting gene-gene interactions that underlie human diseases. Nat. Rev. Genet. 2009;10:392–404. - PMC - PubMed

Publication types

LinkOut - more resources