The choice of null distributions for detecting gene-gene interactions in genome-wide association studies
- PMID: 21342556
- PMCID: PMC3044281
- DOI: 10.1186/1471-2105-12-S1-S26
The choice of null distributions for detecting gene-gene interactions in genome-wide association studies
Abstract
Background: In genome-wide association studies (GWAS), the number of single-nucleotide polymorphisms (SNPs) typically ranges between 500,000 and 1,000,000. Accordingly, detecting gene-gene interactions in GWAS is computationally challenging because it involves hundreds of billions of SNP pairs. Stage-wise strategies are often used to overcome the computational difficulty. In the first stage, fast screening methods (e.g. Tuning ReliefF) are applied to reduce the whole SNP set to a small subset. In the second stage, sophisticated modeling methods (e.g., multifactor-dimensionality reduction (MDR)) are applied to the subset of SNPs to identify interesting interaction models and the corresponding interaction patterns. In the third stage, the significance of the identified interaction patterns is evaluated by hypothesis testing.
Results: In this paper, we show that this stage-wise strategy could be problematic in controlling the false positive rate if the null distribution is not appropriately chosen. This is because screening and modeling may change the null distribution used in hypothesis testing. In our simulation study, we use some popular screening methods and the popular modeling method MDR as examples to show the effect of the inappropriate choice of null distributions. To choose appropriate null distributions, we suggest to use the permutation test or testing on the independent data set. We demonstrate their performance using synthetic data and a real genome wide data set from an Aged-related Macular Degeneration (AMD) study.
Conclusions: The permutation test or testing on the independent data set can help choosing appropriate null distributions in hypothesis testing, which provides more reliable results in practice.
Figures











Similar articles
-
DualWMDR: Detecting epistatic interaction with dual screening and multifactor dimensionality reduction.Hum Mutat. 2020 Mar;41(3):719-734. doi: 10.1002/humu.23951. Epub 2019 Nov 25. Hum Mutat. 2020. PMID: 31705708
-
A novel method to identify high order gene-gene interactions in genome-wide association studies: gene-based MDR.BMC Bioinformatics. 2012 Jun 11;13 Suppl 9(Suppl 9):S5. doi: 10.1186/1471-2105-13-S9-S5. BMC Bioinformatics. 2012. PMID: 22901090 Free PMC article.
-
A novel survival multifactor dimensionality reduction method for detecting gene-gene interactions with application to bladder cancer prognosis.Hum Genet. 2011 Jan;129(1):101-10. doi: 10.1007/s00439-010-0905-5. Epub 2010 Oct 28. Hum Genet. 2011. PMID: 20981448 Free PMC article.
-
Epistasis, complexity, and multifactor dimensionality reduction.Methods Mol Biol. 2013;1019:465-77. doi: 10.1007/978-1-62703-447-0_22. Methods Mol Biol. 2013. PMID: 23756906 Review.
-
An overview of SNP interactions in genome-wide association studies.Brief Funct Genomics. 2015 Mar;14(2):143-55. doi: 10.1093/bfgp/elu036. Epub 2014 Sep 19. Brief Funct Genomics. 2015. PMID: 25241224 Review.
Cited by
-
Risk score modeling of multiple gene to gene interactions using aggregated-multifactor dimensionality reduction.BioData Min. 2013 Jan 8;6(1):1. doi: 10.1186/1756-0381-6-1. BioData Min. 2013. PMID: 23294634 Free PMC article.
-
Hypothesis-based analysis of gene-gene interactions and risk of myocardial infarction.PLoS One. 2012;7(8):e41730. doi: 10.1371/journal.pone.0041730. Epub 2012 Aug 2. PLoS One. 2012. PMID: 22876292 Free PMC article.
-
Detecting epistasis in human complex traits.Nat Rev Genet. 2014 Nov;15(11):722-33. doi: 10.1038/nrg3747. Epub 2014 Sep 9. Nat Rev Genet. 2014. PMID: 25200660 Review.
-
Risk estimation and risk prediction using machine-learning methods.Hum Genet. 2012 Oct;131(10):1639-54. doi: 10.1007/s00439-012-1194-y. Epub 2012 Jul 3. Hum Genet. 2012. PMID: 22752090 Free PMC article. Review.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources