A modified entropy-based approach for identifying gene-gene interactions in case-control study
- PMID: 23874943
- PMCID: PMC3715501
- DOI: 10.1371/journal.pone.0069321
A modified entropy-based approach for identifying gene-gene interactions in case-control study
Abstract
Gene-gene interactions may play an important role in the genetics of a complex disease. Detection and characterization of gene-gene interactions is a challenging issue that has stimulated the development of various statistical methods to address it. In this study, we introduce a method to measure gene interactions using entropy-based statistics from a contingency table of trait and genotype combinations. We also developed an exploration procedure by using graphs. We propose a standardized relative information gain (RIG) measure to evaluate the interactions between single nucleotide polymorphism (SNP) combinations. To identify the k (th) order interactions, contingency tables of trait and genotype combinations of k SNPs are constructed, with which RIGs are calculated. The RIGs are standardized using the mean and standard deviation from the permuted datasets. SNP combinations yielding high standardized RIG are chosen for gene-gene interactions. Detection of high-order interactions and comparison of interaction strengths between different orders are made possible by using standardized RIG. We have applied the proposed standardized entropy-based method to two types of data sets from a simulation study and a real genetic association study. We have compared our method and the multifactor dimensionality reduction (MDR) method through power analysis of eight different genetic models with varying penetrance rates, number of SNPs, and sample sizes. Our method shows successful identification of genetic associations and gene-gene interactions both in simulation and real genetic data. Simulation results suggest that the proposed entropy-based method is better able to detect high-order interactions and is superior to the MDR method in most cases. The proposed method is well suited for detecting interactions without main effects as well as for models including main effects.
Conflict of interest statement
Figures




















Similar articles
-
Comparative analysis of methods for detecting interacting loci.BMC Genomics. 2011 Jul 5;12:344. doi: 10.1186/1471-2164-12-344. BMC Genomics. 2011. PMID: 21729295 Free PMC article.
-
A novel survival multifactor dimensionality reduction method for detecting gene-gene interactions with application to bladder cancer prognosis.Hum Genet. 2011 Jan;129(1):101-10. doi: 10.1007/s00439-010-0905-5. Epub 2010 Oct 28. Hum Genet. 2011. PMID: 20981448 Free PMC article.
-
Multiobjective differential evolution-based multifactor dimensionality reduction for detecting gene-gene interactions.Sci Rep. 2017 Oct 9;7(1):12869. doi: 10.1038/s41598-017-12773-x. Sci Rep. 2017. PMID: 28993686 Free PMC article.
-
Epistasis, complexity, and multifactor dimensionality reduction.Methods Mol Biol. 2013;1019:465-77. doi: 10.1007/978-1-62703-447-0_22. Methods Mol Biol. 2013. PMID: 23756906 Review.
-
Multifactor dimensionality reduction: an analysis strategy for modelling and detecting gene-gene interactions in human genetics and pharmacogenomics studies.Hum Genomics. 2006 Mar;2(5):318-28. doi: 10.1186/1479-7364-2-5-318. Hum Genomics. 2006. PMID: 16595076 Free PMC article. Review.
Cited by
-
Germline Variants and Genetic Interactions of Several EMT Regulatory Genes Increase the Risk of HBV-Related Hepatocellular Carcinoma.Front Oncol. 2021 Jun 11;11:564477. doi: 10.3389/fonc.2021.564477. eCollection 2021. Front Oncol. 2021. PMID: 34178612 Free PMC article.
-
Combinations of genetic data in a study of oral cancer.Genes Cancer. 2015 Sep;6(9-10):422-7. doi: 10.18632/genesandcancer.79. Genes Cancer. 2015. PMID: 26622944 Free PMC article.
-
Optimized permutation testing for information theoretic measures of multi-gene interactions.BMC Bioinformatics. 2021 Apr 7;22(1):180. doi: 10.1186/s12859-021-04107-6. BMC Bioinformatics. 2021. PMID: 33827420 Free PMC article.
-
Transferring entropy to the realm of GxG interactions.Brief Bioinform. 2018 Jan 1;19(1):136-147. doi: 10.1093/bib/bbw086. Brief Bioinform. 2018. PMID: 27769993 Free PMC article.
-
EFMDR-Fast: An Application of Empirical Fuzzy Multifactor Dimensionality Reduction for Fast Execution.Genomics Inform. 2018 Dec;16(4):e37. doi: 10.5808/GI.2018.16.4.e37. Epub 2018 Dec 28. Genomics Inform. 2018. PMID: 30602098 Free PMC article.
References
-
- Zhang H, Bonney G (2000) Use of classification trees for association studies. Genet. Epidemiol. 19: 323–332. - PubMed
-
- Sheriff A, Ott J (2001) Applications of neural networks for gene finding. Adv. Genet. 42 287–297. - PubMed
-
- Kooperberg C, Ruczinski I (2005) Identifying interacting SNPs using Monte Carlo logic regression. Genet. Epidemiol. 28: 157–170. - PubMed
-
- Hosmer DW, Lemeshow D (2000) Applied logistic regression, 2nd edn. New York: John Wiley and Sons.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources