Identifying interacting SNPs using Monte Carlo logic regression
- PMID: 15532037
- DOI: 10.1002/gepi.20042
Abstract
Interactions are frequently at the center of interest in single-nucleotide polymorphism (SNP) association studies. When interacting SNPs are in the same gene or in genes that are close in sequence, such interactions may suggest which haplotypes are associated with a disease. Interactions between unrelated SNPs may suggest genetic pathways. Unfortunately, data sets are often still too small to definitively determine whether interactions between SNPs occur. Also, competing sets of interactions could often be of equal interest. Here we propose Monte Carlo logic regression, an exploratory tool that combines Markov chain Monte Carlo and logic regression, an adaptive regression methodology that attempts to construct predictors as Boolean combinations of binary covariates such as SNPs. The goal of Monte Carlo logic regression is to generate a collection of (interactions of) SNPs that may be associated with a disease outcome, and that warrant further investigation. As such, the models that are fitted in the Markov chain are not combined into a single model, as is often done in Bayesian model averaging procedures. Instead, the most frequently occurring patterns in these models are tabulated. The method is applied to a study of heart disease with 779 participants and 89 SNPs. A simulation study is carried out to investigate the performance of the Monte Carlo logic regression approach.
Copyright 2004 Wiley-Liss, Inc.
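The tabulation step described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the Markov chain itself is omitted, the visited models are hand-written hypothetical data, and the definition of a "pattern" used here (a subset of SNPs appearing together in one logic tree of a model) is an assumption made for illustration. The names `visited_models` and `tabulate_patterns` are likewise hypothetical.

```python
from collections import Counter
from itertools import combinations

# Hypothetical chain output (an assumption, not data from the study):
# each visited model is a list of logic trees, and each tree is reduced
# here to the set of SNP names it uses.
visited_models = [
    [{"snp3", "snp7"}],
    [{"snp3", "snp7"}, {"snp12"}],
    [{"snp3", "snp7", "snp21"}],
    [{"snp12"}],
]


def tabulate_patterns(models, max_size=2):
    """Count, for every SNP subset of up to max_size SNPs, the number of
    models in which that subset appears together inside a single tree."""
    counts = Counter()
    for model in models:
        seen = set()  # each pattern is counted at most once per model
        for tree in model:
            for size in range(1, max_size + 1):
                for combo in combinations(sorted(tree), size):
                    seen.add(combo)
        counts.update(seen)
    return counts


pattern_counts = tabulate_patterns(visited_models)

# Report the most frequent patterns as a fraction of visited models.
for pattern, n in pattern_counts.most_common(5):
    print(pattern, n / len(visited_models))
```

Rather than averaging the visited models into one predictor, the sketch only ranks SNPs and SNP pairs by how often they recur, which mirrors the exploratory goal of flagging candidates for further investigation.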