Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Jan;75(1):78-89.
doi: 10.1111/j.1469-1809.2010.00604.x. Epub 2010 Sep 8.

Model-based multifactor dimensionality reduction for detecting epistasis in case-control data in the presence of noise

Affiliations

Model-based multifactor dimensionality reduction for detecting epistasis in case-control data in the presence of noise

Tom Cattaert et al. Ann Hum Genet. 2011 Jan.

Abstract

Analyzing the combined effects of genes and/or environmental factors on the development of complex diseases is a great challenge from both the statistical and computational perspective, even using a relatively small number of genetic and nongenetic exposures. Several data-mining methods have been proposed for interaction analysis, among them, the Multifactor Dimensionality Reduction Method (MDR) has proven its utility in a variety of theoretical and practical settings. Model-Based Multifactor Dimensionality Reduction (MB-MDR), a relatively new MDR-based technique that is able to unify the best of both nonparametric and parametric worlds, was developed to address some of the remaining concerns that go along with an MDR analysis. These include the restriction to univariate, dichotomous traits, the absence of flexible ways to adjust for lower order effects and important confounders, and the difficulty in highlighting epistatic effects when too many multilocus genotype cells are pooled into two new genotype groups. We investigate the empirical power of MB-MDR to detect gene-gene interactions in the absence of any noise and in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity. Power is generally higher for MB-MDR than for MDR, in particular in the presence of genetic heterogeneity, phenocopy, or low minor allele frequencies.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Graphical overview of major MB-MDR steps
Figure 2
Figure 2. Penetrance functions of simulated data of (Ritchie et al., 2003)
Multilocus penetrance functions and MAFs used to simulate case-control data exhibiting gene-gene interactions in the absence of main effects.
Figure 3
Figure 3. MB-MDR and MDR power with different sources of noise, excluding genetic heterogeneity
The 6 plots display MB-MDR power estimates to identify the correct interacting pair for models 1-6, for different p-value cut-offs pc = 0.05,0.1,0.2,0.5 and 1. The color coding is as follows: error-free data (black), data with induced missingness (red), genotyping errors (green) and phenocopy (blue). The line types refer to the different MB-MDR testing strategies used: T = |TH/L| (solid line), max (|TH/LO|,|TL/HO|) (dashed line) and max (|TH/L|,|TH/LO|,|TL/HO|) (dot-dashed line). MDR power estimates of screening over 1-5 order models are also shown (bullets at pc=1).
Figure 4
Figure 4. MB-MDR and MDR power in the presence of genetic heterogeneity
The 6 plots display MB-MDR power estimates for models 1-6, for different p-value cut-offs Pc = 0.05,0.1,0.2,0.5 and 1. The color coding is as follows: power to identify both interacting pairs (black), the first interacting pair (red), and at least one of the interacting pairs (green). The line types refer to the different MB-MDR testing strategies used: T = |TH/L| (solid line), max (|TH/LO|,|TL/HO|) (dashed line) and max (|TH/L|,|TH/LO|,|TL/HO|) (dot-dashed line). MDR power estimates of screening over 1-5 order models are also shown (bullets at Pc=1).

References

    1. Altshuler D, Daly MJ, Lander ES. Genetic mapping in human disease. Science. 2008;322:881–8. - PMC - PubMed
    1. Bellman RE. Adaptive control processes: A guided tour. Princeton: Princeton University Press; 1961.
    1. Calle ML, Urrea V, Vellalta G, Malats N, Van Steen K. Improving strategies for detecting genetic patterns of disease susceptibility in association studies. Stat Med. 2008a;27:6532–46. - PubMed
    1. Calle ML, Urrea V, Vellalta G, Malats N, Van Steen K. Model-Based Multifactor Dimensionality Reduction for detecting interactions in high-dimensional genomic data. U. O. V. Department of Systems Biology (ed) 2008b
    1. Cattaert T, Urrea V, Naj AC, De Lobel L, De Wit V, Fu M, Mahachie John JM, Shen H, Calle ML, Ritchie MD, Edwards TL, Van Steen K. FAM-MDR: a flexible family-base multifactor dimensionality reduction technique to detect epistasis using related individuals. Public Library of Science ONE 2010 - PMC - PubMed

Publication types