Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Jan;75(1):90-104.
doi: 10.1111/j.1469-1809.2010.00605.x. Epub 2010 Sep 15.

Bayesian analysis of genetic interactions in case-control studies, with application to adiponectin genes and colorectal cancer risk

Affiliations

Bayesian analysis of genetic interactions in case-control studies, with application to adiponectin genes and colorectal cancer risk

Nengjun Yi et al. Ann Hum Genet. 2011 Jan.

Abstract

Complex diseases such as cancers are influenced by interacting networks of genetic and environmental factors. However, a joint analysis of multiple genes and environmental factors is challenging, owing to potentially large numbers of correlated and complex variables. We describe Bayesian generalized linear models for simultaneously analyzing covariates, main effects of numerous loci, gene-gene and gene-environment interactions in population case-control studies. Our Bayesian models use Student-t prior distributions with different shrinkage parameters for different types of effects, allowing reliable estimates of main effects and interactions and hence increasing the power for detection of real signals. We implement a fast and stable algorithm for fitting models by extending available tools for classical generalized linear models to the Bayesian case. We propose a novel method to interpret and visualize models with multiple interactions by computing the average predictive probability. Simulations show that the method has the potential to dissect interacting networks of complex diseases. Application of the method to a large case-control study of adiponectin genes and colorectal cancer risk highlights the previous results and detects new epistatic interactions and sex-specific effects that warrant follow-up in independent studies.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Analysis I: jointly fitting age, sex, all main effects of the ten SNPs and sex-gene interactions with three link functions, logit (left), probit (middle) and cloglog (right). The notation for main effects, a and d, indicate additive and dominance effects, respectively. The term X1.X2 represents interaction between X1 and X2. Estimated effects of age and sex are not displayed. The points, short lines and numbers at the right side represent estimates of effects, ± 2 standard errors, and p-values, respectively. The deviance (Dev) and Akaike information criterion (AIC) under each model are also shown.
Figure 2
Figure 2
Analysis II: jointly fitting age, sex, all main effects of the ten SNPs, sex-gene and epistatic interactions with three link functions, logit (left), probit (middle) and cloglog (right). The notation for main effects, a and d, indicate additive and dominance effects, respectively. The term X1.X2 represents interaction between X1 and X2. Estimated effects of age and sex are not displayed. Only epistatic interactions with p-value smaller than 0.05 are displayed. The points, short lines and numbers at the right side represent estimates of effects, ± 2 standard errors, and p-values, respectively. The deviance (Dev) and Akaike information criterion (AIC) under each model are also shown.
Figure 3
Figure 3
Average predictive probability for a) each genotype, b–c) each combination of sex and genotypes of SNPs rs1273385 and rs266729, and d–f) each two-locus genotype at SNPs that show significant interactions. The genotypes c, h, and r represent common homozygote, heterozygote, and rare homozygote, and the notation M and F represent male and female, respectively. The vertical (a) and horizontal (b–f) dotted gray line represents the mean of probabilities.
Figure 4
Figure 4
Simulation I: jointly fitting age, sex, all main effects of the ten SNPs and sex-gene interactions using the proposed priors. The left panel shows the frequency of each effect estimated with p-value smaller than 0.05 over 1000 replicates with three link functions, logit (circle), probit (square) and cloglog (diamond). The right panel shows the assumed values (circle), the estimated means (point) and the 95% intervals (gray line) with the logit link function. Only effects with non-zero simulated value are labeled. The notation for main effects, a and d, indicate additive and dominance effects, respectively. The term X1.X2 represents interaction between X1 and X2.
Figure 5
Figure 5
Comparison with existing models (I): jointly fitting age, sex, all main effects of the ten SNPs and sex-gene interactions using different priors. Frequency of each effect estimated with p-value smaller than 0.05 over 1000 replicates using 1) uniform prior (νj, sj) = (∞, ∞) (□), 2) normal prior (νj, sj) = (∞, 2.5) (◊), and 3) Jeffreys’ prior (νj, sj) = (0, 0) (∇), for all effects. The points (•) represent the analysis using the proposed priors. Only effects with non-zero simulated value are labeled. The notation for main effects, a and d, indicate additive and dominance effects, respectively. The term X1.X2 represents interaction between X1 and X2.
Figure 6
Figure 6
Simulation II: jointly fitting age, sex, all main effects of the ten SNPs, sex-gene and epistatic interactions using the proposed priors. The left panel shows the frequency of each effect estimated with p-value smaller than 0.05 over 1000 replicates with three link functions, logit (circle), probit (square) and cloglog (diamond). The right panel shows the assumed values (circle), the estimated means (point) and the 95% intervals (gray line) with the logit link function. Only effects with non-zero simulated value are labeled. The notation for main effects, a and d, indicate additive and dominance effects, respectively. The term X1.X2 represents interaction between X1 and X2. Only 15 epistatic interactions with the smallest p-values are displayed.
Figure 7
Figure 7
Comparison with existing models (II): jointly fitting age, sex, all main effects of the ten SNPs, sex-gene and epistatic interactions using different priors. Frequency of each effect estimated with p-value smaller than 0.05 over 1000 replicates using 1) normal prior (νj, sj) = (∞, 2.5) (□), 2) t prior (νj, sj) = (1, 2.5) (◊), 3) t prior (νj, sj) = (1, 0.27) (Δ), and 4) Jeffreys’ prior (νj, sj) = (0, 0) (∇), for all effects. The points (•) represent the analysis using the proposed priors. Only effects with non-zero simulated value are labeled. The notation for main effects, a and d, indicate additive and dominance effects, respectively. The term X1.X2 represents interaction between X1 and X2. Only 15 epistatic interactions with the smallest p-values are displayed.
Figure 8
Figure 8
Receiver operating characteristic (ROC) curves for risk prediction with four models simultaneously fitting: 1) age and sex (gray solid), 2) age, sex, and main effects of SNPs (gray dotted), 3) age, sex, main effects of SNPs, and sex-gene (black solid), and 3) age, sex, main effects of SNPs, sex-gene and epistatic interactions (black dotted). The areas under the ROC curves (AUC) for these four models are 0.79, 0.81, 0.82, and 0.87, respectively.

Similar articles

Cited by

References

    1. Bao K, Mallick BK. Gene selection using a two-level hierarchical Bayesian model. Bioinformatics. 2004;20:3423–3430. - PubMed
    1. Berrington de Gonza´ lez A, Cox D. Interpretation of interaction: A review. Ann Appl Stat. 2007;1:371–385.
    1. Bjørnvold M, Undlien DE, Joner G, Dahl-Jørgensen K, Njøstad PR, et al. Joint effects of HLA, INS, PTPN22 and CTLA4 genes on the risk of type 1 diabetes. Diabetologia. 2008;51:589–596. - PMC - PubMed
    1. Cordell HJ. Detecting gene-gene interactions that underlies human diseases. Nature Review Genetics. 2009;10:392–404. - PMC - PubMed
    1. Cordell HJ. Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans. Hum Mol Genet. 2002;11(20):2463–2468. - PubMed

Publication types