Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Sep;66(3):934-48.
doi: 10.1111/j.1541-0420.2009.01357.x.

Case-control studies of gene-environment interaction: Bayesian design and analysis

Affiliations

Case-control studies of gene-environment interaction: Bayesian design and analysis

Bhramar Mukherjee et al. Biometrics. 2010 Sep.

Abstract

With increasing frequency, epidemiologic studies are addressing hypotheses regarding gene-environment interaction. In many well-studied candidate genes and for standard dietary and behavioral epidemiologic exposures, there is often substantial prior information available that may be used to analyze current data as well as for designing a new study. In this article, first, we propose a proper full Bayesian approach for analyzing studies of gene-environment interaction. The Bayesian approach provides a natural way to incorporate uncertainties around the assumption of gene-environment independence, often used in such an analysis. We then consider Bayesian sample size determination criteria for both estimation and hypothesis testing regarding the multiplicative gene-environment interaction parameter. We illustrate our proposed methods using data from a large ongoing case-control study of colorectal cancer investigating the interaction of N-acetyl transferase type 2 (NAT2) with smoking and red meat consumption. We use the existing data to elicit a design prior and show how to use this information in allocating cases and controls in planning a future study that investigates the same interaction parameters. The Bayesian design and analysis strategies are compared with their corresponding frequentist counterparts.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Posterior density corresponding to NAT2*RMGF (top) and NAT2*SMOKE (bottom) interaction log odds-ratio under three different levels of uncertainty around gene-environment independence. In each case, the Dirichlet prior parameter on p1 is kept fixed at α1 = (5, 5, 5, 5) and the prior parameter on p0, namely, β0 is varied at (5,5,5,5), (20,20,20,20) and (80,80,80,80) corresponding to the three posteriors denoted by s0 set at 20, 80 and 320 in the above figure. The arrows on the horizontal axis mark the values of the case-control (CC), the case-only (CO) and the empirical Bayes (EB) estimate. The corresponding numerical results are collected in Table 2.
Figure 2
Figure 2
A graphical display of LPC and ALC criterion, plotted against each candidate n for estimating NAT2*RMGF and NAT2*SMOKE interaction log odds-ratio parameter. Optimal allocation of cases and controls for each candidate n is as determined in Web Table 5 by using equation (5). In each case we consider a 95% HPD credible interval. The Dirichlet prior parameters for each strength are chosen as given in Web Table 2, based on data from MECC study. We chose the threshold values as l = 0.8 and δ = 0.05, where the horizontal lines on each graph are drawn. The n at which the graph first exceeds the horizontal line is the desired optimal sample size. Corresponding numerical results are collected in Table 4.

Similar articles

Cited by

References

    1. Adcock CJ. Sample size determination: a review. The Statistician. 1997;46:261–283.
    1. Albert PS, Ratnasinghe D, Tangrea J, Wacholder S. Limitations of the case-only design for identifying gene-environment interactions. American Journal of Epidemiology. 2001;154:687–93. - PubMed
    1. Barrett JH, Smith G, Waxman R, et al. Investigation of interaction between N-acetyltransferase 2 and heterocyclic amines as potential risk factors for colorectalcancer. Carcinogenesis. 2003;24:275–282. - PubMed
    1. Bernardo JM. Statistical inference as a decision problem: the choice of sample size. Statistician. 1997;46:151–153.
    1. Bernardo JM. Bayeisan Theory. Wiley; Chichester: 1994.

Publication types

Substances