Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2010 Feb 15;171(4):498-505.
doi: 10.1093/aje/kwp398. Epub 2010 Jan 4.

A comparison of sample size and power in case-only association studies of gene-environment interaction

Affiliations
Comparative Study

A comparison of sample size and power in case-only association studies of gene-environment interaction

Geraldine M Clarke et al. Am J Epidemiol. .

Abstract

Assuming continuous, normally distributed environmental and categorical genotype variables, the authors compare 6 case-only designs for tests of association in gene-environment interaction. Novel tests modeling the environmental variable as either the response or the predictor and allowing a genetic variable with multiallelic variants are included. The authors show that tests imposing the same genotypic pattern of inheritance perform similarly regardless of whether genotype is the response variable or the predictor variable. The novel tests using the genetic variable as the response variable are advantageous because they are robust to non-normally distributed environmental exposures. Dominance deviance-deviation from additivity in the main or interaction effects-is key to test performance: When it is zero or modest, tests searching for a trend with increasing risk alleles are optimal; when it is large, tests for genotypic effects are optimal. However, the authors show that dominance deviance is attenuated when it is observed at a proxy locus, which is common in genome-wide association studies, so large dominance deviance is likely to be rare. The authors conclude that the trend test is the appropriate tool for large-scale association scans where the true gene-environment interaction model is unknown. The common practice of assuming a dominant pattern of inheritance can cause serious losses of power in the presence of any recessive, or modest dominant, effects.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Statistical power to detect gene-environment interaction for 1,000 case participants as a function of the dominance deviance δ for tests EGT (solid lines), EGD (dashed lines), EGG (dotted lines), and GTE (dashed-dotted lines) at a disease susceptibility locus G for various values of the high-risk allele frequency pA and interaction effect size bEG: A) pA = 0.1, bEG = log(1.1); B) pA = 0.1, bEG = log(1.2); C) pA = 0.1, bEG = log(1.3); D) pA = 0.25, bEG = log(1.1); E) pA = 0.25, bEG = log(1.2); F) pA = 0.25, bEG = log(1.3). Dominance deviance is defined as the deviation from additivity in the main or interaction effects: Here we assume that the dominance deviance in the main effect and the interaction effect are the same. Results are shown for the situation where there is no main environmental effect, bE = 0, and the main genetic effect is bG = log(1.3); variation in the sizes of main environmental or genetic effects resulted in only minor variations in power and not did not alter conclusions regarding the relative efficacies of the tests considered. The environmental exposure variable was normally distributed and standardized.
Figure 2.
Figure 2.
Statistical power to detect gene-environment interaction for 1,000 case participants as a function of the dominance deviance δ for tests EGT (solid lines), EGD (dashed lines), EGG (dotted lines), and GTE (dashed-dotted lines) at a noncausal locus H with minor allele frequency pB = 0.2 for various values of the minor allele frequency pA and interaction effect size bEG at the disease susceptibility locus G: A) pA = 0.1, bEG = log(1.1); B) pA = 0.1, bEG = log(1.2); C) pA = 0.1, bEG = log(1.3); D) pA = 0.25, bEG = log(1.1); E) pA = 0.25, bEG = log(1.2); F) pA = 0.25, bEG = log(1.3). The alleles at H are in maximum linkage disequilibrium (r2) with the alleles at G: when pA = 0.1, r2 = 0.44; when pA = 0.25, r2 = 0.75; and when pA = 0.4, r2 = 0.37. Dominance deviance is defined as the deviation from additivity in the main or interaction effects: Here we assume that the dominance deviance in the main effect and the interaction effect are the same. Results are shown for the situation where there is no main environmental effect, bE = 0, and the main genetic effect is bG = log(1.3); variation in the sizes of main environmental or genetic effects resulted in only minor variations in power and not did not alter conclusions regarding the relative efficacies of the tests considered. The environmental exposure variable was normally distributed and standardized.

Comment in

Similar articles

Cited by

References

    1. Khoury MJ, Flanders WD. Nontraditional epidemiologic approaches in the analysis of gene-environment interaction: case-control studies with no controls! Am J Epidemiol. 1996;144(3):207–213. - PubMed
    1. Piegorsch WW, Weinberg CR, Taylor JA. Non-hierarchical logistic models and case-only designs for assessing susceptibility in population-based case-control studies. Stat Med. 1994;13(2):153–162. - PubMed
    1. Begg CB, Zhang ZF. Statistical analysis of molecular epidemiology studies employing case-series. Cancer Epidemiol Biomarkers Prev. 1994;3(2):173–175. - PubMed
    1. Umbach DM, Weinberg CR. Designing and analysing case-control studies to exploit independence of genotype and exposure. Stat Med. 1997;16(15):1731–1743. - PubMed
    1. Yang Q, Khoury MJ, Flanders WD. Sample size requirements in case-only designs to detect gene-environment interaction. Am J Epidemiol. 1997;146(9):713–720. - PubMed

Publication types