2007 Jun 19:4:4.
doi: 10.1186/1742-5573-4-4.

Power for tests of interaction: effect of raising the Type I error rate

Stephen W Marshall. Epidemiol Perspect Innov.

Abstract

Background: Power for assessing interactions during data analysis is often poor in epidemiologic studies. This is because epidemiologic studies are frequently powered primarily to assess main effects only. In light of this, some investigators raise the Type I error rate, thereby increasing power, when testing interactions. However, this is a poor analysis strategy if the study is chronically under-powered (e.g. in a small study) or already adequately powered (e.g. in a very large study). To demonstrate this point, this study quantified the gain in power for testing interactions when the Type I error rate is raised, for a variety of study sizes and types of interaction.

Methods: Power was computed for the Wald test for interaction, the likelihood ratio test for interaction, and the Breslow-Day test for heterogeneity of the odds ratio. Ten types of interaction, ranging from sub-additive through to super-multiplicative, were investigated in the simple scenario of two binary risk factors. Case-control studies of various sizes were investigated (75 cases & 150 controls, 300 cases & 600 controls, and 1200 cases & 2400 controls).
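
To make the role of the Type I error rate concrete, the following is a minimal sketch of an approximate power calculation for a two-sided Wald test of multiplicative interaction. It is not the author's computation. It assumes a saturated model for two binary factors, so that the standard error of the log interaction odds ratio is the square root of the summed reciprocal cell counts, and it uses a normal approximation. The cell counts are hypothetical, chosen only to total 75 cases and 150 controls; the function names are illustrative.

```python
from math import log, sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal distribution


def log_interaction_or(cases, controls):
    """Log of OR11 / (OR10 * OR01); cells ordered [00, 10, 01, 11]."""
    a00, a10, a01, a11 = cases
    b00, b10, b01, b11 = controls
    log_or10 = log(a10 * b00 / (a00 * b10))
    log_or01 = log(a01 * b00 / (a00 * b01))
    log_or11 = log(a11 * b00 / (a00 * b11))
    return log_or11 - log_or10 - log_or01


def interaction_se(cases, controls):
    """Approximate SE: square root of the sum of reciprocal cell counts."""
    return sqrt(sum(1.0 / n for n in list(cases) + list(controls)))


def wald_power(beta, se, alpha):
    """Approximate power of a two-sided Wald test at significance level alpha."""
    crit = STD_NORMAL.inv_cdf(1.0 - alpha / 2.0)
    shift = abs(beta) / se
    return STD_NORMAL.cdf(shift - crit) + STD_NORMAL.cdf(-shift - crit)


# Hypothetical expected cell counts (assumption, not taken from the paper).
cases = [30, 15, 15, 15]      # 75 cases
controls = [80, 30, 30, 10]   # 150 controls

beta = log_interaction_or(cases, controls)
se = interaction_se(cases, controls)

for alpha in (0.05, 0.20):
    print(f"alpha={alpha:.2f}  approx. power={wald_power(beta, se, alpha):.3f}")
```

Under the null (beta = 0) the function returns alpha itself, which is a quick sanity check that raising the Type I error rate buys power only at the direct cost of more false positives.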

Results: The strategy of raising the Type I error rate from 5% to 20% resulted in a useful power gain (a gain of at least 10%, resulting in power of at least 70%) in only 7 of the 27 interaction type/study size scenarios studied (26%). In the other 20 scenarios, power was either already adequate (n = 8; 30%), or else so low that it was still weak (below 70%) even after raising the Type I error rate to 20% (n = 12; 44%).
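
The three outcome categories above can be sketched as a small classifier. This is one reading of the criteria stated in the abstract, not the author's code: a scenario counts as a "useful gain" when power at 20% is at least 70% and the gain over 5% is at least 10 percentage points; it is "still weak" when power at 20% is below 70%; the remainder is treated here as "already adequate", which is an assumption. The example power values are hypothetical; the tallies are the counts reported above.

```python
def classify_scenario(power_at_05, power_at_20, min_gain=0.10, min_power=0.70):
    """Classify one scenario by the power achieved at 5% and 20% Type I error."""
    if power_at_20 < min_power:
        return "still weak"
    if power_at_20 - power_at_05 >= min_gain:
        return "useful gain"
    # Assumption: power is adequate and was not improved by a useful margin.
    return "already adequate"


# Hypothetical power pairs, for illustration only.
print(classify_scenario(0.55, 0.78))
print(classify_scenario(0.20, 0.45))
print(classify_scenario(0.95, 0.99))

# Counts reported in the abstract, with percentages of the 27 scenarios.
tallies = {"useful gain": 7, "already adequate": 8, "still weak": 12}
total = sum(tallies.values())
for label, n in tallies.items():
    print(f"{label}: {n}/{total} ({round(100 * n / total)}%)")
```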

Conclusion: Relaxing the Type I error rate did not usefully improve the power for tests of interaction in many of the scenarios studied. In many studies, the small power gains obtained by raising the Type I error rate will be more than offset by the disadvantage of increased "false positives". I recommend that investigators not routinely raise the Type I error rate when assessing tests of interaction.

Figures

Figure 1. Population Odds Ratios for Ten Hypothetical Interaction Scenarios (based on Greenland, 1983).

References

    1. Greenland S, Rothman KJ. Chapter 18: Concepts of Interaction. In: Rothman KJ, Greenland S, editors. Modern Epidemiology. 2nd ed. New York, NY: Lippincott-Raven; 1998. pp. 329–342.
    2. Breslow NE, Day NE. Statistical Methods in Cancer Research; Volume II: The Design and Analysis of Cohort Studies. Lyon: IARC; 1987. pp. 1–406. (section 7.10).
    3. Greenland S. Tests for interaction in epidemiologic studies: a review and a study of power. Stat Med. 1983;2:243–251. doi: 10.1002/sim.4780020219.
    4. Selvin S. Statistical Analysis of Epidemiologic Data. New York, NY: Oxford University Press; 1996. pp. 213–214.
    5. Rothman KJ. A show of confidence. N Engl J Med. 1978;299:1362–1363.
