Randomized Controlled Trial
Stat Med. 2021 Nov 30;40(27):6150-6163.
doi: 10.1002/sim.9176. Epub 2021 Sep 12.

Informing power and sample size calculations when using inverse probability of treatment weighting using the propensity score

Peter C Austin. Stat Med.

Abstract

Propensity score weighting is increasingly being used in observational studies to estimate the effects of treatments. The use of such weights induces a within-person homogeneity in outcomes that must be accounted for when estimating the variance of the estimated treatment effect. Knowledge of the variance inflation factor (VIF), which describes the extent to which the effective sample size has been reduced by weighting, allows for conducting sample size and power calculations for observational studies that use propensity score weighting. However, estimation of the VIF requires knowledge of the weights, which are only known once the study has been conducted. We describe methods to estimate the VIF based on two characteristics of the observational study: the anticipated prevalence of treatment and the anticipated c-statistic of the propensity score model. We considered five different sets of weights: those for estimating the average treatment effect (ATE), the average treatment effect in the treated (ATT), and three recently described sets of weights: overlap weights, matching weights, and entropy weights. The VIF was substantially smaller for the latter three sets of weights than for the first two sets of weights. Once the VIF has been estimated during the design phase of the study, sample size and power calculations can be done using calculations appropriate for a randomized controlled trial with similar prevalence of treatment and similar outcome variable, and then multiplying the requisite sample size by the estimated VIF. Implementation of these methods allows for improving the design and reporting of observational studies that use propensity score weighting.
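
The final step described in the abstract (scale an RCT-style sample size by the VIF) can be illustrated with a minimal Python sketch. It rests on assumptions that are not taken from the paper: the VIF is approximated by Kish's design effect, n·Σw²/(Σw)², computed over the pooled sample; the five weight definitions follow their commonly published forms; and the function names (`ps_weight`, `kish_design_effect`, `inflated_sample_size`) and the example cohort are hypothetical. The paper's own estimator, which works from the anticipated treatment prevalence and c-statistic, may differ.

```python
import math


def ps_weight(z, e, kind):
    """Weight for one subject with treatment z (0 or 1) and propensity score e.

    Uses the commonly published definitions of each weight type
    (an assumption; not copied from the paper).
    """
    # Probability of the treatment the subject actually received.
    p_received = e if z == 1 else 1.0 - e
    if kind == "ATE":
        return 1.0 / p_received
    if kind == "ATT":
        return 1.0 if z == 1 else e / (1.0 - e)
    if kind == "overlap":
        return 1.0 - p_received
    if kind == "matching":
        return min(e, 1.0 - e) / p_received
    if kind == "entropy":
        h = -(e * math.log(e) + (1.0 - e) * math.log(1.0 - e))
        return h / p_received
    raise ValueError(f"unknown weight type: {kind}")


def kish_design_effect(weights):
    """Kish's approximate design effect: n * sum(w^2) / (sum(w))^2.

    Used here as a stand-in for the VIF (an assumption of this sketch).
    """
    n = len(weights)
    total = sum(weights)
    total_sq = sum(w * w for w in weights)
    return n * total_sq / (total * total)


def inflated_sample_size(n_rct, vif):
    """Multiply an RCT-style sample size by the VIF and round up."""
    return math.ceil(n_rct * vif)


if __name__ == "__main__":
    # Hypothetical (treatment, propensity score) pairs, for illustration only.
    cohort = [(1, 0.8), (1, 0.6), (1, 0.3), (0, 0.7), (0, 0.4), (0, 0.2), (0, 0.1)]
    n_rct = 400  # hypothetical sample size from a standard RCT calculation
    for kind in ("ATE", "ATT", "overlap", "matching", "entropy"):
        w = [ps_weight(z, e, kind) for z, e in cohort]
        vif = kish_design_effect(w)
        print(kind, round(vif, 3), inflated_sample_size(n_rct, vif))
```

With equal weights the design effect is exactly 1, so the sample size is unchanged; the more variable the weights, the larger the inflation.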

Keywords: inverse probability of treatment weighting; power; propensity score; sample size; study design.

Figures

FIGURE 1: Distribution of the propensity score in treated and control subjects (primary simulations)
FIGURE 2: Variance inflation factors/design effects for main simulations
FIGURE 3: Comparing estimated and true variance inflation factor/design effect: normal distribution
FIGURE 4: Comparing estimated and true VIF/DE: Beta distribution
FIGURE 5: Comparing estimated and true VIF/DE: Chi-squared distribution
FIGURE 6: Comparing estimated and true VIF/DE: log-normal distribution
