Randomized Controlled Trial

. 2021 Nov 30;40(27):6150-6163.

doi: 10.1002/sim.9176. Epub 2021 Sep 12.

Informing power and sample size calculations when using inverse probability of treatment weighting using the propensity score

Peter C Austin^{1

2

3}

Affiliations

¹ ICES, Toronto, Ontario, Canada.
² Institute of Health Management, Policy and Evaluation, University of Toronto, Ontario, Canada.
³ Sunnybrook Research Institute, Toronto, Ontario, Canada.

PMID: 34510501
PMCID: PMC9293235
DOI: 10.1002/sim.9176

Randomized Controlled Trial

Informing power and sample size calculations when using inverse probability of treatment weighting using the propensity score

Peter C Austin. Stat Med. 2021.

. 2021 Nov 30;40(27):6150-6163.

doi: 10.1002/sim.9176. Epub 2021 Sep 12.

Author

Peter C Austin^{1

2

3}

Affiliations

¹ ICES, Toronto, Ontario, Canada.
² Institute of Health Management, Policy and Evaluation, University of Toronto, Ontario, Canada.
³ Sunnybrook Research Institute, Toronto, Ontario, Canada.

PMID: 34510501
PMCID: PMC9293235
DOI: 10.1002/sim.9176

Abstract

Propensity score weighting is increasingly being used in observational studies to estimate the effects of treatments. The use of such weights induces a within-person homogeneity in outcomes that must be accounted for when estimating the variance of the estimated treatment effect. Knowledge of the variance inflation factor (VIF), which describes the extent to which the effective sample size has been reduced by weighting, allows for conducting sample size and power calculations for observational studies that use propensity score weighting. However, estimation of the VIF requires knowledge of the weights, which are only known once the study has been conducted. We describe methods to estimate the VIF based on two characteristics of the observational study: the anticipated prevalence of treatment and the anticipated c-statistic of the propensity score model. We considered five different sets of weights: those for estimating the average treatment effect (ATE), the average treated effect in the treated (ATT), and three recently described sets of weights: overlap weights, matching weights, and entropy weights. The VIF was substantially smaller for the latter three sets of weights than for the first two sets of weights. Once the VIF has been estimated during the design phase of the study, sample size and power calculations can be done using calculations appropriate for a randomized controlled trial with similar prevalence of treatment and similar outcome variable, and then multiplying the requisite sample size by the estimated VIF. Implementation of these methods allows for improving the design and reporting of observational studies that use propensity score weighting.

Keywords: inverse probability of treatment weighting; power; propensity score; sample size; study design.

PubMed Disclaimer

Figures

**FIGURE 1**
Distribution of the propensity score in treated and control subjects (primary simulations) [Colour figure can be viewed at wileyonlinelibrary.com]

**FIGURE 2**
Variance inflation factors/design effects for main simulations [Colour figure can be viewed at wileyonlinelibrary.com]

**FIGURE 3**
Comparing estimated and true variance inflation factor/design effect: normal distribution [Colour figure can be viewed at wileyonlinelibrary.com]

**FIGURE 4**
Comparing estimated and true VIF/DE: Beta distribution [Colour figure can be viewed at wileyonlinelibrary.com]

**FIGURE 5**
Comparing estimated and true VIF/DE: Chi‐squared distribution [Colour figure can be viewed at wileyonlinelibrary.com]

**FIGURE 6**
Comparing estimated and true VIF/DE: log‐normal distribution [Colour figure can be viewed at wileyonlinelibrary.com]

See this image and copyright information in PMC

References

1. Dorn HF. Philosophy of inference from retrospective studies. Am J Public Health. 1953;43:692‐699. - PMC - PubMed
1. Rubin DB. Matched Sampling for Causal Effects. New York, NY: Cambridge University Press; 2006.
1. Hernan MA, Robins JM. Using big data to emulate a target trial when a randomized trial is not available. Am J Epidemiol. 2016;183(8):758‐764. - PMC - PubMed
1. Hoenig JM, Heisey DM. The abuse of power: the pervasive fallacy of power calculations for data analysis. Am Stat. 2001;55(1):19‐24.
1. Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70:41‐55.

Publication types

Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions

Grants and funding

PJT 166161/CIHR/Canada

LinkOut - more resources

Full Text Sources
- Europe PubMed Central
- PubMed Central
Medical
- ClinicalTrials.gov
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Informing power and sample size calculations when using inverse probability of treatment weighting using the propensity score

Affiliations

Informing power and sample size calculations when using inverse probability of treatment weighting using the propensity score

Author

Affiliations

Abstract

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical

Research Materials