The performance of different propensity-score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies

Peter C Austin¹

Affiliations

PMID: 20108233
PMCID: PMC3068290
DOI: 10.1002/sim.3854

Free PMC article

Comparative Study

The performance of different propensity-score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies

Peter C Austin. Stat Med. 2010.

Free PMC article

. 2010 Sep 10;29(20):2137-48.

doi: 10.1002/sim.3854.

Author

Peter C Austin¹

Affiliation

¹ Institute for Clinical Evaluative Sciences, Toronto, ON, Canada. peter.austin@ices.on.ca

PMID: 20108233
PMCID: PMC3068290
DOI: 10.1002/sim.3854

Abstract

Propensity score methods are increasingly being used to estimate the effects of treatments on health outcomes using observational data. There are four methods for using the propensity score to estimate treatment effects: covariate adjustment using the propensity score, stratification on the propensity score, propensity-score matching, and inverse probability of treatment weighting (IPTW) using the propensity score. When outcomes are binary, the effect of treatment on the outcome can be described using odds ratios, relative risks, risk differences, or the number needed to treat. Several clinical commentators suggested that risk differences and numbers needed to treat are more meaningful for clinical decision making than are odds ratios or relative risks. However, there is a paucity of information about the relative performance of the different propensity-score methods for estimating risk differences. We conducted a series of Monte Carlo simulations to examine this issue. We examined bias, variance estimation, coverage of confidence intervals, mean-squared error (MSE), and type I error rates. A doubly robust version of IPTW had superior performance compared with the other propensity-score methods. It resulted in unbiased estimation of risk differences, treatment effects with the lowest standard errors, confidence intervals with the correct coverage rates, and correct type I error rates. Stratification, matching on the propensity score, and covariate adjustment using the propensity score resulted in minor to modest bias in estimating risk differences. Estimators based on IPTW had lower MSE compared with other propensity-score methods. Differences between IPTW and propensity-score matching may reflect that these two methods estimate the average treatment effect and the average treatment effect for the treated, respectively.

PubMed Disclaimer

References

1. Schechtman E. Odds ratio, relative risk, absolute risk reduction, and the number needed to treat—which of these should we use? Value in Health. 2002;5:431–436. - PubMed
1. Cook RJ, Sackett DL. The number needed to treat: a clinically useful measure of treatment effect. British Medical Journal. 1995;310:452–454. - PMC - PubMed
1. Jaeschke R, Guyatt G, Shannon H, Walter S, Cook D, Heddle N. Basis statistics for clinicians 3: assessing the effects of treatment: measures of association. Canadian Medical Association Journal. 1995;152:351–357. - PMC - PubMed
1. Sinclair JC, Bracken MB. Clinically useful measures of effect in binary analyses of randomized trials. Journal of Clinical Epidemiology. 1994;47:881–889. - PubMed
1. Available from: http://resources.bmj.com/bmj/authors/types-of-article/research [13 November 2008]

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

Grants and funding

MOP 86508/Canadian Institutes of Health Research/Canada

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

The performance of different propensity-score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies

Affiliation

The performance of different propensity-score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies

Author

Affiliation

Abstract

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources