. 2009 Nov 10;28(25):3083-107.

doi: 10.1002/sim.3697.

Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity-score matched samples

Peter C Austin¹

Affiliations

PMID: 19757444
PMCID: PMC3472075
DOI: 10.1002/sim.3697

Free PMC article

Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity-score matched samples

Peter C Austin. Stat Med. 2009.

Free PMC article

. 2009 Nov 10;28(25):3083-107.

doi: 10.1002/sim.3697.

Author

Peter C Austin¹

Affiliation

¹ Institute for Clinical Evaluative Sciences, G1 06, 2075 Bayview Avenue, Toronto, Ontario, Canada M4N 3M5. peter.austin@ices.on.ca

PMID: 19757444
PMCID: PMC3472075
DOI: 10.1002/sim.3697

Abstract

The propensity score is a subject's probability of treatment, conditional on observed baseline covariates. Conditional on the true propensity score, treated and untreated subjects have similar distributions of observed baseline covariates. Propensity-score matching is a popular method of using the propensity score in the medical literature. Using this approach, matched sets of treated and untreated subjects with similar values of the propensity score are formed. Inferences about treatment effect made using propensity-score matching are valid only if, in the matched sample, treated and untreated subjects have similar distributions of measured baseline covariates. In this paper we discuss the following methods for assessing whether the propensity score model has been correctly specified: comparing means and prevalences of baseline characteristics using standardized differences; ratios comparing the variance of continuous covariates between treated and untreated subjects; comparison of higher order moments and interactions; five-number summaries; and graphical methods such as quantile-quantile plots, side-by-side boxplots, and non-parametric density plots for comparing the distribution of baseline covariates between treatment groups. We describe methods to determine the sampling distribution of the standardized difference when the true standardized difference is equal to zero, thereby allowing one to determine the range of standardized differences that are plausible with the propensity score model having been correctly specified. We highlight the limitations of some previously used methods for assessing the adequacy of the specification of the propensity-score model. In particular, methods based on comparing the distribution of the estimated propensity score between treated and untreated subjects are uninformative.

PubMed Disclaimer

Figures

**Figure 1**
Absolute standardized differences for baseline covariates comparing treated to untreated subjects in the original and the matched sample.

**Figure 2**
Relationship between sample size and the standard deviation of empirical sampling distribution of standardized difference.

**Figure 3**
Side-by-side boxplots and *Q–Q* plots for age.

**Figure 4**
Density plots and cumulative distribution functions for age.

**Figure 5**
Distribution of estimated propensity score in treated and untreated subjects in different matched samples.

See this image and copyright information in PMC

References

1. Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70:41–55.
1. Rosenbaum PR, Rubin DB. Reducing bias in observational studies using subclassification on the propensity score. Journal of the American Statistical Association. 1984;79:516–524.
1. Austin PC, Mamdani MM. A comparison of propensity score methods: a case-study estimating the effectiveness of post-AMI statin use. Statistics in Medicine. 2006;25:2084–2106. - PubMed
1. Weitzen S, Lapane KL, Toledano AY, Hume AL, Mor V. Principles for modeling propensity scores in medical research: a systematic literature review. Pharmacoepidemiology and Drug Safety. 2004;13:841–853. - PubMed
1. Shah BR, Laupacis A, Hux JE, Austin PC. Propensity score methods give similar results to traditional regression modeling in observational studies: a systematic review. Journal of Clinical Epidemiology. 2005;58:550–559. - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
Medical
- ClinicalTrials.gov

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity-score matched samples

Affiliation

Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity-score matched samples

Author

Affiliation

Abstract

Figures

References

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical