. 2018 Mar 13;115(11):2571-2577.

doi: 10.1073/pnas.1708282114.

Empirical confidence interval calibration for population-level effect estimation studies in observational healthcare data

Martijn J Schuemie^{1

2}, George Hripcsak^{3

4

5}, Patrick B Ryan^{3

2

4}, David Madigan^{3

6}, Marc A Suchard^{3

7

8

9}

Affiliations

¹ Observational Health Data Sciences and Informatics, New York, NY 10032; schuemie@ohdsi.org.
² Epidemiology Analytics, Janssen Research & Development, Titusville, NJ 08560.
³ Observational Health Data Sciences and Informatics, New York, NY 10032.
⁴ Department of Biomedical Informatics, Columbia University, New York, NY 10032.
⁵ Medical Informatics Services, New York-Presbyterian Hospital, New York, NY 10032.
⁶ Department of Statistics, Columbia University, New York, NY 10027.
⁷ Department of Biomathematics, University of California, Los Angeles, CA 90095.
⁸ Department of Biostatistics, University of California, Los Angeles, CA 90095.
⁹ Department of Human Genetics, University of California, Los Angeles, CA 90095.

PMID: 29531023
PMCID: PMC5856503
DOI: 10.1073/pnas.1708282114

Empirical confidence interval calibration for population-level effect estimation studies in observational healthcare data

Martijn J Schuemie et al. Proc Natl Acad Sci U S A. 2018.

. 2018 Mar 13;115(11):2571-2577.

doi: 10.1073/pnas.1708282114.

Authors

Martijn J Schuemie^{1

2}, George Hripcsak^{3

4

5}, Patrick B Ryan^{3

2

4}, David Madigan^{3

6}, Marc A Suchard^{3

7

8

9}

Affiliations

¹ Observational Health Data Sciences and Informatics, New York, NY 10032; schuemie@ohdsi.org.
² Epidemiology Analytics, Janssen Research & Development, Titusville, NJ 08560.
³ Observational Health Data Sciences and Informatics, New York, NY 10032.
⁴ Department of Biomedical Informatics, Columbia University, New York, NY 10032.
⁵ Medical Informatics Services, New York-Presbyterian Hospital, New York, NY 10032.
⁶ Department of Statistics, Columbia University, New York, NY 10027.
⁷ Department of Biomathematics, University of California, Los Angeles, CA 90095.
⁸ Department of Biostatistics, University of California, Los Angeles, CA 90095.
⁹ Department of Human Genetics, University of California, Los Angeles, CA 90095.

PMID: 29531023
PMCID: PMC5856503
DOI: 10.1073/pnas.1708282114

Abstract

Observational healthcare data, such as electronic health records and administrative claims, offer potential to estimate effects of medical products at scale. Observational studies have often been found to be nonreproducible, however, generating conflicting results even when using the same database to answer the same question. One source of discrepancies is error, both random caused by sampling variability and systematic (for example, because of confounding, selection bias, and measurement error). Only random error is typically quantified but converges to zero as databases become larger, whereas systematic error persists independent from sample size and therefore, increases in relative importance. Negative controls are exposure-outcome pairs, where one believes no causal effect exists; they can be used to detect multiple sources of systematic error, but interpreting their results is not always straightforward. Previously, we have shown that an empirical null distribution can be derived from a sample of negative controls and used to calibrate P values, accounting for both random and systematic error. Here, we extend this work to calibration of confidence intervals (CIs). CIs require positive controls, which we synthesize by modifying negative controls. We show that our CI calibration restores nominal characteristics, such as 95% coverage of the true effect size by the 95% CI. We furthermore show that CI calibration reduces disagreement in replications of two pairs of conflicting observational studies: one related to dabigatran, warfarin, and gastrointestinal bleeding and one related to selective serotonin reuptake inhibitors and upper gastrointestinal bleeding. We recommend CI calibration to improve reproducibility of observational studies.

Keywords: calibration; observational studies; systematic error.

PubMed Disclaimer

Conflict of interest statement

Conflict of interest statement: M.J.S. and P.B.R. are full-time employees and shareholders of Janssen Research & Development.

Figures

**Fig. 1.**
Uncalibrated estimates and corresponding SEs for the negative and positive controls in the four studies. The estimates are stratified by true effect size. The areas above the red dashed lines indicate where the CIs include the true effect size. Note that, because of limitations in sample size, not all negative controls could be used to synthesize positive controls.

**Fig. 2.**
The fraction of controls where the true hazard ratio is above, within, or below the CI for various widths of the CI. The dashed lines indicate the boundaries of a perfectly calibrated and centered estimator.

**Fig. 3.**
Calibrated estimates and corresponding SEs for the negative and positive controls in the four studies. The estimates are stratified by true effect size. The areas above the red dashed lines indicate where the CIs include the true effect size.

**Fig. 4.**
The fraction of controls where the true hazard ratio is above, within, or below the calibrated CI for various widths of the CI. The dashed lines indicate the boundaries of a perfectly calibrated and centered estimator. Fractions were computed using leave-one-out cross-validation.

**Fig. 5.**
Estimates from the original studies and our reproduction of the studies by Southworth et al. (12) and Graham et al. (13) both before and after calibration.

**Fig. 6.**
Estimates from the original studies and our reproduction of the studies by Tata et al. (14) both before and after calibration.

See this image and copyright information in PMC

References

1. Overhage JM, Ryan PB, Schuemie MJ, Stang PE. Desideratum for evidence based epidemiology. Drug Saf. 2013;1(36 Suppl):S5–S14. - PubMed
1. Prasad V, Jena AB. Prespecified falsification end points: Can they validate true observational associations? JAMA. 2013;309:241–242. - PubMed
1. Dusetzina SB, Brookhart MA, Maciejewski ML. Control outcomes and exposures for improving internal validity of nonrandomized studies. Health Serv Res. 2015;50:1432–1451. - PMC - PubMed
1. Arnold BF, Ercumen A. Negative control outcomes: A tool to detect bias in randomized trials. JAMA. 2016;316:2597–2598. - PMC - PubMed
1. Lipsitch M, Tchetgen Tchetgen E, Cohen T. Negative controls: A tool for detecting confounding and bias in observational studies. Epidemiology. 2010;21:383–388. - PMC - PubMed

Publication types

Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

R01 LM006910/LM/NLM NIH HHS/United States

LinkOut - more resources

Full Text Sources
Other Literature Sources
- H1 Connect - Access expert opinions and insights on biomedical research.
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Empirical confidence interval calibration for population-level effect estimation studies in observational healthcare data

Affiliations

Empirical confidence interval calibration for population-level effect estimation studies in observational healthcare data

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources