Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul 1;25(3):617-632.
doi: 10.1093/biostatistics/kxad024.

Semi-supervised mixture multi-source exchangeability model for leveraging real-world data in clinical trials

Affiliations

Semi-supervised mixture multi-source exchangeability model for leveraging real-world data in clinical trials

Lillian M F Haine et al. Biostatistics. .

Erratum in

  • Correction.
    [No authors listed] [No authors listed] Biostatistics. 2024 Dec 31;26(1):kxae029. doi: 10.1093/biostatistics/kxae029. Biostatistics. 2024. PMID: 39186534 Free PMC article. No abstract available.

Abstract

The traditional trial paradigm is often criticized as being slow, inefficient, and costly. Statistical approaches that leverage external trial data have emerged to make trials more efficient by augmenting the sample size. However, these approaches assume that external data are from previously conducted trials, leaving a rich source of untapped real-world data (RWD) that cannot yet be effectively leveraged. We propose a semi-supervised mixture (SS-MIX) multisource exchangeability model (MEM); a flexible, two-step Bayesian approach for incorporating RWD into randomized controlled trial analyses. The first step is a SS-MIX model on a modified propensity score and the second step is a MEM. The first step targets a representative subgroup of individuals from the trial population and the second step avoids borrowing when there are substantial differences in outcomes among the trial sample and the representative observational sample. When comparing the proposed approach to competing borrowing approaches in a simulation study, we find that our approach borrows efficiently when the trial and RWD are consistent, while mitigating bias when the trial and external data differ on either measured or unmeasured covariates. We illustrate the proposed approach with an application to a randomized controlled trial investigating intravenous hyperimmune immunoglobulin in hospitalized patients with influenza, while leveraging data from an external observational study to supplement a subgroup analysis by influenza subtype.

Keywords: Bayesian model averaging; Causal inference; Influenza; Propensity scores; Real-world data.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Fig. 1
Fig. 1
Overview of SS-MIX–MEM approach.
Fig. 2
Fig. 2
Simulations figures. Scenario 1: the “ideal scenario” where the RCT and observational populations are identical, there is no residual bias. Scenario 2: we vary the proportion of observational study individuals from the RCT population. Scenario 3: we introduce residual differences between the RCT and the representative subgroup.
Fig. 3
Fig. 3
FLU-IVIG and FLU 003 application results. Subtype A: All methods produce similar null results with 95% confidence/credible intervals (CIs) for the OR including 1. For all borrowing approaches, ESSS values are greater than 0 and 95% CIs are smaller than a reference model indicating that we gain precision from borrowing. Subtype B: All methods have similar results with OR estimates well below 1 and none of the 95% CIs for the OR including 1. The borrowing approaches do not leverage the FLU 003 data illustrated by ESSS values around 0.
Fig. 4
Fig. 4
Borrowing on both arms simulations results. Scenario 1: there is no bias introduced, confounding is introduced via the treated and control arms mixing parameters, which both vary between 0 and 1. Note that the x-axis is the true mixing parameter of the treated individuals in the observational data and the linetype is the true mixing parameter of the control individuals in the observational data. Scenario 2: the mixing parameter for the treated and control arms is either both 0.5 or both 1, residual bias varies between –2 and 2 for the treated arm observational data, the control arm has no residual bias. Note that the x-axis is the log OR between the treated individuals in the observational data and the treated individuals in the trial.

References

    1. Bareinboim E., Pearl J. (2013). A general algorithm for deciding transportability of experimental results. Journal of Causal Inference 1, 107–134.
    1. Boatman J. A., Vock D. M., Koopmeiners J. S. (2021). Borrowing from supplemental sources to estimate causal effects from a primary data source. Statistics in Medicine 40, 5115–5130. - PMC - PubMed
    1. Chen W. C., Wang C., Li H., Lu N., Tiwari R., Xu Y., Yue L. Q. (2020). Propensity score-integrated composite likelihood approach for augmenting the control arm of a randomized controlled trial by incorporating real-world data. Journal of Biopharmaceutical Statistics 30, 508–520. - PubMed
    1. Wang Chenguang (2021). psrwe: PS-integrated methods for incorporating RWE in clinical studies. R package version 2.2. https://github.com/olssol/psrwe.
    1. Chib S. (1996). Calculating posterior distributions and modal estimates in Markov mixture models. Journal of Econometrics 75, 79–97.

MeSH terms