Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Aug 15;25(16):4993-5001.
doi: 10.1158/1078-0432.CCR-19-0820. Epub 2019 Jun 7.

Design and Evaluation of an External Control Arm Using Prior Clinical Trials and Real-World Data

Affiliations

Design and Evaluation of an External Control Arm Using Prior Clinical Trials and Real-World Data

Steffen Ventz et al. Clin Cancer Res. .

Abstract

Purpose: We discuss designs and interpretable metrics of bias and statistical efficiency of "externally controlled" trials (ECT) and compare ECT performance to randomized and single-arm designs.

Experimental design: We specify an ECT design that leverages information from real-world data (RWD) and prior clinical trials to reduce bias associated with interstudy variations of the enrolled populations. We then used a collection of clinical studies in glioblastoma (GBM) and RWD from patients treated with the current standard of care to evaluate ECTs. Validation is based on a "leave one out" scheme, with iterative selection of a single-arm from one of the studies, for which we estimate treatment effects using the remaining studies as external control. This produces interpretable and robust estimates on ECT bias and type I errors.

Results: We developed a model-free approach to evaluate ECTs based on collections of clinical trials and RWD. For GBM, we verified that inflated false positive error rates of standard single-arm trials can be considerably reduced (up to 30%) by using external control data.

Conclusions: The use of ECT designs in GBM, with adjustments for the clinical profiles of the enrolled patients, should be preferred to single-arm studies with fixed efficacy thresholds extracted from published results on the current standard of care.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
Bias (πSATπHC)and deviations from a targeted type I error rate of 10%. Bias is due to different patient populations in the single arm trial (SAT) and in the historical study. A single binary characteristic (X=1 or X=0) correlates with the binary outcome Y, and the experimental treatment has no therapeutic effect Pr(Y|X, A = 1) = Pr(Y|X, A = 0). The characteristic X=1 was present in 50% of the patients in the historical control arm, PHC(X = 1) = 0.5. Panel (A) shows the difference (πSATπHC) for a range of probabilities PSAT(X = 1). We consider four levels of association between X and Y; (Pr(Y|X = 1, A = a) and P(Y|X = 0, A = a) equal either to (0.3, 0.9), (0.4, 0.8), (0.5, 0.7) or (0.6, 0.6). Panel (B) indicates, for a SAT (with standard z-test for proportions, H0: πSATπHC) how the false positive rate (y-axis) of the design deviates without adjustments from the targeted type I error rate of 10% when the prevalence PSAT(X = 1) = 0.3, 0.5, 0.6 or 0.8. We consider different sample sizes (x-axis) of the SAT. In panel (B) we assume to know the parameter πHC.
Figure 2:
Figure 2:
Treatment effect estimates of the ECT design. For each studies the RT+TZM arm was used as ECT’s experimental arm and (after adjustment for patents characteristics) compared to the RT+TZM arms of the remaining four studies. Panel (A) shows, for each of the study, covariate adjusted treatment effect estimates (point estimates and 90% confidence interval, n equals to the arm-specific size). Panel (B) shows treatment effect estimates (average value, 5th and 95th percentile) across 10,000 subsamples of n=46 patients using different adjustment methods. We consider direct standardization, matching, inverse-probability weighting and marginal structural models (DSM, PS-M, IPW, MSM). For IPW and MSM we use different reference distributions PrX(x) (see expression 1) of pre-treatment characteristics X. Panel (C) shows the distribution of treatment effect estimates of the ECT (blue line) and RCT (black line) across subsamples of n=46 patients.
Figure 3:
Figure 3:
Model-based evaluation of the type I error and power for RCT, ECT and single-arm trial (SAT) designs for an overall study sample size of n=20, …, 160 patients. In the model-based approach (Supplementary-Material) we sampled baseline characteristics X from the five studies in Table 1, and generated outcomes Y from models Pr(Y|X, A). Panel (A) shows for all studies the type I error rates of RCT, ECT and SAT designs at different overall sample sizes. Different line types (solid, dashed, dotted, etc.) indicate different studies (Table 1). Panels (B-F) show for each study, the power of RCT, SAT and ECT designs, and sample size to achieve 80% power (dotted vertical lines). In panel A, the single arm trial experimental outcomes have been generated as in the ECT simulations, but outcomes Y are directly compared to the EORTC-NCIC CE.3 study estimates, without adjustments for different distributions of patients’ characteristics. For RCTs, half of the randomly selected profiles X are used to define the experimental arm and the remaining half defines the control arm. Two-group (RCT) and single-group (single arm trial) z-tests for proportions were used for testing. To compute the power in Panels B-F of the SAT, we assumed that the historical control benchmark πHC was correctly specified.

References

    1. Marshall G, Blacklock JW, Cameron C, et al. Streptomycin Treatment of Pulmonary Tuberculosis: A Medical Research Council Investigation. Bmj. 1948. doi:10.1136/bmj.2.4582.769 - DOI
    1. Armitage P. Fisher, Bradford Hill, and randomization. Int J Epidemiol. 2003;32(6):925–928. doi:10.1093/ije/dyg286 - DOI - PubMed
    1. Seruga B, Ocana A, Amir E, Tannock IF. Failures in Phase III: Causes and consequences. Clin Cancer Res. 2015;21(20):4552–4560. doi:10.1158/1078-0432.CCR-15-0124 - DOI - PubMed
    1. Gan HK, Grothey A, Pond GR, Moore MJ, Siu LL, Sargent D. Randomized phase II trials: Inevitable or inadvisable? J Clin Oncol. 2010;28(15):2641–2647. doi:10.1200/JCO.2009.26.3343 - DOI - PubMed
    1. Eborall HC, Stewart MCW, Cunningham-Burley S, Price JF, Fowkes FGR. Accrual and drop out in a primary prevention randomised controlled trial: Qualitative study. Trials. 2011;12(1):7. doi:10.1186/1745-6215-12-7 - DOI - PMC - PubMed

Publication types