Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018;45(6):1052-1076.
doi: 10.1080/02664763.2017.1342780. Epub 2017 Jun 28.

Bayesian adaptive bandit-based designs using the Gittins index for multi-armed trials with normally distributed endpoints

Affiliations

Bayesian adaptive bandit-based designs using the Gittins index for multi-armed trials with normally distributed endpoints

Adam L Smith et al. J Appl Stat. 2018.

Abstract

Adaptive designs for multi-armed clinical trials have become increasingly popular recently because of their potential to shorten development times and to increase patient response. However, developing response-adaptive designs that offer patient-benefit while ensuring the resulting trial provides a statistically rigorous and unbiased comparison of the different treatments included is highly challenging. In this paper, the theory of Multi-Armed Bandit Problems is used to define near optimal adaptive designs in the context of a clinical trial with a normally distributed endpoint with known variance. We report the operating characteristics (type I error, power, bias) and patient-benefit of these approaches and alternative designs using simulation studies based on an ongoing trial. These results are then compared to those recently published in the context of Bernoulli endpoints. Many limitations and advantages are similar in both cases but there are also important differences, specially with respect to type I error control. This paper proposes a simulation-based testing procedure to correct for the observed type I error inflation that bandit-based and adaptive rules can induce.

Keywords: Gittins index; Multi-armed bandit; normally distributed endpoint; patient allocation; response adaptive procedures; sequential sampling.

PubMed Disclaimer

Conflict of interest statement

Disclosure statement No potential conflict of interest was reported by the authors.

Figures

Figure 1.
Figure 1.
Gittins Index values (normal reward process, known variance) for various discount factors d.
Figure 2.
Figure 2.
The posterior mean x¯k,t of each treatment arm's outcomes after each patient in a typical GI trial under H0.
Figure 3.
Figure 3.
Histograms of empirical distributions of the test statistic Z in GI trials, implemented under each hypothesis. Also marked is the standard normal distribution which Z should follow in the FR trial (red). The sample mean Z¯, standard deviation SZ and an empirical 95th-percentile C0.05 have been calculated under H0. The empirical 95th-percentile under H0 will correspond to the critical value for hypothesis testing, and is marked by a vertical dotted line on the histograms. (a) GI trial under H0 (b) GI trial under H1
Figure 4.
Figure 4.
E(x¯k(t)μk), the mean (across the trial realisations) of the bias in the estimated outcome of each treatment after a total of t patients have been treated across both arms in the trial, under each scenario (two-arm trial simulations). (a) H0, control arm k=0, (b) H0, experimental arm k=1 (c) H1, control arm k=0 and (d) H1, experimental arm k=1.
Figure 5.
Figure 5.
E(x¯k(t)μk), the mean (across the trial repeats) of the bias in the estimated treatment outcome of each drug under each scenario in the four-arm trial (large sample size). (a) H0, control arm k=0, (b) H0, experimental arm k=3, (c) H1, control arm k=0, (d) H1, experimental arm k=3.
Figure 6.
Figure 6.
Empirical critical values C0.05 for one-tailed testing to maintain 5% FWER in the four-arm trial design, against number T of patients in the trial.
Figure A.1.
Figure A.1.
Histograms of empirical distributions of the test statistic Z0,1 in TS, RBI, RGI, UCB, KLU and CB two-arm trials, implemented under each hypothesis (as in Figure 3). Also marked is the standard normal distribution which Z0,1 should follow in the FR trial (red). For each design, the sample mean Z¯0,1, standard deviationSZ0,1 and an empirical 95th-percentile C0.05 have been calculated under H0. The empirical 95th-percentile under H0 will correspond to the critical value for hypothesis testing, and is marked by a vertical dotted line on the histograms. (a) TS trial under H0, (b) TS trial under H1, (c) RBI trial under H0, (d) RBI trial under H1, (e) RGI trial under H0, (f) RGI trial under H1, (g) UCB trial under H0, (h) UCB trial under H1, (i) KLU trial underH0, (j) KLU trial under H1, (k) CB trial under H0, (l) CB trial under H1.
Figure A.1.
Figure A.1.
Histograms of empirical distributions of the test statistic Z0,1 in TS, RBI, RGI, UCB, KLU and CB two-arm trials, implemented under each hypothesis (as in Figure 3). Also marked is the standard normal distribution which Z0,1 should follow in the FR trial (red). For each design, the sample mean Z¯0,1, standard deviationSZ0,1 and an empirical 95th-percentile C0.05 have been calculated under H0. The empirical 95th-percentile under H0 will correspond to the critical value for hypothesis testing, and is marked by a vertical dotted line on the histograms. (a) TS trial under H0, (b) TS trial under H1, (c) RBI trial under H0, (d) RBI trial under H1, (e) RGI trial under H0, (f) RGI trial under H1, (g) UCB trial under H0, (h) UCB trial under H1, (i) KLU trial underH0, (j) KLU trial under H1, (k) CB trial under H0, (l) CB trial under H1.

References

    1. do Amaral J.F.P., Aspects of optimal sequential resource allocation, D.Phil. thesis, University of Oxford, 1985.
    1. Atkinson A.C. and Biswas A., Randomised Response-Adaptive Designs in Clinical Trials, CRC Press, Boca Raton, FL, 2014.
    1. Auer P., Cesa-Bianchi N., and Fischer P., Finite-time analysis of the multiarmed bandit problem , Mach. Learn. 47 (2002), pp. 235–256. doi: 10.1023/A:1013689704352 - DOI
    1. Bather J., Randomized allocation of treatments in sequential trials , Adv. Appl. Probab. 12 (1980), pp. 174–182. doi: 10.1017/S0001867800033449 - DOI
    1. Berry D.A., [Investigating therapies of potentially great benefit: ECMO]: comment: ethics and ECMO , Stat. Sci. 4 (1989), pp. 306–310. doi: 10.1214/ss/1177012385 - DOI

LinkOut - more resources