Bayesian adaptive bandit-based designs using the Gittins index for multi-armed trials with normally distributed endpoints

Adam L Smith¹, Sofía S Villar²

Affiliations

¹ Department of Pure Mathematics and Mathematical Statistics, University of Cambridge, Cambridge, UK.
² MRC Biostatistics Unit, University of Cambridge, School of Clinical Medicine, Cambridge, UK.

PMID: 29551849
PMCID: PMC5856359
DOI: 10.1080/02664763.2017.1342780


Adam L Smith et al. J Appl Stat. 2018;45(6):1052-1076.

doi: 10.1080/02664763.2017.1342780. Epub 2017 Jun 28.


Abstract

Adaptive designs for multi-armed clinical trials have become increasingly popular because of their potential to shorten development times and to increase patient response. However, developing response-adaptive designs that offer patient benefit while ensuring that the resulting trial provides a statistically rigorous and unbiased comparison of the treatments included is highly challenging. In this paper, the theory of multi-armed bandit problems is used to define near-optimal adaptive designs in the context of a clinical trial with a normally distributed endpoint with known variance. We report the operating characteristics (type I error, power, bias) and patient benefit of these approaches and alternative designs using simulation studies based on an ongoing trial. These results are then compared to those recently published in the context of Bernoulli endpoints. Many limitations and advantages are similar in both cases, but there are also important differences, especially with respect to type I error control. This paper proposes a simulation-based testing procedure to correct for the observed type I error inflation that bandit-based and adaptive rules can induce.

Keywords: Gittins index; Multi-armed bandit; normally distributed endpoint; patient allocation; response adaptive procedures; sequential sampling.
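The simulation-based testing idea described in the abstract can be illustrated with a minimal sketch: simulate the adaptive design many times under the null hypothesis, take the empirical 95th percentile of the test statistic as the critical value, and use that value in place of the standard normal quantile. The sketch below is not the authors' implementation. It assumes a two-arm trial with known variance, a flat prior, and a standardised difference in sample means as the test statistic, and it swaps in Thompson sampling for the Gittins index rule, since computing Gittins indices requires tabulated or numerically approximated values. All function names, parameters and default values are hypothetical.

```python
import math
import random


def simulate_trial(mu0, mu1, sigma=1.0, n_patients=60, burn_in=2, rng=None):
    """Simulate one two-arm trial and return the test statistic Z.

    Allocation uses Thompson sampling (a stand-in for the Gittins index
    rule): after a short burn-in, each patient goes to the arm whose
    posterior draw of the mean is larger. Assumes known sigma and a flat
    prior, so the posterior of each arm mean is N(xbar, sigma^2 / n).
    """
    rng = rng or random.Random()
    true_means = (mu0, mu1)
    sums = [0.0, 0.0]
    counts = [0, 0]
    for _ in range(n_patients):
        if min(counts) < burn_in:
            # Burn-in: alternate arms so both sample means are defined.
            arm = 0 if counts[0] <= counts[1] else 1
        else:
            draws = [
                rng.gauss(sums[k] / counts[k], sigma / math.sqrt(counts[k]))
                for k in (0, 1)
            ]
            arm = 0 if draws[0] > draws[1] else 1
        sums[arm] += rng.gauss(true_means[arm], sigma)
        counts[arm] += 1
    xbar0 = sums[0] / counts[0]
    xbar1 = sums[1] / counts[1]
    # Standardised difference in means with known variance.
    return (xbar1 - xbar0) / (sigma * math.sqrt(1 / counts[0] + 1 / counts[1]))


def empirical_critical_value(n_sims=2000, alpha=0.05, seed=1, **trial_kwargs):
    """Empirical (1 - alpha) quantile of Z over trials simulated under H0."""
    rng = random.Random(seed)
    z = sorted(
        simulate_trial(0.0, 0.0, rng=rng, **trial_kwargs) for _ in range(n_sims)
    )
    idx = min(n_sims - 1, math.ceil((1 - alpha) * n_sims) - 1)
    return z[idx]


def rejection_rate(mu0, mu1, critical_value, n_sims=2000, seed=2, **trial_kwargs):
    """Fraction of simulated trials in which Z exceeds the critical value."""
    rng = random.Random(seed)
    hits = sum(
        simulate_trial(mu0, mu1, rng=rng, **trial_kwargs) > critical_value
        for _ in range(n_sims)
    )
    return hits / n_sims


if __name__ == "__main__":
    c = empirical_critical_value()
    print("empirical critical value:", c)
    print("type I error with 1.645:", rejection_rate(0.0, 0.0, 1.645))
    print("type I error with empirical value:", rejection_rate(0.0, 0.0, c))
    print("power with empirical value:", rejection_rate(0.0, 1.0, c))
```

Comparing the rejection rate under the null at the nominal cut-off of 1.645 with the rate at the empirical cut-off gives a rough sense of how much the adaptive allocation distorts the null distribution of the statistic in this toy setting. The numbers it produces are illustrative only and are not comparable to the results reported in the paper.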

Conflict of interest statement

Disclosure statement: No potential conflict of interest was reported by the authors.

Figures

**Figure 1.**
Gittins Index values (normal reward process, known variance) for various discount factors d.

**Figure 2.**
The posterior mean ${\bar{x}}_{k, t}$ of each treatment arm's outcomes after each patient in a typical GI trial under $H_{0}$.

**Figure 3.**
Histograms of the empirical distribution of the test statistic $Z$ in GI trials, implemented under each hypothesis. Also marked is the standard normal distribution that $Z$ should follow in the FR trial (red). The sample mean $\bar{Z}$, standard deviation $S_{Z}$ and empirical 95th percentile $C_{0.05}$ have been calculated under $H_{0}$. The empirical 95th percentile under $H_{0}$ corresponds to the critical value for hypothesis testing and is marked by a vertical dotted line on the histograms. (a) GI trial under $H_{0}$, (b) GI trial under $H_{1}$.

**Figure 4.**
$E({\bar{x}}_{k}^{(t)} - \mu_{k})$, the mean (across the trial realisations) of the bias in the estimated outcome of each treatment after a total of $t$ patients have been treated across both arms in the trial, under each scenario (two-arm trial simulations). (a) $H_{0}$, control arm $k=0$, (b) $H_{0}$, experimental arm $k=1$, (c) $H_{1}$, control arm $k=0$, (d) $H_{1}$, experimental arm $k=1$.

**Figure 5.**
$E({\bar{x}}_{k}^{(t)} - \mu_{k})$, the mean (across the trial repeats) of the bias in the estimated treatment outcome of each drug under each scenario in the four-arm trial (large sample size). (a) $H_{0}$, control arm $k=0$, (b) $H_{0}$, experimental arm $k=3$, (c) $H_{1}$, control arm $k=0$, (d) $H_{1}$, experimental arm $k=3$.

**Figure 6.**
Empirical critical values $C_{0.05}$ for one-tailed testing to maintain 5% FWER in the four-arm trial design, against number T of patients in the trial.

**Figure A.1.**
Histograms of the empirical distribution of the test statistic $Z_{0, 1}$ in TS, RBI, RGI, UCB, KLU and CB two-arm trials, implemented under each hypothesis (as in Figure 3). Also marked is the standard normal distribution that $Z_{0, 1}$ should follow in the FR trial (red). For each design, the sample mean ${\bar{Z}}_{0, 1}$, standard deviation $S_{Z_{0, 1}}$ and empirical 95th percentile $C_{0.05}$ have been calculated under $H_{0}$. The empirical 95th percentile under $H_{0}$ corresponds to the critical value for hypothesis testing and is marked by a vertical dotted line on the histograms. (a) TS trial under $H_{0}$, (b) TS trial under $H_{1}$, (c) RBI trial under $H_{0}$, (d) RBI trial under $H_{1}$, (e) RGI trial under $H_{0}$, (f) RGI trial under $H_{1}$, (g) UCB trial under $H_{0}$, (h) UCB trial under $H_{1}$, (i) KLU trial under $H_{0}$, (j) KLU trial under $H_{1}$, (k) CB trial under $H_{0}$, (l) CB trial under $H_{1}$.



Grants and funding

MC_UP_1302/2/MRC_/Medical Research Council/United Kingdom
