Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Aug;11(4):408-417.
doi: 10.1177/1740774514537727. Epub 2014 Jun 12.

Inference about the expected performance of a data-driven dynamic treatment regime

Affiliations

Inference about the expected performance of a data-driven dynamic treatment regime

Bibhas Chakraborty et al. Clin Trials. 2014 Aug.

Abstract

Background: A dynamic treatment regime (DTR) comprises a sequence of decision rules, one per stage of intervention, that recommends how to individualize treatment to patients based on evolving treatment and covariate history. These regimes are useful for managing chronic disorders, and fit into the larger paradigm of personalized medicine. The Value of a DTR is the expected outcome when the DTR is used to assign treatments to a population of interest.

Purpose: The Value of a data-driven DTR, estimated using data from a Sequential Multiple Assignment Randomized Trial, is both a data-dependent parameter and a non-smooth function of the underlying generative distribution. These features introduce additional variability that is not accounted for by standard methods for conducting statistical inference, for example, the bootstrap or normal approximations, if applied without adjustment. Our purpose is to provide a feasible method for constructing valid confidence intervals (CIs) for this quantity of practical interest.

Methods: We propose a conceptually simple and computationally feasible method for constructing valid CIs for the Value of an estimated DTR based on subsampling. The method is self-tuning by virtue of an approach called the double bootstrap. We demonstrate the proposed method using a series of simulated experiments.

Results: The proposed method offers considerable improvement in terms of coverage rates of the CIs over the standard bootstrap approach.

Limitations: In this article, we have restricted our attention to Q-learning for estimating the optimal DTR. However, other methods can be employed for this purpose; to keep the discussion focused, we have not explored these alternatives.

Conclusion: Subsampling-based CIs provide much better performance compared to standard bootstrap for the Value of an estimated DTR.

PubMed Disclaimer

Conflict of interest statement

Conflicts: None claimed.

Figures

Figure 1
Figure 1
Hypothetical SMART design schematic for the smoking cessation example. An “R” within a circle denotes randomization.

References

    1. Lei H, Nahum-Shani I, Lynch K, Oslin D, Murphy SA. A SMART design for building individualized treatment sequences. The Annual Review of Psychology. 2012;8:21–48. - PMC - PubMed
    1. Chakraborty B, Murphy SA, Strecher V. Inference for non-regular parameters in optimal dynamic treatment regimes. Statistical Methods in Medical Research. 2010;19:317–343. - PMC - PubMed
    1. Thall PF, Logothetis C, Pagliaro LC, Wen S, Brown MA, Williams D, et al. Adaptive Therapy for Androgen-Independent Prostate Cancer: A Randomized Selection Trial of Four Regimens. Journal of the National Cancer Institute. 2007;99:1613–1622. - PubMed
    1. Miyahara S, Wahed AS. Weighted Kaplan-Meier estimators for two-stage treatment regimes. Statistics in Medicine. 2010;29:2581–2591. - PubMed
    1. Zhao Y, Zeng D, Socinski MA, Kosorok MR. Reinforcement learning strategies for clinical trials in nonsmall cell lung cancer. Biometrics. 2011;67:1422–1433. - PMC - PubMed