Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016;111(515):942-947.
doi: 10.1080/01621459.2016.1200914. Epub 2016 Oct 18.

Comment

Affiliations

Comment

Jingxiang Chen et al. J Am Stat Assoc. 2016.

Abstract

Xu, Müller, Wahed, and Thall proposed a Bayesian model to analyze an acute leukemia study involving multi-stage chemotherapy regimes. We discuss two alternative methods, Q-learning and O-learning, to solve the same problem from the machine learning point of view. The numerical studies show that these methods can be flexible and have advantages in some situations to handle treatment heterogeneity while being robust to model misspecification.

Keywords: Dynamic treatment regimes; Multi-stage chemotherapy regimes; O-learning; Q-learning.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Redefinition of the Scheme under the proposed Q-learning Framework. The states in red square boxes (i.e., initialization, resistance and progression) are the treatment decision-making points that are used to split the two stages. Complete remission (C) is not considered as a splitting point since no decision action can be taken. Censoring time could happen at the end of each stage.

References

    1. Bai X, Tsiatis A, Lu W, Song R. Optimal Treatment Regimes for Survival Endpoints Using Locally-Efficient Doubly-Robust Estimator from a Classification Perspective. Statistics in Medicine. 2015 doi: 10.1007/s10985-016-9376-x. Under revision. [946] - DOI - PMC - PubMed
    1. Brookes ST, Whitely E, Egger M, Smith GD, Mulheran PA, Peters TJ. Subgroup Analyses in Randomized Trials: Risks of Subgroup-Specific Analyses; Power and Sample Size for the Interaction Test. Journal of clinical epidemiology. 2004;57:229–236. [946] - PubMed
    1. Fan A, Song R, Lu W. Change-Plane Analysis for Subgroup Detection and Sample Size Calculation. Journal of the American Statistical Association. 2016 doi: 10.1080/01621459.2016.1166115. To appear. [946] - DOI - PMC - PubMed
    1. Goldberg Y, Kosorok MR. Q-Learning with Censored Data. Annals of Statistics. 2012;40:529–560. [946] - PMC - PubMed
    1. Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. New York: Springer; 2011. [944]

LinkOut - more resources