Estimation of optimal treatment regimes with electronic medical record data using the residual life value estimator
- PMID: 38332633
- PMCID: PMC11471959
- DOI: 10.1093/biostatistics/kxae002
Estimation of optimal treatment regimes with electronic medical record data using the residual life value estimator
Erratum in
-
Correction.Biostatistics. 2024 Dec 31;26(1):kxae029. doi: 10.1093/biostatistics/kxae029. Biostatistics. 2024. PMID: 39186534 Free PMC article. No abstract available.
Abstract
Clinicians and patients must make treatment decisions at a series of key decision points throughout disease progression. A dynamic treatment regime is a set of sequential decision rules that return treatment decisions based on accumulating patient information, like that commonly found in electronic medical record (EMR) data. When applied to a patient population, an optimal treatment regime leads to the most favorable outcome on average. Identifying optimal treatment regimes that maximize residual life is especially desirable for patients with life-threatening diseases such as sepsis, a complex medical condition that involves severe infections with organ dysfunction. We introduce the residual life value estimator (ReLiVE), an estimator for the expected value of cumulative restricted residual life under a fixed treatment regime. Building on ReLiVE, we present a method for estimating an optimal treatment regime that maximizes expected cumulative restricted residual life. Our proposed method, ReLiVE-Q, conducts estimation via the backward induction algorithm Q-learning. We illustrate the utility of ReLiVE-Q in simulation studies, and we apply ReLiVE-Q to estimate an optimal treatment regime for septic patients in the intensive care unit using EMR data from the Multiparameter Intelligent Monitoring Intensive Care database. Ultimately, we demonstrate that ReLiVE-Q leverages accumulating patient information to estimate personalized treatment regimes that optimize a clinically meaningful function of residual life.
Keywords: MIMIC-III; Q-learning; context vector; dynamic treatment regime; electronic medical record; precision medicine; random forest; residual life; sepsis.
© The Author(s) 2024. Published by Oxford University Press. All rights reserved. [br]For permissions, please e-mail: journals.permissions@oup.com.
Conflict of interest statement
None declared.
Figures
References
-
- Bellman R. (1957). Dynamic programming. Princeton, NJ: Princeton University Press.
-
- Breiman L. (2001). Random forests. Mach Learn 45, 5–32.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
