Machine learning methods for developing precision treatment rules with observational data
- PMID: 31233922
- PMCID: PMC7556331
- DOI: 10.1016/j.brat.2019.103412
Machine learning methods for developing precision treatment rules with observational data
Abstract
Clinical trials have identified a variety of predictor variables for use in precision treatment protocols, ranging from clinical biomarkers and symptom profiles to self-report measures of various sorts. Although such variables are informative collectively, none has proven sufficiently powerful to guide optimal treatment selection individually. This has prompted growing interest in the development of composite precision treatment rules (PTRs) that are constructed by combining information across a range of predictors. But this work has been hampered by the generally small samples in randomized clinical trials and the use of suboptimal analysis methods to analyze the resulting data. In this paper, we propose to address the sample size problem by: working with large observational electronic medical record databases rather than controlled clinical trials to develop preliminary PTRs; validating these preliminary PTRs in subsequent pragmatic trials; and using ensemble machine learning methods rather than individual algorithms to carry out statistical analyses to develop the PTRs. The major challenges in this proposed approach are that treatment are not randomly assigned in observational databases and that these databases often lack measures of key prescriptive predictors and mental disorder treatment outcomes. We proposed a tiered case-cohort design approach that uses innovative methods for measuring and balancing baseline covariates and estimating PTRs to address these challenges.
Keywords: Clinical decision support; Ensemble machine learning; Personalized treatment; Precision treatment; Super learner.
Copyright © 2019 Elsevier Ltd. All rights reserved.
Conflict of interest statement
Declarations of interest
In the past 3 years, Dr. Kessler received support for his epidemiological studies from Sanofi-Aventis; was a consultant for Johnson & Johnson Wellness and Prevention, Sage, Shire, Takeda; and served on an advisory board for the Johnson & Johnson Services Inc. Lake Nona Life Project. Kessler is a co-owner of DataStat, Inc., a market research firm that carries out healthcare research. Zubizarreta was a consultant for Johnson & Johnson Real World Data Analytics and Research, New Brunswick, NJ. The remaining authors have no financial disclosures.
References
-
- Abadie A, Chingos MM, & West MR (2018). Endogenous stratification in randomized experiments. The Review of Economics and Statistics, C(4), 567–580.
-
- American Community Survey (ACS) (2019). https://www.census.gov/programs-surveys/acs, Accessed date: 15 May 2019.
-
- Angrist JD, Imbens GW, & Rubin DB (1996). Identification of causal effects using instrumental variables. Journal of the American Statistical Association, 91(434), 444–455.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources