Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 May 18:4:572532.
doi: 10.3389/fdata.2021.572532. eCollection 2021.

Evaluating the Effectiveness of Personalized Medicine With Software

Affiliations

Evaluating the Effectiveness of Personalized Medicine With Software

Adam Kapelner et al. Front Big Data. .

Abstract

We present methodological advances in understanding the effectiveness of personalized medicine models and supply easy-to-use open-source software. Personalized medicine involves the systematic use of individual patient characteristics to determine which treatment option is most likely to result in a better average outcome for the patient. Why is personalized medicine not done more in practice? One of many reasons is because practitioners do not have any easy way to holistically evaluate whether their personalization procedure does better than the standard of care, termed improvement. Our software, "Personalized Treatment Evaluator" (the R package PTE), provides inference for improvement out-of-sample in many clinical scenarios. We also extend current methodology by allowing evaluation of improvement in the case where the endpoint is binary or survival. In the software, the practitioner inputs 1) data from a single-stage randomized trial with one continuous, incidence or survival endpoint and 2) an educated guess of a functional form of a model for the endpoint constructed from domain knowledge. The bootstrap is then employed on data unseen during model fitting to provide confidence intervals for the improvement for the average future patient (assuming future patients are similar to the patients in the trial). One may also test against a null scenario where the hypothesized personalization are not more useful than a standard of care. We demonstrate our method's promise on simulated data as well as on data from a randomized comparative trial investigating two treatments for depression.

Keywords: bootstrap; inference; personalized medicine; randomized comparative trial; statistical software; treatment regimes.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
A graphical illustration of (1) our proposed method for estimation and (2) our proposed method for inference on the population mean improvement of an allocation procedure and (3) our proposed future allocation procedure (top left of the illustration). To compute the best estimate of the improvement I^0, the RCT data goes through the K-fold cross validation procedure of Section 3.4 (depicted in the top center). The black slices of the data frame represent the test data. To draw inference, we employ the non-parametric bootstrap procedure of Section 3.5 by sampling the RCT data with replacement and repeating the K-fold CV to produce I^01,I^02,,I^0B (bottom). The gray slices of the data frame represent the duplicate rows in the original data due to sampling with replacement. The confidence interval and significance of H0:μI00 is computed from the bootstrap distribution (middle center). Finally, the practitioner receives f^ which is built with the complete RCT data (top left).
FIGURE 2
FIGURE 2
Histograms of the bootstrap samples of the out-of-sample improvement measures for d0 random (left column) and d0 best (right column) for the response model of Eq. 11 for different values of n. I^0 is illustrated with a thick black line. The CIμI0,95% computed by the percentile method is illustrated by thin black lines.
FIGURE 3
FIGURE 3
Histograms of the bootstrap samples of the cross-validated improvement measures for d0 random (left column) and d0 best (right column) for the response model of Eq. 12 for different values of n. I^0 is illustrated with a thick black line. The CIμI0,95% computed via the percentile method is illustrated by thin black lines. The true population improvement μI0* given the optimal rule d* is illustrated with a dotted black line.
FIGURE 4
FIGURE 4
Histograms of the bootstrap samples of I˜Rand i.e. for the randomd0 business-as-usual allocation procedure. The thick black line is the best estimate of I^0, the thin black lines are the confidence interval computed via the percentile method. More negative values are “better” as improvement is defined as lowering the HSRD composite score corresponding to a patient being less depressed.

Similar articles

Cited by

References

    1. Agresti A. (2018). An Introduction to Categorical Data Analysis. Hoboken, NJ: John Wiley & Sons.
    1. Bagby R. M., Quilty L. C., Segal Z. V., McBride C. C., Kennedy S. H., Costa P. T. (2008). Personality and Differential Treatment Response in Major Depression: a Randomized Controlled Trial Comparing Cognitive-Behavioural Therapy and Pharmacotherapy. Can. J. Psychiatry 53, 361–370. 10.1177/070674370805300605 - DOI - PMC - PubMed
    1. Barrett J. K., Henderson R., Rosthøj S. (2014). Doubly Robust Estimation of Optimal Dynamic Treatment Regimes. Stat. Biosci. 6, 244–260. 10.1007/s12561-013-9097-6 - DOI - PMC - PubMed
    1. Berger J. O., Wang X., Shen L. (2014). A Bayesian Approach to Subgroup Identification. J. Biopharm. Stat. 24, 110–129. 10.1080/10543406.2013.856026 - DOI - PubMed
    1. Berk R. A., Brown L., Buja A., Zhang K., Zhao L. (2013b). Valid Post-selection Inference. Ann. Stat. 41, 802–837. 10.1214/12-aos1077 - DOI

LinkOut - more resources