Comparative Study

. 2019 Oct:114:72-83.

doi: 10.1016/j.jclinepi.2019.05.029. Epub 2019 Jun 10.

Models with interactions overestimated heterogeneity of treatment effects and were prone to treatment mistargeting

David van Klaveren¹, Theodor A Balan², Ewout W Steyerberg³, David M Kent⁴

Affiliations

¹ Department of Public Health, Erasmus University Medical Center, Rotterdam, the Netherlands; Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, MA, USA. Electronic address: d.vanklaveren@erasmusmc.nl.
² Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands.
³ Department of Public Health, Erasmus University Medical Center, Rotterdam, the Netherlands; Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands.
⁴ Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, MA, USA.

PMID: 31195109
PMCID: PMC7497896
DOI: 10.1016/j.jclinepi.2019.05.029

Comparative Study

Models with interactions overestimated heterogeneity of treatment effects and were prone to treatment mistargeting

David van Klaveren et al. J Clin Epidemiol. 2019 Oct.

. 2019 Oct:114:72-83.

doi: 10.1016/j.jclinepi.2019.05.029. Epub 2019 Jun 10.

Authors

David van Klaveren¹, Theodor A Balan², Ewout W Steyerberg³, David M Kent⁴

Affiliations

¹ Department of Public Health, Erasmus University Medical Center, Rotterdam, the Netherlands; Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, MA, USA. Electronic address: d.vanklaveren@erasmusmc.nl.
² Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands.
³ Department of Public Health, Erasmus University Medical Center, Rotterdam, the Netherlands; Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands.
⁴ Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, MA, USA.

PMID: 31195109
PMCID: PMC7497896
DOI: 10.1016/j.jclinepi.2019.05.029

Abstract

Objectives: We aimed to compare the performance of different regression modeling approaches for the prediction of heterogeneous treatment effects.

Study design and setting: We simulated trial samples (n = 3,600; 80% power for a treatment odds ratio of 0.8) from a superpopulation (N = 1,000,000) with 12 binary risk predictors, both without and with six true treatment interactions. We assessed predictions of treatment benefit for four regression models: a "risk model" (with a constant effect of treatment assignment) and three "effect models" (including interactions of risk predictors with treatment assignment). Three novel performance measures were evaluated: calibration for benefit (i.e., observed vs. predicted risk difference in treated vs. untreated), discrimination for benefit, and prediction error for benefit.

Results: The risk modeling approach was well-calibrated for benefit, whereas effect models were consistently overfit, even with doubled sample sizes. Penalized regression reduced miscalibration of the effect models considerably. In terms of discrimination and prediction error, the risk modeling approach was superior in the absence of true treatment effect interactions, whereas penalized regression was optimal in the presence of true treatment interactions.

Conclusion: A risk modeling approach yields models consistently well calibrated for benefit. Effect modeling may improve discrimination for benefit in the presence of true interactions but is prone to overfitting. Hence, effect models-including only plausible interactions-should be fitted using penalized regression.

Keywords: Heterogeneity of treatment effect; Penalized regression analysis; Personalized medicine; Prediction models; Regression analysis; Treatment benefit.

PubMed Disclaimer

Figures

**Figure 1. Risk modeling on only the control patients led to an exaggeration of the treatment effect heterogeneity across predicted risk quartiles**
In the base case simulation scenario without true interaction, the risk model was fitted in either the whole sample or in the control patients of the sample. When the model fitted on the whole sample was used for stratification in risk quartiles (panel A), the observed benefit in the sample (brown bars) is an unbiased estimate of the observed benefit in the population (white bars). In contrast, when the model fitted on the control patients was used for stratification in risk quartiles (panel B), the observed benefit in the sample is too heterogeneous across risk quartiles compared to the observed benefit in the population.

**Figure 2. Effect modeling approaches were seriously overfit and were prone to treatment mistargeting in the absence of true treatment interaction**
Benefit predictions were based on different models fitted in the samples: a model without treatment interactions (panel A), a model with all treatment interactions (panel B), a model with all treatment interactions using backward selection based on AIC (panel C), and a model with all treatment interactions fitted with Lasso regression (panel D). The agreement between predicted (brown bars) and observed (white bars) benefit in predicted benefit quartiles of the population was better for the risk modeling approach (A) compared to the effect modeling approaches (B-D). Moreover, the risk modeling approach (A) resulted in more heterogeneity of observed benefit, i.e. is better able to distinguish between patients with low and patients with high benefit.

Figure 3. Effect modeling approaches were better able to separate patients with differential benefit in the presence of true treatment interaction, but required penalized regression to prevent overfitting
Benefit predictions were based on different models fitted in the samples: a model without treatment interactions (panel A), a model with all treatment interactions (panel B), a model with all treatment interactions using backward selection based on AIC (panel C), and a model with all treatment interactions fitted with Lasso regression (panel D). The agreement between predicted benefit (brown bars) and observed benefit (white bars) in predicted benefit quartiles of the population is better for both the risk modeling approach (A) and the Lasso regression approach (D) compared to the unpenalized effect modeling approaches (B-C). However, the Lasso regression approach (D) resulted in more heterogeneity of observed benefit than the risk modeling approach (A), i.e. is better able to distinguish between patients with low and patients with high benefit.

Figure 4. The risk modeling approach (Constant relative treatment effect) discriminated best in the absence of true interactions, but left considerable treatment benefit heterogeneity undetected in the presence of true interactions
Extreme quartile difference (EQD) represents the difference between the observed benefit in the fourth quartile and the observed benefit in the first quartile of the population in base case simulation scenarios without true interactions (panel A) and with true interactions (panel B). The maximum achievable EQD of the true model is represented by the dashed horizontal lines. In the presence of true interactions (B), penalized regression approaches (Standard Lasso and Standard Ridge) discriminated similarly to unpenalized regression (All interactions).

Figure 5. The risk modeling approach (Constant relative treatment effect) had minimal prediction error in the absence of true interactions, but was outperformed by penalized regression approaches (Standard Lasso or Ridge) in the presence of true interactions
The root mean squared error (rMSE) represents the root of the mean of the square differences between predicted benefit and true benefit in the population for base case simulation scenarios without true interaction (panel A), and with true interaction (panel B).

See this image and copyright information in PMC

References

1. Varadhan R, Segal JB, Boyd CM, Wu AW, Weiss CO. A framework for the analysis of heterogeneity of treatment effect in patient-centered outcomes research. J Clin Epidemiol. 2013;66:818–25. - PMC - PubMed
1. Kent DM, Steyerberg EW, van Klaveren D. Personalized Evidence Based Medicine: Predictive Approaches to Heterogeneous Treatment Effects. BMJ. 2018. In Press. - PMC - PubMed
1. Kent DM, Hayward RA. Limitations of applying summary results of clinical trials to individual patients: the need for risk stratification. JAMA. 2007;298:1209–12. - PubMed
1. Kent DM, Rothwell PM, Ioannidis JP, Altman DG, Hayward RA. Assessing and reporting heterogeneity in treatment effects in clinical trials: a proposal. Trials. 2010;11:85. - PMC - PubMed
1. Abadie A, Chingos MM, West MR. Endogenous Stratification in Randomized Experiments. National Bureau of Economic Research Working Paper Series. 2013;No. 19742.

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Models with interactions overestimated heterogeneity of treatment effects and were prone to treatment mistargeting

Affiliations

Models with interactions overestimated heterogeneity of treatment effects and were prone to treatment mistargeting

Authors

Affiliations

Abstract

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources