Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers

doi:10.1002/mp.12967

Comparative Study

. 2018 Jul;45(7):3449-3459.

doi: 10.1002/mp.12967. Epub 2018 Jun 13.

Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers

Timo M Deist^{1

2}, Frank J W M Dankers^{2

3}, Gilmer Valdes⁴, Robin Wijsman³, I-Chow Hsu⁴, Cary Oberije², Tim Lustberg⁵, Johan van Soest⁵, Frank Hoebers⁵, Arthur Jochems^{1

2}, Issam El Naqa⁶, Leonard Wee⁵, Olivier Morin⁴, David R Raleigh⁴, Wouter Bots^{3

7}, Johannes H Kaanders³, José Belderbos⁸, Margriet Kwint⁸, Timothy Solberg⁴, René Monshouwer³, Johan Bussink³, Andre Dekker⁵, Philippe Lambin¹

Affiliations

¹ The D-lab: Decision Support for Precision Medicine, GROW - School for Oncology and Developmental Biology, Maastricht University Medical Centre+, Universiteitssingel 40, 6229 ER, Maastricht, The Netherlands.
² Department of Radiation Oncology, GROW, School for Oncology and Developmental Biology, Maastricht University Medical Center, Maastricht, The Netherlands.
³ Department of Radiation Oncology, Radboud University Medical Center, Nijmegen, The Netherlands.
⁴ Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, USA.
⁵ Department of Radiation Oncology (MAASTRO), GROW, School for Oncology and Developmental Biology, Maastricht University Medical Center, Maastricht, The Netherlands.
⁶ Department of Radiation Oncology, University of Michigan, Ann Arbor, Michigan, USA.
⁷ Institute for Hyperbaric Oxygen (IvHG), Arnhem, The Netherlands.
⁸ Department of Radiation Oncology, The Netherlands Cancer Institute-Antoni van Leeuwenhoek Hospital, Amsterdam, The Netherlands.

PMID: 29763967
PMCID: PMC6095141
DOI: 10.1002/mp.12967

Comparative Study

Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers

Timo M Deist et al. Med Phys. 2018 Jul.

. 2018 Jul;45(7):3449-3459.

doi: 10.1002/mp.12967. Epub 2018 Jun 13.

Authors

Affiliations

¹ The D-lab: Decision Support for Precision Medicine, GROW - School for Oncology and Developmental Biology, Maastricht University Medical Centre+, Universiteitssingel 40, 6229 ER, Maastricht, The Netherlands.
² Department of Radiation Oncology, GROW, School for Oncology and Developmental Biology, Maastricht University Medical Center, Maastricht, The Netherlands.
³ Department of Radiation Oncology, Radboud University Medical Center, Nijmegen, The Netherlands.
⁴ Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, USA.
⁵ Department of Radiation Oncology (MAASTRO), GROW, School for Oncology and Developmental Biology, Maastricht University Medical Center, Maastricht, The Netherlands.
⁶ Department of Radiation Oncology, University of Michigan, Ann Arbor, Michigan, USA.
⁷ Institute for Hyperbaric Oxygen (IvHG), Arnhem, The Netherlands.
⁸ Department of Radiation Oncology, The Netherlands Cancer Institute-Antoni van Leeuwenhoek Hospital, Amsterdam, The Netherlands.

PMID: 29763967
PMCID: PMC6095141
DOI: 10.1002/mp.12967

Erratum in

Erratum: "Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers" [Med. Phys. 45 (7), 3449-3459 (2018)].
Deist TM, Dankers FJWM, Valdes G, Wijsman R, Hsu IC, Oberije C, Lustberg T, van Soest J, Hoebers F, Jochems A, El Naqa I, Wee L, Morin O, Raleigh DR, Bots W, Kaanders JH, Belderbos J, Kwint M, Solberg T, Monshouwer R, Bussink J, Dekker A, Lambin P. Deist TM, et al. Med Phys. 2019 Feb;46(2):1080-1087. doi: 10.1002/mp.13329. Epub 2019 Jan 7. Med Phys. 2019. PMID: 30730570 Free PMC article. No abstract available.

Abstract

Purpose: Machine learning classification algorithms (classifiers) for prediction of treatment response are becoming more popular in radiotherapy literature. General Machine learning literature provides evidence in favor of some classifier families (random forest, support vector machine, gradient boosting) in terms of classification performance. The purpose of this study is to compare such classifiers specifically for (chemo)radiotherapy datasets and to estimate their average discriminative performance for radiation treatment outcome prediction.

Methods: We collected 12 datasets (3496 patients) from prior studies on post-(chemo)radiotherapy toxicity, survival, or tumor control with clinical, dosimetric, or blood biomarker features from multiple institutions and for different tumor sites, that is, (non-)small-cell lung cancer, head and neck cancer, and meningioma. Six common classification algorithms with built-in feature selection (decision tree, random forest, neural network, support vector machine, elastic net logistic regression, LogitBoost) were applied on each dataset using the popular open-source R package caret. The R code and documentation for the analysis are available online (https://github.com/timodeist/classifier_selection_code). All classifiers were run on each dataset in a 100-repeated nested fivefold cross-validation with hyperparameter tuning. Performance metrics (AUC, calibration slope and intercept, accuracy, Cohen's kappa, and Brier score) were computed. We ranked classifiers by AUC to determine which classifier is likely to also perform well in future studies. We simulated the benefit for potential investigators to select a certain classifier for a new dataset based on our study (pre-selection based on other datasets) or estimating the best classifier for a dataset (set-specific selection based on information from the new dataset) compared with uninformed classifier selection (random selection).

Results: Random forest (best in 6/12 datasets) and elastic net logistic regression (best in 4/12 datasets) showed the overall best discrimination, but there was no single best classifier across datasets. Both classifiers had a median AUC rank of 2. Preselection and set-specific selection yielded a significant average AUC improvement of 0.02 and 0.02 over random selection with an average AUC rank improvement of 0.42 and 0.66, respectively.

Conclusion: Random forest and elastic net logistic regression yield higher discriminative performance in (chemo)radiotherapy outcome and toxicity prediction than other studied classifiers. Thus, one of these two classifiers should be the first choice for investigators when building classification models or to benchmark one's own modeling results against. Our results also show that an informed preselection of classifiers based on existing datasets can improve discrimination over random selection.

Keywords: classification; machine learning; outcome prediction; predictive modeling; radiotherapy.

PubMed Disclaimer

Conflict of interest statement

Andre Dekker, Johan van Soest, and Tim Lustberg are founders and shareholders of Medical Data Works B.V., which provides consulting on medical data collection and analysis projects. Cary Oberije is CEO of ptTheragnostic B.V. Philippe Lambin is member of the advisory board of ptTheragnostic B.V.

Figures

**Figure 1**
Experimental design: each dataset is split into five stratified outer folds (step 1). For each of the folds, the data are preprocessed (imputation, dummy coding, deleting zero variance features, rescaling) (step 2). The hyperparameters are tuned in the training set via a fivefold inner CV (steps 3–5). Based on the selected hyperparameters, a model is learned on the training set (step 6) and applied on the test set (step 7). Performance metrics are calculated on the test set (step 8) and stored for all outer folds. This process is repeated 100 times for each classifier. Randomization seeds are stable across classifiers within a repetition to allow pairwise comparison. [Color figure can be viewed at wileyonlinelibrary.com]

**Figure 2**
Box and scatterplot of the AUC *rank* (lower being better) per outer fivefold CV aggregated over all datasets and repetitions (12 datasets × 100 repetitions = 1200 data points per classifier). [Color figure can be viewed at wileyonlinelibrary.com]

**Figure 3**
Pairwise comparisons of each classifier pair (12 datasets × 100 repetitions = 1200 comparisons per pair). The numbers in the plot indicate how often classifier A (y‐axis) achieved an AUC greater than classifier B (x‐axis). The color indicates whether the increased AUCs by classifier A are statistically significant (violet), insignificant (light violet), or have not been tested (gray). The significance cutoff was set to the 0.05‐level (one‐sided Wilcoxon signed‐rank test, Holm–Bonferroni correction for 15 tests). [Color figure can be viewed at wileyonlinelibrary.com]

**Figure 4**
The mean AUC for each pair of classifier and dataset (100 repetitions = 100 data points per pair). [Color figure can be viewed at wileyonlinelibrary.com]

**Figure 5**
The mean *rank* derived from the AUC (100 repetitions = 100 data points per pair). [Color figure can be viewed at wileyonlinelibrary.com]

See this image and copyright information in PMC

Cited by

Construction of a diagnostic model for osteoarthritis based on transcriptomic immune-related genes.
Chen B, Lin C, Jin X, Zhang X, Yang K, Wang J, Zhang F, Zhang Y, Ji Y, Meng Z. Chen B, et al. Heliyon. 2023 Dec 13;10(1):e23636. doi: 10.1016/j.heliyon.2023.e23636. eCollection 2024 Jan 15. Heliyon. 2023. PMID: 38187306 Free PMC article.
Machine learning for dose-volume histogram based clinical decision-making support system in radiation therapy plans for brain tumors.
Siciarz P, Alfaifi S, Uytven EV, Rathod S, Koul R, McCurdy B. Siciarz P, et al. Clin Transl Radiat Oncol. 2021 Sep 15;31:50-57. doi: 10.1016/j.ctro.2021.09.001. eCollection 2021 Nov. Clin Transl Radiat Oncol. 2021. PMID: 34632117 Free PMC article.
Gut Microbiota Characteristics of People with Obesity by Meta-Analysis of Existing Datasets.
Gong J, Shen Y, Zhang H, Cao M, Guo M, He J, Zhang B, Xiao C. Gong J, et al. Nutrients. 2022 Jul 21;14(14):2993. doi: 10.3390/nu14142993. Nutrients. 2022. PMID: 35889949 Free PMC article.
Expression patterns and immunological characterization of PANoptosis -related genes in gastric cancer.
Qing X, Jiang J, Yuan C, Xie K, Wang K. Qing X, et al. Front Endocrinol (Lausanne). 2023 Aug 18;14:1222072. doi: 10.3389/fendo.2023.1222072. eCollection 2023. Front Endocrinol (Lausanne). 2023. PMID: 37664853 Free PMC article.
Introduction to machine and deep learning for medical physicists.
Cui S, Tseng HH, Pakela J, Ten Haken RK, El Naqa I. Cui S, et al. Med Phys. 2020 Jun;47(5):e127-e147. doi: 10.1002/mp.14140. Med Phys. 2020. PMID: 32418339 Free PMC article.

See all "Cited by" articles

References

1. Lambin P, van Stiphout RGPM, Starmans MHW, et al. Predicting outcomes in radiation oncology–multifactorial decision support systems. Nat Rev Clin Oncol. 2013;10:27–40. - PMC - PubMed
1. Lambin P, Roelofs E, Reymen B, et al. Rapid learning health care in oncology’ – An approach towards decision support systems enabling customised radiotherapy. Radiother Oncol. 2013;109:159–164. - PubMed
1. Kuhn M, Wing J, Weston S, et al. Caret: Classification and Regression Training; 2016. https://CRAN.R-project.org/package=caret.
1. Fernández‐Delgado M, Cernadas E, Barro S, Amorim D. Do we need hundreds of classifiers to solve real world classification problems? J Mach Learn Res. 2014;15:3133–3181.
1. Wainer J. Comparison of 14 different families of classification algorithms on 115 binary datasets. ArXiv160600930 Cs. June 2016. http://arxiv.org/abs/1606.00930. Accessed April 8, 2017.

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

P01 CA059827/CA/NCI NIH HHS/United States

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations
Medical
- ClinicalTrials.gov

[1] Lambin P, van Stiphout RGPM, Starmans MHW, et al. Predicting outcomes in radiation oncology–multifactorial decision support systems. Nat Rev Clin Oncol. 2013;10:27–40. - PMC - PubMed

[2] Lambin P, van Stiphout RGPM, Starmans MHW, et al. Predicting outcomes in radiation oncology–multifactorial decision support systems. Nat Rev Clin Oncol. 2013;10:27–40. - PMC - PubMed

[3] Lambin P, Roelofs E, Reymen B, et al. Rapid learning health care in oncology’ – An approach towards decision support systems enabling customised radiotherapy. Radiother Oncol. 2013;109:159–164. - PubMed

[4] Lambin P, Roelofs E, Reymen B, et al. Rapid learning health care in oncology’ – An approach towards decision support systems enabling customised radiotherapy. Radiother Oncol. 2013;109:159–164. - PubMed

[5] Kuhn M, Wing J, Weston S, et al. Caret: Classification and Regression Training; 2016. https://CRAN.R-project.org/package=caret.

[6] Kuhn M, Wing J, Weston S, et al. Caret: Classification and Regression Training; 2016. https://CRAN.R-project.org/package=caret.

[7] Fernández‐Delgado M, Cernadas E, Barro S, Amorim D. Do we need hundreds of classifiers to solve real world classification problems? J Mach Learn Res. 2014;15:3133–3181.

[8] Fernández‐Delgado M, Cernadas E, Barro S, Amorim D. Do we need hundreds of classifiers to solve real world classification problems? J Mach Learn Res. 2014;15:3133–3181.

[9] Wainer J. Comparison of 14 different families of classification algorithms on 115 binary datasets. ArXiv160600930 Cs. June 2016. http://arxiv.org/abs/1606.00930. Accessed April 8, 2017.

[10] Wainer J. Comparison of 14 different families of classification algorithms on 115 binary datasets. ArXiv160600930 Cs. June 2016. http://arxiv.org/abs/1606.00930. Accessed April 8, 2017.

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers

Affiliations

Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers

Authors

Affiliations

Erratum in

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical

Erratum in

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical