Multi-centre radiomics for prediction of recurrence following radical radiotherapy for head and neck cancers: Consequences of feature selection, machine learning classifiers and batch-effect harmonization
- PMID: 37260438
- PMCID: PMC10227455
- DOI: 10.1016/j.phro.2023.100450
Multi-centre radiomics for prediction of recurrence following radical radiotherapy for head and neck cancers: Consequences of feature selection, machine learning classifiers and batch-effect harmonization
Abstract
Background and purpose: Radiomics models trained with limited single institution data are often not reproducible and generalisable. We developed radiomics models that predict loco-regional recurrence within two years of radiotherapy with private and public datasets and their combinations, to simulate small and multi-institutional studies and study the responsiveness of the models to feature selection, machine learning algorithms, centre-effect harmonization and increased dataset sizes.
Materials and methods: 562 patients histologically confirmed and treated for locally advanced head-and-neck cancer (LA-HNC) from two public and two private datasets; one private dataset exclusively reserved for validation. Clinical contours of primary tumours were not recontoured and were used for Pyradiomics based feature extraction. ComBat harmonization was applied, and LASSO-Logistic Regression (LR) and Support Vector Machine (SVM) models were built. 95% confidence interval (CI) of 1000 bootstrapped area-under-the-Receiver-operating-curves (AUC) provided predictive performance. Responsiveness of the models' performance to the choice of feature selection methods, ComBat harmonization, machine learning classifier, single and pooled data was evaluated.
Results: LASSO and SelectKBest selected 14 and 16 features, respectively; three were overlapping. Without ComBat, the LR and SVM models for three institutional data showed AUCs (CI) of 0.513 (0.481-0.559) and 0.632 (0.586-0.665), respectively. Performances following ComBat revealed AUCs of 0.559 (0.536-0.590) and 0.662 (0.606-0.690), respectively. Compared to single cohort AUCs (0.562-0.629), SVM models from pooled data performed significantly better at AUC = 0.680.
Conclusions: Multi-institutional retrospective data accentuates the existing variabilities that affect radiomics. Carefully designed prospective, multi-institutional studies and data sharing are necessary for clinically relevant head-and-neck cancer prognostication models.
Keywords: Head-and-neck cancer; Loco-regional recurrence; Machine learning; Multi-institutional; Prognosis; Radiomics.
© 2023 The Authors. Published by Elsevier B.V. on behalf of European Society of Radiotherapy & Oncology.
Conflict of interest statement
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Figures





Similar articles
-
Impact of harmonization on predicting complications in head and neck cancer after radiotherapy using MRI radiomics and machine learning techniques.Med Phys. 2025 Jun;52(6):5091-5103. doi: 10.1002/mp.17793. Epub 2025 Mar 31. Med Phys. 2025. PMID: 40162683
-
A multi-objective radiomics model for the prediction of locoregional recurrence in head and neck squamous cell cancer.Med Phys. 2020 Oct;47(10):5392-5400. doi: 10.1002/mp.14388. Epub 2020 Aug 5. Med Phys. 2020. PMID: 32657426
-
A comparison of different machine learning classifiers in predicting xerostomia and sticky saliva due to head and neck radiotherapy using a multi-objective, multimodal radiomics model.Biomed Phys Eng Express. 2025 Feb 6;11(2). doi: 10.1088/2057-1976/adafac. Biomed Phys Eng Express. 2025. PMID: 39879644
-
Phenotyping the Histopathological Subtypes of Non-Small-Cell Lung Carcinoma: How Beneficial Is Radiomics?Diagnostics (Basel). 2023 Mar 18;13(6):1167. doi: 10.3390/diagnostics13061167. Diagnostics (Basel). 2023. PMID: 36980475 Free PMC article.
-
Multimodality radiomics prediction of radiotherapy-induced the early proctitis and cystitis in rectal cancer patients: a machine learning study.Biomed Phys Eng Express. 2023 Dec 20;10(1). doi: 10.1088/2057-1976/ad0f3e. Biomed Phys Eng Express. 2023. PMID: 37995359
Cited by
-
The effect of harmonization on the variability of PET radiomic features extracted using various segmentation methods.Ann Nucl Med. 2024 Jul;38(7):493-507. doi: 10.1007/s12149-024-01923-7. Epub 2024 Apr 4. Ann Nucl Med. 2024. PMID: 38575814 Free PMC article.
-
Performance Analysis of Six Semi-Automated Tumour Delineation Methods on [18F] Fluorodeoxyglucose Positron Emission Tomography/Computed Tomography (FDG PET/CT) in Patients with Head and Neck Cancer.Sensors (Basel). 2023 Sep 18;23(18):7952. doi: 10.3390/s23187952. Sensors (Basel). 2023. PMID: 37766009 Free PMC article.
-
Reproducibility in Radiomics: A Comparison of Feature Extraction Methods and Two Independent Datasets.Appl Sci (Basel). 2024 Feb 20;166(1):s00701-024-05977-4. doi: 10.3390/app13127291. Appl Sci (Basel). 2024. PMID: 38725869 Free PMC article.
References
-
- Elhalawani H., Mohamed A.S., Mulder S., Grossberg A., Smith K.E., Gunn G.B., et al. Radiomics prediction of radiation treatment outcomes in oropharyngeal cancer: a clinical and image repository in concert with the cancer imaging archive (TCIA) Int J Radiat Oncol Biol Phys. 2018;102:e215–e216. doi: 10.1016/j.ijrobp.2018.07.748. - DOI
Grants and funding
LinkOut - more resources
Full Text Sources