Phys Imaging Radiat Oncol. 2023 May 16:26:100450.

doi: 10.1016/j.phro.2023.100450. eCollection 2023 Apr.

Multi-centre radiomics for prediction of recurrence following radical radiotherapy for head and neck cancers: Consequences of feature selection, machine learning classifiers and batch-effect harmonization

Amal Joseph Varghese¹, Varsha Gouthamchand², Balu Krishna Sasidharan¹, Leonard Wee², Sharief K Sidhique¹, Julia Priyadarshini Rao¹, Andre Dekker², Frank Hoebers², Devadhas Devakumar³, Aparna Irodi⁴, Timothy Peace Balasingh¹, Henry Finlay Godson¹, T Joel¹, Manu Mathew¹, Rajesh Gunasingam Isiah¹, Simon Pradeep Pavamani¹, Hannah Mary T Thomas¹

Affiliations

¹ Department of Radiation Oncology, Christian Medical College, Vellore, Tamil Nadu, India.
² Department of Radiation Oncology (Maastro), GROW School for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, The Netherlands.
³ Department of Nuclear Medicine, Christian Medical College, Vellore, Tamil Nadu, India.
⁴ Department of Radiology, Christian Medical College, Vellore, Tamil Nadu, India.

PMID: 37260438
PMCID: PMC10227455
DOI: 10.1016/j.phro.2023.100450

Amal Joseph Varghese et al. Phys Imaging Radiat Oncol. 2023.

Abstract

Background and purpose: Radiomics models trained on limited single-institution data are often neither reproducible nor generalisable. We developed radiomics models to predict loco-regional recurrence within two years of radiotherapy, using private and public datasets and their combinations to simulate small and multi-institutional studies. We then examined how the models respond to feature selection, machine learning algorithm, centre-effect harmonization and increased dataset size.

Materials and methods: We included 562 patients with histologically confirmed, treated locally advanced head-and-neck cancer (LA-HNC) from two public and two private datasets; one private dataset was reserved exclusively for validation. Clinical contours of the primary tumours were used without recontouring for PyRadiomics-based feature extraction. ComBat harmonization was applied, and LASSO-Logistic Regression (LR) and Support Vector Machine (SVM) models were built. Predictive performance was reported as the area under the receiver operating characteristic curve (AUC) with a 95% confidence interval (CI) from 1000 bootstrap samples. We evaluated how model performance responded to the choice of feature selection method, ComBat harmonization, machine learning classifier, and single versus pooled data.
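The bootstrapped AUC interval described in the methods can be illustrated with a minimal sketch. It assumes a percentile bootstrap over resampled validation cases, which the abstract does not confirm; the function names, the synthetic labels and scores, and the seed are all hypothetical and are not taken from the study.

```python
import random


def auc_score(labels, scores):
    """AUC as the probability that a random positive case outranks a
    random negative case, with ties counted as one half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_auc_ci(labels, scores, n_boot=1000, alpha=0.05, seed=0):
    """Return (AUC, lower, upper) using a percentile bootstrap.

    Assumption: cases are resampled with replacement; resamples that
    contain only one class are redrawn because AUC is undefined there.
    """
    rng = random.Random(seed)
    n = len(labels)
    aucs = []
    while len(aucs) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if len(set(ys)) < 2:
            continue
        aucs.append(auc_score(ys, [scores[i] for i in idx]))
    aucs.sort()
    lower = aucs[int(alpha / 2 * n_boot)]
    upper = aucs[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return auc_score(labels, scores), lower, upper


if __name__ == "__main__":
    # Synthetic stand-in for model outputs on a validation cohort.
    demo_rng = random.Random(42)
    labels = [1 if demo_rng.random() < 0.3 else 0 for _ in range(100)]
    scores = [demo_rng.gauss(0.6 if y else 0.4, 0.2) for y in labels]
    auc, lower, upper = bootstrap_auc_ci(labels, scores)
    print(f"AUC = {auc:.3f} (95% CI {lower:.3f}-{upper:.3f})")
```

The pairwise AUC is quadratic in cohort size, which is acceptable for a sketch; a rank-based formula or a library routine would be the usual choice for larger cohorts.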

Results: LASSO and SelectKBest selected 14 and 16 features, respectively; three features were selected by both. Without ComBat, the LR and SVM models trained on the pooled three-institution data showed AUCs (95% CI) of 0.513 (0.481-0.559) and 0.632 (0.586-0.665), respectively. After ComBat, the corresponding AUCs were 0.559 (0.536-0.590) and 0.662 (0.606-0.690). Compared with the single-cohort AUCs (0.562-0.629), SVM models from pooled data performed significantly better at AUC = 0.680.
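The with- and without-ComBat comparison rests on removing centre-specific shifts in feature distributions before pooling. The sketch below shows only that underlying idea, using plain per-centre location-scale alignment of one feature. It is not ComBat, which estimates batch effects with an empirical Bayes model, and it is not the authors' pipeline; the function name, centre labels and synthetic values are hypothetical.

```python
import random
import statistics


def harmonize_location_scale(values, centres):
    """Align each centre's mean and SD of one feature to the pooled
    mean and SD. A simplified stand-in for batch-effect harmonization."""
    pooled_mean = statistics.fmean(values)
    pooled_sd = statistics.pstdev(values)

    # Group feature values by centre to get per-centre mean and SD.
    by_centre = {}
    for v, c in zip(values, centres):
        by_centre.setdefault(c, []).append(v)
    centre_stats = {
        c: (statistics.fmean(vs), statistics.pstdev(vs))
        for c, vs in by_centre.items()
    }

    harmonized = []
    for v, c in zip(values, centres):
        mean_c, sd_c = centre_stats[c]
        z = (v - mean_c) / sd_c if sd_c > 0 else 0.0
        harmonized.append(pooled_mean + pooled_sd * z)
    return harmonized


if __name__ == "__main__":
    # Two hypothetical centres whose scanners shift and stretch one feature.
    rng = random.Random(7)
    values = [rng.gauss(0.20, 0.05) for _ in range(50)]
    values += [rng.gauss(0.35, 0.10) for _ in range(50)]
    centres = ["centre_A"] * 50 + ["centre_B"] * 50

    harmonized = harmonize_location_scale(values, centres)
    for name, data in (("before", values), ("after", harmonized)):
        mean_a = statistics.fmean(data[:50])
        mean_b = statistics.fmean(data[50:])
        print(f"{name}: centre_A mean = {mean_a:.3f}, centre_B mean = {mean_b:.3f}")
```

Alignment of this kind also removes any genuine difference between centre populations, which is one reason harmonization may improve pooled performance only modestly.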

Conclusions: Multi-institutional retrospective data accentuate the existing sources of variability that affect radiomics. Carefully designed prospective, multi-institutional studies and data sharing are necessary for clinically relevant head-and-neck cancer prognostication models.

Keywords: Head-and-neck cancer; Loco-regional recurrence; Machine learning; Multi-institutional; Prognosis; Radiomics.

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

**Fig. 1**
Features selected by LASSO for the single and multi-institutional pooled datasets.

**Fig. 2**
Features selected by SelectKBest for the single and multi-institutional pooled datasets.

**Fig. 3**
The Kernel Density Estimate (KDE) (A and C) and Box (B and D) plots for one representative feature (GLCM Maximum Probability) before (A and B) and after (C and D) ComBat harmonization.

**Fig. 4**
Performance of the LR and SVM models trained on pooled datasets prior to and post ComBat harmonization. Model performance is reported on validation data HN3-MAASTRO.

**Fig. 5**
Performance of the models trained with data from a single institution versus multi-institutional pooled data. Validation ROC curves of the Logistic Regression (A) and SVM (B) loco-regional recurrence (LRR) models for an example single dataset (HN-CMC) and its pooled dataset combinations. The ROC curves correspond to the HN-CMC (red), HN-CMC + HN1-MAASTRO (blue), HN-CMC + HN-MONTREAL (green) and HN-CMC + HN1-MAASTRO + HN-MONTREAL (orange) datasets, respectively. (C) Test AUC across all single and pooled datasets in this experiment. Validation data was HN3-MAASTRO.

Cited by

The effect of harmonization on the variability of PET radiomic features extracted using various segmentation methods.
Hosseini SA, Shiri I, Ghaffarian P, Hajianfar G, Avval AH, Seyfi M, Servaes S, Rosa-Neto P, Zaidi H, Ay MR. Ann Nucl Med. 2024 Jul;38(7):493-507. doi: 10.1007/s12149-024-01923-7. Epub 2024 Apr 4. PMID: 38575814
Performance Analysis of Six Semi-Automated Tumour Delineation Methods on [¹⁸F] Fluorodeoxyglucose Positron Emission Tomography/Computed Tomography (FDG PET/CT) in Patients with Head and Neck Cancer.
Bianconi F, Salis R, Fravolini ML, Khan MU, Minestrini M, Filippi L, Marongiu A, Nuvoli S, Spanu A, Palumbo B. Sensors (Basel). 2023 Sep 18;23(18):7952. doi: 10.3390/s23187952. PMID: 37766009
Reproducibility in Radiomics: A Comparison of Feature Extraction Methods and Two Independent Datasets.
Thomas HMT, Wang HYC, Varghese AJ, Donovan EM, South CP, Saxby H, Nisbet A, Prakash V, Sasidharan BK, Pavamani SP, Devadhas D, Mathew M, Isiah RG, Evans PM. Appl Sci (Basel). 2024 Feb 20;166(1):s00701-024-05977-4. doi: 10.3390/app13127291. PMID: 38725869
