Post-processing steps improve generalisability and robustness of an MRI-based radiogenomic model for human papillomavirus status prediction in oropharyngeal cancer
- PMID: 40478348
- PMCID: PMC12634727
- DOI: 10.1007/s00330-025-11709-8
Abstract
Objectives: To assess the impact of image post-processing steps on the generalisability of MRI-based radiogenomic models. Using a model that predicts human papillomavirus (HPV) status in oropharyngeal squamous cell carcinoma (OPSCC), this study examines whether different post-processing strategies can increase generalisability across data from different centres and image acquisition protocols.
Materials and methods: Contrast-enhanced T1-weighted MR images of OPSCC patients of two cohorts from different centres, with confirmed HPV status, were manually segmented. After radiomic feature extraction, the HPV prediction model trained on a training set with 91 patients was subsequently tested on two independent cohorts: a test set with 62 patients and an externally derived cohort of 157 patients. The data processing options included: data harmonisation, a process to ensure consistency in data from different centres; exclusion of unstable features across different segmentations and scan protocols; and removal of highly correlated features to reduce redundancy.
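The last of the listed processing options, removal of highly correlated features, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the correlation measure or cut-off, so the Pearson coefficient, the 0.9 threshold, the greedy keep-first strategy, and the toy feature names and values are all assumptions made for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length lists of numbers."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        # A constant feature has no defined correlation; treat it as 0 here.
        return 0.0
    return cov / sqrt(vx * vy)


def drop_correlated(features, threshold=0.9):
    """Greedy filter: keep a feature only if its absolute correlation with
    every feature kept so far is below `threshold` (assumed value)."""
    kept = []
    for name, values in features.items():
        if all(abs(pearson(values, features[k])) < threshold for k in kept):
            kept.append(name)
    return kept


# Hypothetical radiomic features for five patients (made-up numbers).
toy = {
    "volume": [1.0, 2.0, 3.0, 4.0, 5.0],
    "surface_area": [2.1, 4.0, 6.2, 8.1, 9.9],  # nearly proportional to volume
    "entropy": [0.9, 0.1, 0.7, 0.2, 0.6],
}
selected = drop_correlated(toy)
print(selected)  # ['volume', 'entropy']
```

A greedy filter depends on feature order, so in practice the features would usually be ranked first (for example by stability) so that the more reliable member of each correlated pair is the one retained.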
Results: The predictive model, trained without post-processing, showed high performance on the test set, with an AUC of 0.79 (95% CI: 0.66-0.90, p < 0.001). However, when tested on the external data, the model performed less well, resulting in an AUC of 0.52 (95% CI: 0.45-0.58, p = 0.334). The model's generalisability substantially improved after performing post-processing steps. The AUC for the test set reached 0.76 (95% CI: 0.63-0.87, p < 0.001), while for the external cohort, the predictive model achieved an AUC of 0.73 (95% CI: 0.64-0.81, p < 0.001).
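For readers unfamiliar with how an AUC and its confidence interval are typically obtained, the following is a minimal sketch. The abstract does not say how the reported intervals were computed; the percentile bootstrap used here is one common choice and is an assumption, as are the function names, the resampling count, and the toy labels and scores.

```python
import random


def auc(labels, scores):
    """AUC as the probability that a positive case scores higher than a
    negative case, with ties counted as one half."""
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_ci(labels, scores, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the AUC (assumed method)."""
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if len(set(ys)) < 2:
            continue  # skip resamples that contain only one class
        stats.append(auc(ys, [scores[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


# Hypothetical HPV labels (1 = positive) and model scores (made-up numbers).
labels = [0, 0, 0, 0, 1, 1, 1, 1]
scores = [0.1, 0.2, 0.3, 0.6, 0.4, 0.7, 0.8, 0.9]
point = auc(labels, scores)
lower, upper = bootstrap_ci(labels, scores)
print(point)  # 0.9375
```

With only eight toy cases the interval is very wide; cohorts of the size reported above (62 and 157 patients) would give considerably narrower intervals.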
Conclusions: When applied before model development, post-processing steps can enhance the robustness and generalisability of predictive radiogenomics models.
Key points:
- Question: How do post-processing steps impact the generalisability of MRI-based radiogenomic prediction models?
- Findings: Applying post-processing steps, i.e., data harmonisation, identification of stable radiomic features, and removal of correlated features, before model development can improve model robustness and generalisability.
- Clinical relevance: Post-processing steps in MRI radiogenomic model generation lead to reliable non-invasive diagnostic tools for personalised cancer treatment strategies.
Keywords: Human papillomavirus; Imaging genomics; Machine learning; Magnetic resonance imaging; Radiomics.
© 2025. The Author(s).
Conflict of interest statement
Compliance with ethical standards.
- Guarantor: The scientific guarantor of this publication is Prof. Michiel van den Brekel.
- Conflict of interest: R.G.H.B.T. is a member of the Scientific Editorial Board of European Radiology (section: oncology). As such, they have not participated in the selection or review processes for this article. The remaining authors report no conflicts of interest.
- Statistics and biometry: No complex statistical methods were necessary for this paper.
- Informed consent: Written informed consent was obtained from all patients at our institution, and the institutional review board granted permission for this study.
- Ethical approval: Institutional Review Board approval was obtained.
- Study subjects or cohorts overlap: An abstract based on our findings has been accepted as a poster presentation at the 110th Scientific Assembly and Annual Meeting of the Radiological Society of North America, December 1–5, 2024, Chicago, Illinois. The presenting author of this abstract is Milad Ahmadian. This study spans two datasets derived from two different centres. To our knowledge, parts of this data have been reported in: https://doi.org/10.1016/j.ejrad.2021.109701 , https://doi.org/10.1002/hed.26505 , and https://doi.org/10.1007/s00330-022-09255-8 . These studies predominantly focused on the pilot study of HPV prediction or specifically on clinical outcomes. Our work focuses on the external validation of this established model as well as measuring the impact of data pre/post-processing on generalisability.
- Methodology: Retrospective; multicentre study; diagnostic or prognostic study.
