Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Feb 18;15(1):5985.
doi: 10.1038/s41598-025-89053-6.

Survival and data-driven phenotypes in head and neck cancer

Affiliations

Survival and data-driven phenotypes in head and neck cancer

Anni Heinolainen et al. Sci Rep. .

Abstract

Head and neck cancer (HNC) is the seventh most common cancer worldwide, with a 5-year overall survival of 50-60%. Despite known survival factors, unexpected deaths underscore the need to refine pre-treatment survival predictions. This study aimed to identify novel data-driven HNC patient phenotypes, associated features and predict overall survival using deep survival clustering model VaDeSC. We used a retrospective cohort of 1341 HNC patients from Helsinki University Hospital, utilizing their pre-treatment clinical and demographic data from electronic health records. We identified six previously unrecognized HNC phenotypes with distinct survival patterns, highlighting the significance of BMI, sleep apnoea, TNM stage, tumour site, treatment intention, gender, and age-related treatment modalities. VaDeSC demonstrated strong predictive accuracy, clustering performance and generalizability, achieving C-index of 0.895 ± 0.015 on training and 0.782 ± 0.013 on test data. The phenotype purity was 0.102 ± 0.11 on training-validation data and 0.143 ± 0.006 on test data. Our findings demonstrated that clustering pre-treatment clinical survival data offers intuitively interpretable results with accurate individualised predictions. This approach holds significant potential for novel phenotype discovery across various diseases and endpoints.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Patient count per ICD-10 in the dataset.
Fig. 2
Fig. 2
Clustering results of the combined training validation set.
Fig. 3
Fig. 3
Cluster-specific significantly enriched features.
Fig. 4
Fig. 4
Features significantly enriched in multiple clusters.
Fig. 5
Fig. 5
Inclusion criteria of the patient cohort.

References

    1. Sung, H. et al. Global Cancer statistics 2020: GLOBOCAN estimates of incidence and Mortality Worldwide for 36 cancers in 185 countries. CA Cancer J. Clin.71, 209–249 (2021). - DOI - PubMed
    1. Giraldi, L. et al. Alcohol and cigarette consumption predict mortality in patients with head and neck cancer: a pooled analysis within the International Head and Neck Cancer Epidemiology (INHANCE) Consortium. Ann. Oncol.28, 2843–2851 (2017). - DOI - PMC - PubMed
    1. Eytan, D. F., Blackford, A. L., Eisele, D. W. & Fakhry, C. Prevalence of comorbidities and effect on survival in survivors of human papillomavirus-related and human papillomavirus-unrelated head and neck cancer in the United States. Cancer125, 249–260 (2019). - DOI - PubMed
    1. Mahal, B. A. et al. Incidence and demographic Burden of HPV-Associated Oropharyngeal Head and Neck Cancers in the United States. Cancer Epidemiol. Biomarkers Prev.28, 1660–1667 (2019). - DOI - PubMed
    1. Du, E. et al. Long-term survival in Head and Neck Cancer: Impact of Site, Stage, Smoking, and human papillomavirus status. Laryngoscope129, 2506–2513 (2019). - DOI - PMC - PubMed

LinkOut - more resources