Enhancing the reliability of deep learning-based head and neck tumour segmentation using uncertainty estimation with multi-modal images

Jintao Ren^{1

2

3}, Jonas Teuwen⁴, Jasper Nijkamp^{1

3}, Mathis Rasmussen^{1

2

3}, Zeno Gouw⁴, Jesper Grau Eriksen^{2

3}, Jan-Jakob Sonke⁴, Stine Korreman^{1

2

3}

Affiliations

¹ Danish Centre for Particle Therapy, Aarhus University Hospital, Palle Juul-Jensens Boulevard 25, 8200 Aarhus N, Denmark.
² Department of Oncology, Aarhus University Hospital, Palle Juul-Jensens Boulevard 25, 8200 Aarhus N, Denmark.
³ Department of Clinical Medicine, Aarhus University, Palle Juul-Jensens Boulevard 25, 8200 Aarhus N, Denmark.
⁴ Department of Radiation Oncology, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands.

PMID: 39059432
DOI: 10.1088/1361-6560/ad682d

Enhancing the reliability of deep learning-based head and neck tumour segmentation using uncertainty estimation with multi-modal images

Jintao Ren et al. Phys Med Biol. 2024.

. 2024 Aug 5;69(16).

doi: 10.1088/1361-6560/ad682d.

Authors

Jintao Ren^{1

2

3}, Jonas Teuwen⁴, Jasper Nijkamp^{1

3}, Mathis Rasmussen^{1

2

3}, Zeno Gouw⁴, Jesper Grau Eriksen^{2

3}, Jan-Jakob Sonke⁴, Stine Korreman^{1

2

3}

Affiliations

¹ Danish Centre for Particle Therapy, Aarhus University Hospital, Palle Juul-Jensens Boulevard 25, 8200 Aarhus N, Denmark.
² Department of Oncology, Aarhus University Hospital, Palle Juul-Jensens Boulevard 25, 8200 Aarhus N, Denmark.
³ Department of Clinical Medicine, Aarhus University, Palle Juul-Jensens Boulevard 25, 8200 Aarhus N, Denmark.
⁴ Department of Radiation Oncology, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands.

PMID: 39059432
DOI: 10.1088/1361-6560/ad682d

Abstract

Objective.Deep learning shows promise in autosegmentation of head and neck cancer (HNC) primary tumours (GTV-T) and nodal metastases (GTV-N). However, errors such as including non-tumour regions or missing nodal metastases still occur. Conventional methods often make overconfident predictions, compromising reliability. Incorporating uncertainty estimation, which provides calibrated confidence intervals can address this issue. Our aim was to investigate the efficacy of various uncertainty estimation methods in improving segmentation reliability. We evaluated their confidence levels in voxel predictions and ability to reveal potential segmentation errors.Approach.We retrospectively collected data from 567 HNC patients with diverse cancer sites and multi-modality images (CT, PET, T1-, and T2-weighted MRI) along with their clinical GTV-T/N delineations. Using the nnUNet 3D segmentation pipeline, we compared seven uncertainty estimation methods, evaluating them based on segmentation accuracy (Dice similarity coefficient, DSC), confidence calibration (Expected Calibration Error, ECE), and their ability to reveal segmentation errors (Uncertainty-Error overlap using DSC, UE-DSC).Main results.Evaluated on the hold-out test dataset (n= 97), the median DSC scores for GTV-T and GTV-N segmentation across all uncertainty estimation methods had a narrow range, from 0.73 to 0.76 and 0.78 to 0.80, respectively. In contrast, the median ECE exhibited a wider range, from 0.30 to 0.12 for GTV-T and 0.25 to 0.09 for GTV-N. Similarly, the median UE-DSC also ranged broadly, from 0.21 to 0.38 for GTV-T and 0.22 to 0.36 for GTV-N. A probabilistic network-PhiSeg method consistently demonstrated the best performance in terms of ECE and UE-DSC.Significance.Our study highlights the importance of uncertainty estimation in enhancing the reliability of deep learning for autosegmentation of HNC GTV. The results show that while segmentation accuracy can be similar across methods, their reliability, measured by calibration error and uncertainty-error overlap, varies significantly. Used with visualisation maps, these methods may effectively pinpoint uncertainties and potential errors at the voxel level.

Keywords: deep learning; gross tumour volume; head and neck cancer; radiotherapy; tumour segmentation; uncertainty estimation; uncertainty quantification.

PubMed Disclaimer

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- IOP Publishing Ltd.
Medical
- MedlinePlus Health Information
- The YODA Project

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Enhancing the reliability of deep learning-based head and neck tumour segmentation using uncertainty estimation with multi-modal images

Affiliations

Enhancing the reliability of deep learning-based head and neck tumour segmentation using uncertainty estimation with multi-modal images

Authors

Affiliations

Abstract

MeSH terms

LinkOut - more resources

Full Text Sources

Medical