Interobserver ground-truth variability limits performance of automated glioblastoma segmentation on [18F]FET PET

doi:10.1186/s40658-025-00767-y

EJNMMI Phys. 2025 Jun 6;12(1):54.

Selene De Sutter¹, Ine Dirks^{2,3}, Laurens Raes⁴, Wietse Geens⁵, Hendrik Everaert⁴, Sophie Bourgeois⁴, Johnny Duerinck⁵, Jef Vandemeulebroucke^{2,3,6}

Affiliations

¹ Department of Electronics and Informatics (ETRO), Vrije Universiteit Brussel (VUB), Pleinlaan 9, Elsene, 1050, Brussels, Belgium. selene.de.sutter@vub.be.
² Department of Electronics and Informatics (ETRO), Vrije Universiteit Brussel (VUB), Pleinlaan 9, Elsene, 1050, Brussels, Belgium.
³ Imec, Leuven, Belgium.
⁴ Department of Nuclear Medicine, Vrije Universiteit Brussel (VUB), Universitair Ziekenhuis Brussel (UZ Brussel), Brussels, Belgium.
⁵ Department of Neurosurgery, Vrije Universiteit Brussel (VUB), Universitair Ziekenhuis Brussel (UZ Brussel), Brussels, Belgium.
⁶ Department of Radiology, Vrije Universiteit Brussel (VUB), Universitair Ziekenhuis Brussel (UZ Brussel), Brussels, Belgium.

PMID: 40478497
PMCID: PMC12144010
DOI: 10.1186/s40658-025-00767-y

Selene De Sutter et al. EJNMMI Phys. 2025.

Abstract

Background: Positron emission tomography (PET) with the O-(2-[¹⁸F]fluoroethyl)-L-tyrosine ([¹⁸F]FET) tracer is of growing importance in the management of glioblastoma, both for estimating tumor extent and for extracting diagnostic and prognostic parameters. Robust and accurate glioblastoma segmentation methods are essential to maximize the benefits of this imaging modality. Given the importance of setting the foreground threshold during manual tumor delineation, this study investigates whether incorporating such prior knowledge can guide automated segmentation and improve its performance. Two segmentation networks were trained based on the nnU-Net guidelines: one with the [¹⁸F]FET PET image as sole input, and one with an additional input channel for the threshold map. For the latter, we investigate the benefit of manually obtained thresholds and explore automated prediction and generation of such maps. A fully automated pipeline was constructed by selecting the best-performing threshold prediction approach and cascading it with the tumor segmentation model.
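The two-channel input described above can be illustrated with a minimal sketch. The abstract does not define the threshold map, so this sketch assumes it is a binary image marking voxels at or above a scalar threshold, and that the threshold is the mean uptake in a background VOI scaled by a factor. The factor of 1.6 is a commonly used tumor-to-background cut-off for [¹⁸F]FET PET and is not confirmed as the authors' choice. The function names are hypothetical, and the image is a flat list of voxel values for brevity.

```python
from statistics import mean


def background_threshold(pet, background_voi, factor=1.6):
    """Scale the mean uptake inside the background VOI by `factor` (assumed value)."""
    background = [v for v, inside in zip(pet, background_voi) if inside]
    if not background:
        raise ValueError("background VOI is empty")
    return factor * mean(background)


def threshold_map(pet, threshold):
    """Binary map: 1 where uptake reaches the threshold, else 0 (assumed definition)."""
    return [1 if v >= threshold else 0 for v in pet]


def two_channel_input(pet, tmap):
    """Pair each voxel's uptake (channel 0) with its threshold-map value (channel 1)."""
    return list(zip(pet, tmap))


# Toy 1-D "image": six voxels, the first three form the background VOI.
pet = [1.0, 1.2, 0.8, 1.5, 2.0, 3.1]
background_voi = [True, True, True, False, False, False]

threshold = background_threshold(pet, background_voi)
tmap = threshold_map(pet, threshold)
channels = two_channel_input(pet, tmap)
```

In this toy case only the last two voxels exceed the threshold, so a segmentation network receiving `channels` would see the reader-dependent foreground cut-off alongside the raw uptake values.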

Results: The proposed two-channel network performs better when guided by threshold maps originating from the same reader whose ground-truth tumor label the prediction is compared to (DSC = 0.901). When threshold maps were generated by a different reader, performance reverted to levels comparable to the one-channel network and to inter-reader variability. The proposed full pipeline achieves results on par with the current state of the art (DSC = 0.807).
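For readers unfamiliar with the metric, the Dice similarity coefficient (DSC) reported above can be sketched as follows. This is the standard definition, DSC = 2|A ∩ B| / (|A| + |B|), applied to flat binary masks. The toy masks and the name `dice_similarity` are illustrative assumptions, not data or code from the study.

```python
def dice_similarity(pred, truth):
    """DSC = 2|A ∩ B| / (|A| + |B|) for two binary masks of equal length."""
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of voxels")
    overlap = sum(1 for p, t in zip(pred, truth) if p and t)
    total = sum(1 for p in pred if p) + sum(1 for t in truth if t)
    # Convention assumed here: two empty masks agree perfectly.
    return 1.0 if total == 0 else 2.0 * overlap / total


# Toy labels: two readers delineate the same lesion slightly differently.
reader_a = [0, 1, 1, 1, 1, 0, 0, 0]
reader_b = [0, 0, 1, 1, 1, 1, 0, 0]
prediction = [0, 1, 1, 1, 1, 0, 0, 0]

dsc_vs_a = dice_similarity(prediction, reader_a)
dsc_vs_b = dice_similarity(prediction, reader_b)
```

The same prediction scores differently depending on which reader's label is taken as ground truth, which is the effect the Results paragraph describes.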

Conclusions: Incorporating a threshold map can significantly improve tumor segmentation performance when it aligns well with the ground-truth label. However, the current inability to reliably reproduce these maps (both manually and automatically) or the ground-truth tumor labels restricts the achievable accuracy of automated glioblastoma segmentation on [¹⁸F]FET PET, highlighting the need for more consistent definitions of such ground-truth delineations.

Keywords: Brain; Deep learning; Glioblastoma; Positron emission tomography; Segmentation.

Conflict of interest statement

Declarations. Ethics approval and consent to participate: This single-center retrospective study was performed in line with the principles of the Declaration of Helsinki. Approval was granted by the Ethics Committee of Universitair Ziekenhuis Brussel (Commissie Medische Ethiek; protocol code EC-2021-137; date of approval 28-07-2021). This study is a retrospective analysis of data obtained during prospective studies (Axig (NCT01562197), GliAvAx (NCT03291314), and GlitIpNi (NCT03233152)), during which all patients signed informed consent for the use of their data. Consent for publication: The authors affirm that human research participants provided informed consent for publication of the images in Figs. 2 and 6. Competing interests: The authors declare that they have no competing interests.

Figures

**Fig. 1**
Overview of data partitioning and annotation strategy, including fivefold cross-validation for training, and a fully independent test set for validation of the final model

**Fig. 2**
Overview of the proposed approach. The full pipeline starts with the prediction of a threshold map from the [¹⁸F]FET PET image using an automated threshold estimation network. The investigated threshold prediction networks (green) are shown in correspondence with the manual segmentation workflow (orange): from the PET image, U-Net_BKG predicts the background VOI, DenseNet_TH predicts the threshold value, and U-Net_TM predicts the threshold map. The image and threshold map are subsequently fed as input channels to the segmentation network, a two-channel U-Net, for the prediction of the tumor label. A multi-slice representation of the background VOI is shown below. VOI = Volume Of Interest

**Fig. 3**
Bland–Altman plots illustrating the differences between ground-truth thresholds and thresholds predicted using the various approaches for automated threshold prediction. Each point corresponds to a pair of predicted and ground-truth threshold values. Inter-reader differences are shown in (d) for different pairs of readers (1–4), where each point corresponds to a pair of threshold values, both determined by a different reader. The plots display the mean difference (bias) and 95% limits of agreement. Red crosses in (a) indicate cases where the network failed to segment a background VOI, resulting in a threshold set to 0. SD = Standard Deviation

**Fig. 4**
Bland–Altman plots illustrating the differences between ground-truth MTV and MTV predicted using 2C-U-Net with threshold maps generated by the various automated threshold prediction approaches. Each point corresponds to a pair of predicted and ground-truth volumes. Inter-reader differences are shown in (d) for the different pairs of readers (1–4), where each point corresponds to a pair of tumor volumes, both determined by a different reader. The plots display the mean difference (bias) and 95% limits of agreement. GT = Ground Truth; MTV = Metabolic Tumor Volume; SD = Standard Deviation

**Fig. 5**
Performance of the full pipeline as a function of lesion volume (a–c) and scanner (d–f). DSC = Dice Similarity Coefficient; MTV = Metabolic Tumor Volume; NSD = Normalized Surface Dice; AVE = Absolute Volume Error

**Fig. 6**
Example segmentations of a representative subject. Threshold maps from reader A, from reader B, and automatically generated by U-Net_TM are shown in the first column. Tumor label predictions from 2C-U-Net using these threshold maps are shown in the second column and compared to the ground-truth labels of both readers. The overlap between the labels of both readers is visualized with corresponding metrics. AVE = Absolute Volume Error; DSC = Dice Similarity Coefficient; NSD = Normalized Surface Dice
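The bias and 95% limits of agreement displayed in the Bland–Altman plots of Figs. 3 and 4 can be computed as in the sketch below. It assumes the conventional definition (bias = mean of the paired differences, limits = bias ± 1.96 × sample standard deviation of the differences); the captions do not state which variant the authors used. The threshold values and the name `bland_altman` are made up for illustration.

```python
from statistics import mean, stdev


def bland_altman(predicted, reference):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    if len(predicted) != len(reference):
        raise ValueError("inputs must be paired")
    if len(predicted) < 2:
        raise ValueError("at least two pairs are needed for a standard deviation")
    diffs = [p - r for p, r in zip(predicted, reference)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation (n - 1 in the denominator)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical predicted vs. ground-truth threshold values for four cases.
predicted = [1.5, 1.7, 1.6, 1.9]
reference = [1.4, 1.8, 1.6, 1.7]

bias, lower, upper = bland_altman(predicted, reference)
```

A bias near zero with wide limits would indicate that predictions are unbiased on average but individually unreliable, which is the kind of pattern these plots are designed to expose.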

Grants and funding

Grant Number 101016834/Horizon 2020
