Inter-rater agreement in glioma segmentations on longitudinal MRI
- PMID: 30825711
- PMCID: PMC6396436
- DOI: 10.1016/j.nicl.2019.101727
Inter-rater agreement in glioma segmentations on longitudinal MRI
Abstract
Background: Tumor segmentation of glioma on MRI is a technique to monitor, quantify and report disease progression. Manual MRI segmentation is the gold standard but very labor intensive. At present the quality of this gold standard is not known for different stages of the disease, and prior work has mainly focused on treatment-naive glioblastoma. In this paper we studied the inter-rater agreement of manual MRI segmentation of glioblastoma and WHO grade II-III glioma for novices and experts at three stages of disease. We also studied the impact of inter-observer variation on extent of resection and growth rate.
Methods: In 20 patients with WHO grade IV glioblastoma and 20 patients with WHO grade II-III glioma (defined as non-glioblastoma) both the enhancing and non-enhancing tumor elements were segmented on MRI, using specialized software, by four novices and four experts before surgery, after surgery and at time of tumor progression. We used the generalized conformity index (GCI) and the intra-class correlation coefficient (ICC) of tumor volume as main outcome measures for inter-rater agreement.
Results: For glioblastoma, segmentations by experts and novices were comparable. The inter-rater agreement of enhancing tumor elements was excellent before surgery (GCI 0.79, ICC 0.99) poor after surgery (GCI 0.32, ICC 0.92), and good at progression (GCI 0.65, ICC 0.91). For non-glioblastoma, the inter-rater agreement was generally higher between experts than between novices. The inter-rater agreement was excellent between experts before surgery (GCI 0.77, ICC 0.92), was reasonable after surgery (GCI 0.48, ICC 0.84), and good at progression (GCI 0.60, ICC 0.80). The inter-rater agreement was good between novices before surgery (GCI 0.66, ICC 0.73), was poor after surgery (GCI 0.33, ICC 0.55), and poor at progression (GCI 0.36, ICC 0.73). Further analysis showed that the lower inter-rater agreement of segmentation on postoperative MRI could only partly be explained by the smaller volumes and fragmentation of residual tumor. The median interquartile range of extent of resection between raters was 8.3% and of growth rate was 0.22 mm/year.
Conclusion: Manual tumor segmentations on MRI have reasonable agreement for use in spatial and volumetric analysis. Agreement in spatial overlap is of concern with segmentation after surgery for glioblastoma and with segmentation of non-glioblastoma by non-experts.
Keywords: Glioblastoma; Glioma; Inter-rater agreement; Low-grade glioma; MRI; Manual segmentation.
Copyright © 2019 The Authors. Published by Elsevier Inc. All rights reserved.
Figures







Similar articles
-
Reliability of Semi-Automated Segmentations in Glioblastoma.Clin Neuroradiol. 2017 Jun;27(2):153-161. doi: 10.1007/s00062-015-0471-2. Epub 2015 Oct 21. Clin Neuroradiol. 2017. PMID: 26490369
-
Intra-rater variability in low-grade glioma segmentation.J Neurooncol. 2017 Jan;131(2):393-402. doi: 10.1007/s11060-016-2312-9. Epub 2016 Nov 11. J Neurooncol. 2017. PMID: 27837437
-
Glioblastoma Segmentation: Comparison of Three Different Software Packages.PLoS One. 2016 Oct 25;11(10):e0164891. doi: 10.1371/journal.pone.0164891. eCollection 2016. PLoS One. 2016. PMID: 27780224 Free PMC article.
-
Timing of postoperative magnetic resonance imaging (MRI) following glioma resection: Shattering the 72 hour window.J Pak Med Assoc. 2019 Aug;69(8):1224-1225. J Pak Med Assoc. 2019. PMID: 31431787 Review.
-
The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS).IEEE Trans Med Imaging. 2015 Oct;34(10):1993-2024. doi: 10.1109/TMI.2014.2377694. Epub 2014 Dec 4. IEEE Trans Med Imaging. 2015. PMID: 25494501 Free PMC article. Review.
Cited by
-
Multi-Disease Segmentation of Gliomas and White Matter Hyperintensities in the BraTS Data Using a 3D Convolutional Neural Network.Front Comput Neurosci. 2019 Dec 20;13:84. doi: 10.3389/fncom.2019.00084. eCollection 2019. Front Comput Neurosci. 2019. PMID: 31920609 Free PMC article.
-
[18]F-fluoroethyl-l-tyrosine positron emission tomography for radiotherapy target delineation: Results from a Radiation Oncology credentialing program.Phys Imaging Radiat Oncol. 2024 Mar 13;30:100568. doi: 10.1016/j.phro.2024.100568. eCollection 2024 Apr. Phys Imaging Radiat Oncol. 2024. PMID: 38585372 Free PMC article.
-
The FinnBrain multimodal neonatal template and atlas collection.Commun Biol. 2025 Apr 11;8(1):600. doi: 10.1038/s42003-025-07963-7. Commun Biol. 2025. PMID: 40217092 Free PMC article.
-
Collagen disorder architecture features are associated with clinical, molecular, genetic factors and survival outcomes in colon cancer.NPJ Precis Oncol. 2025 Aug 28;9(1):304. doi: 10.1038/s41698-025-01098-y. NPJ Precis Oncol. 2025. PMID: 40877439 Free PMC article.
-
Automatic Tumor Segmentation With a Convolutional Neural Network in Multiparametric MRI: Influence of Distortion Correction.Tomography. 2019 Sep;5(3):292-299. doi: 10.18383/j.tom.2019.00010. Tomography. 2019. PMID: 31572790 Free PMC article. Clinical Trial.
References
-
- Amelot A., Deroulers C., Badoual M., Polivka M., Adle-Biassette H., Houdart E., Carpentier A.F., Froelich S., Mandonnet E. Surgical decision making from image-based biophysical modeling of glioblastoma: not ready for primetime. Neurosurgery. 2017;80:793–799. - PubMed
-
- Bartko J.J. Measurement and reliability: statistical thinking considerations. Schizophr. Bull. 1991;17:483–489. - PubMed
-
- Ben Abdallah M., Blonski M., Wantz-Mezieres S., Gaudeau Y., Taillandier L., Moureaux J.-M. 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) IEEE; 2016. Statistical evaluation of manual segmentation of a diffuse low-grade glioma MRI dataset; pp. 4403–4406. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical