Statistical validation of image segmentation quality based on a spatial overlap index
- PMID: 14974593
- PMCID: PMC1415224
- DOI: 10.1016/s1076-6332(03)00671-8
Statistical validation of image segmentation quality based on a spatial overlap index
Abstract
Rationale and objectives: To examine a statistical validation method based on the spatial overlap between two sets of segmentations of the same anatomy.
Materials and methods: The Dice similarity coefficient (DSC) was used as a statistical validation metric to evaluate the performance of both the reproducibility of manual segmentations and the spatial overlap accuracy of automated probabilistic fractional segmentation of MR images, illustrated on two clinical examples. Example 1: 10 consecutive cases of prostate brachytherapy patients underwent both preoperative 1.5T and intraoperative 0.5T MR imaging. For each case, 5 repeated manual segmentations of the prostate peripheral zone were performed separately on preoperative and on intraoperative images. Example 2: A semi-automated probabilistic fractional segmentation algorithm was applied to MR imaging of 9 cases with 3 types of brain tumors. DSC values were computed and logit-transformed values were compared in the mean with the analysis of variance (ANOVA).
Results: Example 1: The mean DSCs of 0.883 (range, 0.876-0.893) with 1.5T preoperative MRI and 0.838 (range, 0.819-0.852) with 0.5T intraoperative MRI (P < .001) were within and at the margin of the range of good reproducibility, respectively. Example 2: Wide ranges of DSC were observed in brain tumor segmentations: Meningiomas (0.519-0.893), astrocytomas (0.487-0.972), and other mixed gliomas (0.490-0.899).
Conclusion: The DSC value is a simple and useful summary measure of spatial overlap, which can be applied to studies of reproducibility and accuracy in image segmentation. We observed generally satisfactory but variable validation results in two clinical applications. This metric may be adapted for similar validation tasks.
Figures




References
-
- Bonar DC, Schaper KA, Anderson JR, Rottenberg DA, Strother SC. Graphical analysis of MR feature space for measurement of CSF, gray matter, and white-matter volumes. J Comput Assist Tomogr. 1993;17:461–470. - PubMed
-
- Warfield SK, Westin CF, Guttmann CRG, Albert M, Jolesz FA, Kikinis R. Fractional segmentation of white matter. In: Proceedings of Second International Conference on Medical Imaging Computing and Computer Assisted Interventions, Sept 19–22, 1999, Cambridge, UK. New York: Springer, 62–71.
-
- Zou KH, Wells M III, Kaus MR, Kikinis R, Jolesz FA, Warfield SK. Statistical validation of automated probabilistic fractional segmentation against composite latent expert gold standard in MR imaging of brain tumors. In: Proceedings of 5th International Conference on Medical Imaging Computing and Computer Assisted Interventions, Sept 25–28, 2002, Tokyo, Japan. Berlin: Springer-Verlag, 315–322.
-
- Grabowski TJ, Frank RJ, Szumski NR, Brown CK, Damasio H. Validation of partial tissue segmentation of single-channel magnetic resonance images of the brain. Neuroimage. 2000;12:640 –656. - PubMed
-
- Choi HS, Haynor DR, Kim Y. Partial volume tissue classification of multi-channel magnetic resonance images – a mixed model. IEEE Trans Med Imag. 1991;10:295–407. - PubMed
Publication types
MeSH terms
Grants and funding
- R03 HS013234/HS/AHRQ HHS/United States
- R01AG19513-01/AG/NIA NIH HHS/United States
- R03HS13234-01/HS/AHRQ HHS/United States
- R01CA86879/CA/NCI NIH HHS/United States
- R01RR11747/RR/NCRR NIH HHS/United States
- R01 LM007861/LM/NLM NIH HHS/United States
- R01 AG019513/AG/NIA NIH HHS/United States
- P41 RR013218/RR/NCRR NIH HHS/United States
- R01 CA086879/CA/NCI NIH HHS/United States
- R01LM7861/LM/NLM NIH HHS/United States
- P41RR13218/RR/NCRR NIH HHS/United States
- P01CA67165/CA/NCI NIH HHS/United States
- P01 CA067165/CA/NCI NIH HHS/United States
- R21CA89449-01/CA/NCI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical