Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Jul 7;57(13):4425-46.
doi: 10.1088/0031-9155/57/13/4425. Epub 2012 Jun 20.

Task-based evaluation of segmentation algorithms for diffusion-weighted MRI without using a gold standard

Affiliations

Task-based evaluation of segmentation algorithms for diffusion-weighted MRI without using a gold standard

Abhinav K Jha et al. Phys Med Biol. .

Erratum in

  • Phys Med Biol. 2013 Jan 7;58(1):183

Abstract

In many studies, the estimation of the apparent diffusion coefficient (ADC) of lesions in visceral organs in diffusion-weighted (DW) magnetic resonance images requires an accurate lesion-segmentation algorithm. To evaluate these lesion-segmentation algorithms, region-overlap measures are used currently. However, the end task from the DW images is accurate ADC estimation, and the region-overlap measures do not evaluate the segmentation algorithms on this task. Moreover, these measures rely on the existence of gold-standard segmentation of the lesion, which is typically unavailable. In this paper, we study the problem of task-based evaluation of segmentation algorithms in DW imaging in the absence of a gold standard. We first show that using manual segmentations instead of gold-standard segmentations for this task-based evaluation is unreliable. We then propose a method to compare the segmentation algorithms that does not require gold-standard or manual segmentation results. The no-gold-standard method estimates the bias and the variance of the error between the true ADC values and the ADC values estimated using the automated segmentation algorithm. The method can be used to rank the segmentation algorithms on the basis of both the ensemble mean square error and precision. We also propose consistency checks for this evaluation technique.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The regression lines estimated using the no-gold-standard method for the three segmentation algorithms and for the two simulated lesion datasets: DS1 and DS2. In these two datasets, the true ADC values were sampled from a normal distribution. The solid line is generated using the estimated linear model parameters, and the dashed line denotes the estimated standard deviation. The scatter plots show the computed ADC values vs. the true ADC value. ADC values are in units of mm2/103s. Note that although we have plotted the true ADC value on the x axis of the graph, this information was not used in computing the linear model parameters.
Figure 2
Figure 2
Consistency check: The Q-Q plot comparing the histogram of automated ADC values to the theoretically-estimated distribution for the clustering, MLE and EM-based segmentation algorithms and for the two simulated lesion datasets: DS1 and DS2. The quantiles of the theoretically-estimated distribution and the measured ADC distribution are plotted along the x and y axis, respectively. The 45° line is also plotted for visual convenience. We observe that the Q-Q plot lies approximately along the 45° line in all the scenarios, thus confirming that the measured and estimated distributions are consistent with each other.
Figure 3
Figure 3
The regression lines estimated using the no-gold-standard method for the three segmentation algorithms and for the two simulated lesion datasets: DS3 and DS4. The solid line is generated using the estimated linear model parameters, and the dashed line denotes the estimated standard deviation. The scatter plots show the computed ADC values vs. the true ADC value. ADC values are in units of mm2/103s.
Figure 4
Figure 4
Consistency check: The Q-Q plot comparing the histogram of automated ADC values to the theoretically-estimated distribution for the clustering, MLE and EM-based segmentation algorithms and for the two simulated lesion datasets: DS3 and DS4.

References

    1. Bammer R. Basic principles of diffusion-weighted imaging. Eur J Radiol. 2003 Mar;45:169–84. - PubMed
    1. Huisman TA. Diffusion-weighted imaging: basic concepts and application in cerebral stroke and head trauma. Eur Radiol. 2003 Oct;13:2283–97. - PubMed
    1. Chenevert TL, Stegman LD, Taylor JM, Robertson PL, Greenberg HS, Rehemtulla A, Ross BD. Diffusion magnetic resonance imaging: an early surrogate marker of therapeutic efficacy in brain tumors. J Natl Cancer Inst. 2000 Dec;92:2029–36. - PubMed
    1. Galons JP, Altbach MI, Paine-Murrieta GD, Taylor CW, Gillies RJ. Early increases in breast tumor xenograft water mobility in response to paclitaxel therapy detected by non-invasive diffusion magnetic resonance imaging. Neoplasia. 1999 Jun;1:113–7. - PMC - PubMed
    1. Bortner CD, Cidlowski JA. Apoptotic volume decrease and the incredible shrinking cell. Cell Death Differ. 2002 Dec;9:1307–10. - PubMed

Publication types

MeSH terms