Acad Radiol. 2002 Mar;9(3):290-7.

doi: 10.1016/s1076-6332(03)80372-0.

Estimation in medical imaging without a gold standard

Matthew A Kupinski¹, John W Hoppin, Eric Clarkson, Harrison H Barrett, George A Kastis

PMID: 11887945
PMCID: PMC3143018
DOI: 10.1016/s1076-6332(03)80372-0

Matthew A Kupinski et al. Acad Radiol. 2002 Mar.

Affiliation

¹ Department of Radiology, Arizona Health Sciences Center, Tucson 85724-5067, USA.

Abstract

Rationale and objectives: In medical imaging, physicians often estimate a parameter of interest (eg, cardiac ejection fraction) for a patient to assist in establishing a diagnosis. Many different estimation methods may exist, but rarely can one be considered a gold standard. Therefore, evaluation and comparison of different estimation methods are difficult. The purpose of this study was to examine a method of evaluating different estimation methods without use of a gold standard.

Materials and methods: This method is equivalent to fitting regression lines without the x axis. To use this method, multiple estimates of the clinical parameter of interest for each patient of a given population were needed. The authors assumed the statistical distribution for the true values of the clinical parameter of interest was a member of a given family of parameterized distributions. Furthermore, they assumed a statistical model relating the clinical parameter to the estimates of its value. Using these assumptions and observed data, they estimated the model parameters and the parameters characterizing the distribution of the clinical parameter.
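A minimal sketch of the likelihood this paragraph describes, under assumptions that the abstract does not state explicitly: a linear model $y_{pm} = a_m \Theta_p + b_m + \epsilon_{pm}$ with Gaussian noise of standard deviation $\sigma_m$, and a beta density on $[0, 1]$ for the true values (the Figure 2 caption mentions a beta density in the likelihood expression). The true value is integrated out numerically, so no gold standard enters the computation. The function names, the beta shape parameters `nu` and `omega`, and the midpoint-rule integration are illustrative choices, not the authors' implementation.

```python
import math
import random


def beta_log_pdf(theta, nu, omega):
    """Log of the Beta(nu, omega) density at theta in (0, 1)."""
    log_norm = math.lgamma(nu + omega) - math.lgamma(nu) - math.lgamma(omega)
    return log_norm + (nu - 1) * math.log(theta) + (omega - 1) * math.log(1 - theta)


def normal_log_pdf(y, mean, sigma):
    """Log of the normal density with the given mean and standard deviation."""
    return -0.5 * math.log(2 * math.pi * sigma ** 2) - (y - mean) ** 2 / (2 * sigma ** 2)


def log_likelihood(estimates, a, b, sigma, nu, omega, n_grid=200):
    """Log-likelihood of the estimates with the unknown true value integrated out.

    estimates[p][m] is the estimate for patient p from modality m.
    The integral over theta uses a midpoint rule on (0, 1).
    """
    grid = [(k + 0.5) / n_grid for k in range(n_grid)]
    log_prior = [beta_log_pdf(t, nu, omega) for t in grid]
    total = 0.0
    for y_p in estimates:
        log_terms = []
        for t, lp in zip(grid, log_prior):
            term = lp
            for m, y in enumerate(y_p):
                term += normal_log_pdf(y, a[m] * t + b[m], sigma[m])
            log_terms.append(term)
        # log-sum-exp keeps the integral numerically stable
        peak = max(log_terms)
        total += peak + math.log(sum(math.exp(v - peak) for v in log_terms) / n_grid)
    return total


def simulate(n_patients, a, b, sigma, nu, omega, seed=0):
    """Draw hidden true values from a beta density and generate noisy estimates."""
    rng = random.Random(seed)
    data = []
    for _ in range(n_patients):
        theta = rng.betavariate(nu, omega)  # never passed to the likelihood
        data.append([a[m] * theta + b[m] + rng.gauss(0.0, sigma[m]) for m in range(len(a))])
    return data


# Slopes, intercepts, and noise levels taken from the Figure 3 caption;
# the beta shape parameters (4, 3) are hypothetical.
a_true, b_true, sigma_true = [0.6, 0.7, 0.8], [-0.1, 0.0, 0.1], [0.05, 0.03, 0.08]
data = simulate(100, a_true, b_true, sigma_true, nu=4.0, omega=3.0)
ll_true = log_likelihood(data, a_true, b_true, sigma_true, 4.0, 3.0)
ll_swapped = log_likelihood(data, a_true[::-1], b_true, sigma_true, 4.0, 3.0)
print(ll_true, ll_swapped)
```

The sketch only evaluates the likelihood at two candidate parameter sets. Estimating the parameters would mean maximizing `log_likelihood` over `a`, `b`, `sigma`, `nu`, and `omega` with a numerical optimizer, which is omitted here.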

Results: The authors applied the method to simulated cardiac ejection fraction data with varying numbers of patients, numbers of modalities, and levels of noise. They also tested the method on both linear and nonlinear models and compared its performance with that of conventional regression analysis, which uses x-axis information. Results indicate that the method follows trends similar to those of conventional regression analysis as the number of patients and the noise level vary, although conventional regression analysis outperforms the method presented because it uses the gold standard, which the authors assume is unavailable.

Conclusion: The method accurately estimates model parameters. These estimates can be used to rank the systems for a given estimation task.
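As a hedged illustration of the ranking step, the sketch below orders modalities by the ratio of estimated noise to estimated slope, $\hat{\sigma}_m / \hat{a}_m$. Using this ratio as the figure of merit is an assumption suggested by the Figure 3 caption, which states that RMSE_m converges to σ_m/a_m. The abstract itself does not name a ranking criterion, and `rank_modalities` is a hypothetical helper.

```python
def rank_modalities(a_hat, sigma_hat):
    """Rank modalities by noise-to-slope ratio, smallest (best) first.

    Returns a list of (modality_number, ratio) pairs, with modalities
    numbered from 1.
    """
    ratios = [s / abs(a) for a, s in zip(a_hat, sigma_hat)]
    order = sorted(range(len(ratios)), key=lambda m: ratios[m])
    return [(m + 1, ratios[m]) for m in order]


# Slope and noise values from the Figure 3 caption, used here as if
# they were the estimates returned by the method.
ranking = rank_modalities([0.6, 0.7, 0.8], [0.05, 0.03, 0.08])
for modality, ratio in ranking:
    print(f"modality {modality}: sigma/a = {ratio:.4f}")
```

A lower ratio means less noise relative to the sensitivity of the estimate to the true value, so the first entry is the preferred modality under this criterion.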

Figures

**Figure 1**
A graphic, two-modality example of the method studied: **(a)** shows the results for M = 1 and **(b)** shows the results for M = 2. The dotted lines represent ±σ̂_m. The slope, intercept, and noise terms were estimated by using regression without truth (RWT). Although the x coordinates are plotted, they were not used in estimating the linear model parameters.

**Figure 2**
A comparison of the true gold-standard density, pr(Θ), and the parameterized density, $\hat{\mathrm{pr}}(\Theta \mid \hat{\vec{r}})$. The shape of the density, as characterized by r⃗, was determined with RWT but without previous information. The gold-standard density shown here is a truncated normal density, whereas the parameterized density used in the likelihood expression is a beta-density function. In a sense, this illustrates a beta density imitating a given truncated normal density. Note that the parameter of interest is limited to a finite domain.

**Figure 3**
**(a)** The $\bar{RMSE}$ for three different modalities versus the number of patients. As the number of patients increases, RMSE_m converges to σ_m/a_m by Equations (1) and (5). **(b)** A comparison between RWT and linear-regression analysis with a gold standard. Note that the RMSE is also averaged over the three modalities. As expected, conventional regression analysis has lower RMSE, but the performances of the two methods converge as the number of patients increases. For these experiments, a⃗ = [0.6,0.7,0.8], b⃗ = [−0.1,0.0,0.1], σ⃗ = [0.05,0.03,0.08], and the error bars represent the standard error calculated over 50 independent experiments.

**Figure 4**
The $\bar{RMSE}$ (averaged across simulations and modalities) versus the number of modalities used in a RWT experiment. A sharp decline in $\bar{RMSE}$ is seen from one to two modalities, followed by a slow decline. One might expect this, especially because RWT cannot work properly with only one modality. The performance of conventional regression analysis is independent of the number of modalities. The same model parameters were used for all modalities in all experiments (*a_m* = 1, *b_m* = 0.1, σ_m = 0.05, P = 100).

**Figure 5**
**(a)** The $\bar{RMSE}$ for three different modalities versus variance of the noise σ_m. The $\bar{RMSE}$ increases in accordance with 1/*a_m* by Equations (1) and (5). **(b)** A comparison between RWT and linear-regression analysis with a gold standard. Note that the RMSE is also averaged over the three modalities. The $\bar{RMSE}$ does not converge to zero for RWT as σ_m tends to zero. The parallel nature of the two graphs indicates that the comparative performance of RWT is independent of σ_m. For these experiments, a⃗ = [0.6,0.7,0.8], b⃗ = [−0.1,0.0,0.1], P = 100, and the error bars represent the standard error calculated over 50 independent experiments.

**Figure 6**
An application of RWT with a quadratic model. **(a)** For modality 1, a strong, nonlinear relationship with the gold standard and a relatively large variance were discovered qualitatively. **(b)** Modality 2 was slightly nonlinear with a small variance, whereas **(c)** modality 3 was linear with a large variance. Both were fit well by the quadratic RWT.

Cited by

Need for objective task-based evaluation of AI-based segmentation methods for quantitative PET.
Liu Z, Mhlanga JC, Siegel BA, Jha AK. Proc SPIE Int Soc Opt Eng. 2023 Feb;12467:124670R. doi: 10.1117/12.2647894. Epub 2023 Apr 3. PMID: 37990707. Free PMC article.

A robust and accurate center-frequency estimation (RACE) algorithm for improving motion estimation performance of SinMod on tagged cardiac MR images without known tagging parameters.
Liu H, Wang J, Xu X, Song E, Wang Q, Jin R, Hung CC, Fei B. Magn Reson Imaging. 2014 Nov;32(9):1139-55. doi: 10.1016/j.mri.2014.07.005. Epub 2014 Aug 1. PMID: 25087857. Free PMC article.

Task-based evaluation of segmentation algorithms for diffusion-weighted MRI without using a gold standard.
Jha AK, Kupinski MA, Rodríguez JJ, Stephen RM, Stopeck AT. Phys Med Biol. 2012 Jul 7;57(13):4425-46. doi: 10.1088/0031-9155/57/13/4425. Epub 2012 Jun 20. PMID: 22713231. Free PMC article.

Perfusion-weighted MR imaging studies in brain hypervascular diseases: comparison of arterial input function extractions for perfusion measurement.
Ducreux D, Buvat I, Meder JF, Mikulis D, Crawley A, Fredy D, TerBrugge K, Lasjaunias P, Bittoun J. AJNR Am J Neuroradiol. 2006 May;27(5):1059-69. PMID: 16687543. Free PMC article.

A no-gold-standard technique for objective assessment of quantitative nuclear-medicine imaging methods.
Jha AK, Caffo B, Frey EC. Phys Med Biol. 2016 Apr 7;61(7):2780-800. doi: 10.1088/0031-9155/61/7/2780. Epub 2016 Mar 16. PMID: 26982626. Free PMC article.
