Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2002 Mar;9(3):290-7.
doi: 10.1016/s1076-6332(03)80372-0.

Estimation in medical imaging without a gold standard

Affiliations

Estimation in medical imaging without a gold standard

Matthew A Kupinski et al. Acad Radiol. 2002 Mar.

Abstract

Rationale and objectives: In medical imaging, physicians often estimate a parameter of interest (eg, cardiac ejection fraction) for a patient to assist in establishing a diagnosis. Many different estimation methods may exist, but rarely can one be considered a gold standard. Therefore, evaluation and comparison of different estimation methods are difficult. The purpose of this study was to examine a method of evaluating different estimation methods without use of a gold standard.

Materials and methods: This method is equivalent to fitting regression lines without the x axis. To use this method, multiple estimates of the clinical parameter of interest for each patient of a given population were needed. The authors assumed the statistical distribution for the true values of the clinical parameter of interest was a member of a given family of parameterized distributions. Furthermore, they assumed a statistical model relating the clinical parameter to the estimates of its value. Using these assumptions and observed data, they estimated the model parameters and the parameters characterizing the distribution of the clinical parameter.

Results: The authors applied the method to simulated cardiac ejection fraction data with varying numbers of patients, numbers of modalities, and levels of noise. They also tested the method on both linear and nonlinear models and characterized the performance of this method compared to that of conventional regression analysis by using x-axis information. Results indicate that the method follows trends similar to that of conventional regression analysis as patients and noise vary, although conventional regression analysis outperforms the method presented because it uses the gold standard which the authors assume is unavailable.

Conclusion: The method accurately estimates model parameters. These estimates can be used to rank the systems for a given estimation task.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A graphic, two-modality example of the method studied where a shows the results for M = 1 and b shows the results for M = 2. The dotted lines represent ±σ̂m. The slope, intercept, and noise terms were estimated by using RWT. Although the x coordinates are plotted, they were not used in estimating the linear model parameters.
Figure 2
Figure 2
A comparison of the true gold-standard density, pr(Θ), and the parameterized density, pr^(Θ|r^). The shape of the density, as characterized by r⃗, was determined with RWT but without previous information. The gold-standard density shown here is a truncated normal density, whereas the parameterized density used in the likelihood expression is a beta-density function. In a sense, this illustrates a beta density imitating a given truncated normal density. Note that the parameter of interest is limited to a finite domain.
Figure 3
Figure 3
(a) The RMSE¯ for three different modalities versus the number of patients. As the number of patients increases, RMSEm converges to σm/am by Equations (1) and (5). (b) A comparison between RWT and linear-regression analysis with a gold standard. Note that the RMSE is also averaged over the three modalities. As expected, conventional regression analysis has lower RMSE, but the performances of the two methods converge as the number of patients increases. For these experiments, a⃗ = [0.6,0.7,0.8], b⃗ = [−0.1,0.0,0.1], σ⃗ = [0.05,0.03,0.08], and the error bars represent the standard error calculated over 50 independent experiments.
Figure 4
Figure 4
The RMSE¯ (averaged across simulations and modalities) versus the number of modalities used in a RWT experiment. A sharp decline in RMSE¯ is seen from one to two modalities, followed by a slow decline. One might expect this, especially because RWT cannot work properly with only one modality. The performance of conventional regression analysis is independent of the number of modalities. The same model parameters were used for all modalities in all experiments (am = 1, bm = 0.1, σm = 0.05, P = 100).
Figure 5
Figure 5
(a) The RMSE¯ for three different modalities versus variance of the noise σm. The RMSE¯ increases in accordance with 1/am by Equations (1) and (5). (b) A comparison between RWT and linear-regression analysis with a gold standard. Note that the RMSE is also averaged over the three modalities. The RMSE¯ does not converge to zero for RWT as σm tends to zero. The parallel nature of the two graphs indicates that the comparative performance of RWT is independent of σm. For these experiments, a⃗ = [0.6,0.7,0.8], b⃗ = [−0.1,0.0,0.1], P = 100, and the error bars represent the standard error calculated over 50 independent experiments.
Figure 6
Figure 6
An application of RWT with a quadratic model. (a) For modality 1, a strong, nonlinear relationship with the gold standard and a relatively large variance were discovered qualitatively. (b) Modality 2 was slightly nonlinear with a small variance, whereas (c) modality 3 was linear with a large variance. Both were fit well by the quadratic RWT.

Similar articles

Cited by

References

    1. Barrett HH. Objective assessment of image quality: effects of quantum noise and object variability. J Opt Soc Am A. 1990;7:1266–1278. - PubMed
    1. Feig SA. Estimation of currently attainable benefit from mammographic screening in women aged 40–49. Cancer. 1995;75:2412–2419. - PubMed
    1. Walter SD, Irwig LM. Estimation of test error rates, disease prevalence, and relative risk from misclassified data: a review. J Clin Epidemiol. 1988;41:923–937. - PubMed
    1. Metz CE. ROC methodology in radiologic imaging. Invest Radiol. 1986;21:720–733. - PubMed
    1. Henkelman RM, Kay I, Bronskill MJ. Receiver operator characteristic (ROC) analysis without truth. Med Decis Making. 1990;10:24–29. - PubMed

Publication types

LinkOut - more resources