Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Jun 15;37(13):2067-2093.
doi: 10.1002/sim.7616. Epub 2018 Apr 2.

Relationship between Roe and Metz simulation model for multireader diagnostic data and Obuchowski-Rockette model parameters

Affiliations

Relationship between Roe and Metz simulation model for multireader diagnostic data and Obuchowski-Rockette model parameters

Stephen L Hillis. Stat Med. .

Abstract

For the typical diagnostic radiology study design, each case (ie, patient) undergoes several diagnostic tests (or modalities) and the resulting images are interpreted by several readers. Often, each reader is asked to assign a confidence-of-disease rating to each case for each test, and the diagnostic tests are compared with respect to reader-performance outcomes that are functions of the reader receiver operating characteristic (ROC) curves, such as the area under the ROC curve. These reader-performance outcomes are frequently analyzed using the Obuchowski and Rockette method, which allows conclusions to generalize to both the reader and case populations. The simulation model proposed by Roe and Metz (RM) in 1997 emulates confidence-of-disease data collected from such studies and has been an important tool for empirically evaluating various reader-performance analysis methods. However, because the RM model parameters are expressed in terms of a continuous decision variable rather than in terms of reader-performance outcomes, it has not been possible to evaluate the realism of the RM model. I derive the relationships between the RM and Obuchowski-Rockette model parameters for the empirical area under the ROC curve reader-performance outcome. These relationships make it possible to evaluate the realism of the RM parameter models and to assess the performance of Obuchowski-Rockette parameter estimates. An example illustrates the application of the relationships for assessing the performance of a proposed upper one-sided confidence bound for the Obuchowski-Rockette test-by-reader variance component, which is useful for sample size estimation.

Keywords: Obuchowski-Rockette; Roe and Metz model; diagnostic radiology; receiver operating characteristic (ROC) curve.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Bias of Obuchowski-Rockette error variance and covariance estimates, computed using the Delong et al [27] method, expressed as a percent of the true error variance σε:OR2. Outcome is the empirical AUC. Results are computed from data simulated from the constrained nonnull unequal-variance RM model, with 30,000 samples simulated for each of N = 36 input combinations: 3 sample sizes (25+/25−, 50+/50−, 100+,100−) × 4 structures(HL. LL, HH, LH) × 3 nonnull curve pairs (with Az(1) and Az(2) denoting the test 1 and test 2 values.) Ten readers are used for each combination. The parameter values are the same as given in Table 2, except that to produce nonnull simulations the μ+ values are replaced by μ+ −0.3 and μ+ +0.3 for tests 1 and 2 and Az values replaced by corresponding Az(1)=Φ[(μ+0.3)/1+b2] and Az(2)=Φ[(μ++0.3)/1+b2] values, respectively. Bias is computed with respect to the corresponding parameters in Table 3. Error bars indicate corresponding large sample 95% confidence intervals.
Figure 2
Figure 2
Percent bias of Obuchowski-Rockette estimates for the reader variance component σR:OR2, test×reader variance component σTR:OR2, var(AUC^1AUC^2) and E [MS (T*R)] based on data simulated from the constrained nonnull unequal-variance RM model as described in Figure 1. Ten readers are used for each combination. The OR estimates are defined by σ^R:OR2=12[MS(R)MS(TR)]Cov^1+Cov^3; σ^TR:OR2=MS(T∗R)σ^ε2+Cov^1+Cov^2Cov^3; and var^(AUC^1.AUC^2.)=2r[MS(T∗R)+r(Cov^2Cov^3)], where MS (R) and MS (T*R) are the OR reader and test×reader mean squares, respectively. Bias is computed with respect to the corresponding parameters in Table 3. Percent bias = 100×bias/ (estimand value).
Figure 3
Figure 3
Observed versus predicted bias for the Obuchowski-Rockette test×reader variance component estimate, σ^ε2, and the variance of the difference of the empirical reader-averaged AUC estimate, var^(AUC^1.AUC^2.), based on data simulated from the constrained nonnull unequal-variance RM model as described in Figure 1. Observed bias = (mean estimate – estimand), where the estimand is computed from the Table 3 formulas. The predicted bias is a function of the error variance and covariances, as given by (22–23), but with simulation-study estimates of the error-variance and covariance biases used in place of the true bias values.
Figure 4
Figure 4
Empirical coverage of the proposed upper one-sided confidence bound for the Obuchowski-Rockette test-by-reader variance component σTR:OR2, based on data simulated from the constrained nonnull unequal-variance RM model as described in Figure 1, except that there are 108 simulation settings because all three reader levels (3, 5, 10 readers) are included. Empirical coverage is the proportion of samples for a particular input combination such that ζ^1ασTR:OR2, where ζ^1α is the estimated (1 − α) 100% upper one-sided confidence bound and σTR:OR2 is the true value computed from the Table 3 formulas.
Figure 5
Figure 5
Monte Carlo versus predicted variance of the empirical reader-averaged AUC test difference. The Monte Carlo values are based on data simulated from the constrained nonnull unequal-variance RM model as described in Figure 1, except that there are 108 simulation settings because all three reader levels (3, 5, 10 readers) are included. The predicted variance is computed using the Table 3 formula. The dashed line is the fitted linear regression line.

Similar articles

Cited by

References

    1. Roe CA, Metz CE. Dorfman-Berbaum-Metz method for statistical analysis of multireader, multimodality receiver operating characteristic data: Validation with computer simulation. Academic Radiology. 1997;4:298–303. doi: 10.1016/S1076-6332(97)80032-3. - DOI - PubMed
    1. Dorfman DD, Berbaum KS, Lenth RV, Chen YF, Donaghy BA. Monte Carlo validation of a multireader method for receiver operating characteristic discrete rating data: factorial experimental design. Academic Radiology. 1998;5:591–602. - PubMed
    1. Beiden SV, Wagner RF, Campbell G. Components-of-variance models and multiple-bootstrap experiments: An alternative method for random-effects, receiver operating characteristic analysis. Academic Radiology. 2000;7(5):341–9. - PubMed
    1. Beiden SV, Wagner RF, Campbell G, Chan HP. Analysis of uncertainties in estimates of components of variance in multivariate ROC analysis. Academic Radiology. 2001;8(7):616–22. - PubMed
    1. Wagner RF, Beiden SV, Metz CE. Continuous versus categorical data for ROC analysis: some quantitative considerations. Academic Radiology. 2001;8(4):328–34. - PubMed

Publication types