Assessment of diagnostic markers by goodness-of-fit tests
- PMID: 12872305
- DOI: 10.1002/sim.1464
Abstract
Receiver operating characteristic (ROC) curves are statistical tools used to assess the precision of diagnostic markers or to compare new diagnostic markers with old ones. The most common index employed for these purposes is the area under the ROC curve (theta), and several statistical tests exist for the null hypotheses H0: theta = 0.5 or, in the case of two-marker comparisons, H0: theta1 = theta2, against alternatives of interest. In this paper we show that goodness-of-fit tests of uniformity of the distribution of the false positive (true positive) rates can be used instead of tests based on the area index. A semi-parametric approach is based on a completely specified distribution of marker measurements for either the healthy (F) or the diseased (G) subjects, and this is extended to the two-marker case. We then extend the approach to the one- and two-marker cases when neither distribution is specified (the non-parametric case). In general, ROC-based tests are more powerful than goodness-of-fit tests for location differences between the distributions of healthy and diseased subjects. However, ROC-based tests are less powerful when location-scale differences exist (producing ROC curves that cross the diagonal), and they are incapable of discriminating between healthy and diseased samples when theta = 0.5 but F ≠ G. In these cases, goodness-of-fit tests have a distinct advantage over ROC-based tests. In conclusion, ROC methodology should be used with recognition of its potential limitations and should be replaced by goodness-of-fit tests when appropriate. The latter are a viable alternative and can be used as a 'black box' or as an exploratory first step in the evaluation of novel diagnostic markers.
Copyright 2003 John Wiley & Sons, Ltd.
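The semi-parametric one-marker idea in the abstract can be illustrated with a minimal sketch. If the healthy distribution F is fully specified and F = G, then the false positive rate evaluated at each diseased measurement, 1 - F(y), is uniform on (0, 1), so a goodness-of-fit test of uniformity tests F = G. Everything specific below is an assumption made for illustration and is not taken from the paper: F is standard normal, G is normal with mean 0 and standard deviation 3 (so theta = 0.5 but F ≠ G), the sample size is 200, the Kolmogorov-Smirnov statistic with its asymptotic p-value stands in for whichever goodness-of-fit test the authors use, and all function names are hypothetical.

```python
import math
import random
from statistics import NormalDist


def placement_values(diseased, healthy_cdf):
    """False positive rate at each diseased measurement: 1 - F(y)."""
    return [1.0 - healthy_cdf(y) for y in diseased]


def ks_uniform(u):
    """Kolmogorov-Smirnov statistic against Uniform(0, 1) and its
    asymptotic p-value (an approximation, reasonable for moderate n)."""
    n = len(u)
    s = sorted(u)
    d_plus = max((i + 1) / n - x for i, x in enumerate(s))
    d_minus = max(x - i / n for i, x in enumerate(s))
    d = max(d_plus, d_minus)
    t = math.sqrt(n) * d
    if t < 0.05:
        # The alternating series below converges too slowly here;
        # the p-value is essentially 1 for such a small statistic.
        return d, 1.0
    series = sum(
        (-1) ** (k - 1) * math.exp(-2.0 * k * k * t * t)
        for k in range(1, 101)
    )
    return d, min(1.0, max(0.0, 2.0 * series))


def auc_known_healthy(u):
    """With F known, theta = P(X < Y) = E[F(Y)] = 1 - mean placement value."""
    return 1.0 - sum(u) / len(u)


def run_example(seed=0, n=200):
    """Simulate diseased values from the assumed G and test uniformity."""
    rng = random.Random(seed)
    healthy = NormalDist(0.0, 1.0)                      # assumed F
    diseased = [rng.gauss(0.0, 3.0) for _ in range(n)]  # assumed G
    u = placement_values(diseased, healthy.cdf)
    d, p = ks_uniform(u)
    return auc_known_healthy(u), d, p


if __name__ == "__main__":
    auc, d, p = run_example()
    print(f"estimated theta = {auc:.3f}, KS D = {d:.3f}, p = {p:.2e}")
```

In this scale-only scenario the estimated area stays close to 0.5, so an area-based test has little to detect, while the placement values pile up near 0 and 1 and the uniformity test rejects. This mirrors the crossing-ROC-curve case described in the abstract, under the stated assumptions only.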