The cost-effectiveness of reclassification sampling for prevalence estimation
- PMID: 22348146
- PMCID: PMC3278465
- DOI: 10.1371/journal.pone.0032058
Abstract
Background: Typically, a two-phase (double) sampling strategy is employed when classifications are subject to error and a gold standard (perfect) classifier is available. Two-phase sampling involves classifying the entire sample with an imperfect classifier, and then classifying a subset of the sample with the gold standard.
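As an illustration of the two-phase idea described above, the following is a minimal sketch of a commonly used plug-in prevalence estimate. It is not the software that accompanies the paper. The function name, argument names, and example counts are all hypothetical, and the sketch assumes that the gold-standard subsample is drawn at random within each imperfect-classifier result (positive or negative).

```python
def two_phase_prevalence(n_pos, n_neg,
                         v_pos_true, v_pos_total,
                         v_neg_true, v_neg_total):
    """Plug-in two-phase (double sampling) prevalence estimate.

    Hypothetical argument names, chosen for illustration only:
      n_pos, n_neg  -- phase-1 counts classified positive / negative
                       by the imperfect classifier
      v_pos_true    -- verified phase-1 positives that the gold standard
                       also calls positive
      v_pos_total   -- number of phase-1 positives sent for verification
      v_neg_true    -- verified phase-1 negatives that the gold standard
                       calls positive
      v_neg_total   -- number of phase-1 negatives sent for verification
    """
    n = n_pos + n_neg
    if n == 0 or v_pos_total == 0 or v_neg_total == 0:
        raise ValueError("each sample and subsample must be non-empty")

    # Fraction of true positives among imperfect-classifier positives.
    ppv = v_pos_true / v_pos_total
    # Fraction of true positives among imperfect-classifier negatives.
    false_omission = v_neg_true / v_neg_total

    # Weight each verified fraction by the phase-1 share of its class.
    return (n_pos / n) * ppv + (n_neg / n) * false_omission


# Made-up counts: 1000 individuals screened, 100 verified in each class.
estimate = two_phase_prevalence(n_pos=300, n_neg=700,
                                v_pos_true=80, v_pos_total=100,
                                v_neg_true=10, v_neg_total=100)
print(round(estimate, 4))
```

With these made-up counts the estimate is 0.3 × 0.8 + 0.7 × 0.1 = 0.31. The sketch returns only a point estimate; comparing designs by standard error for a fixed cost, as the paper does, would also require a variance formula, which is not reproduced here.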
Methodology/principal findings: In this paper we consider an alternative strategy termed reclassification sampling, which involves classifying individuals using the imperfect classifier more than once. Estimates of sensitivity, specificity and prevalence are provided for reclassification sampling, when either one or two binary classifications of each individual using the imperfect classifier are available. The robustness of estimates and design decisions to model assumptions is considered. Software is provided to compute estimates and provide advice on the optimal sampling strategy.
Conclusions/significance: Reclassification sampling is shown to be cost-effective (lower standard error of estimates for the same cost) for estimating prevalence as compared to two-phase sampling in many practical situations.