Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 1988;41(9):923-37.
doi: 10.1016/0895-4356(88)90110-2.

Estimation of test error rates, disease prevalence and relative risk from misclassified data: a review

Affiliations
Review

Estimation of test error rates, disease prevalence and relative risk from misclassified data: a review

S D Walter et al. J Clin Epidemiol. 1988.

Abstract

We review methods for the analysis of categorical clinical and epidemiological data, in which the observations are subject to misclassification. Under certain conditions, it is possible to estimate error parameters such as sensitivity, specificity, relative risk, or predictive value, even though no definitive classification (gold standard) is available. The parameter estimates are obtained by modelling the data, using maximum likelihood, with or without some constraints. The models recognize that the true classification of an individual is unknown, and so are sometimes referred to as "latent class" models. The latent class approach provides a unified framework for various methods found in a dispersed literature, characterising each by the number of populations or subgroups in the data, and the number of observations made on each individual; the statistical degrees of freedom are implied by the sampling design. Data sets with less than three replicate observations per individual necessarily require constraints for parameter estimation to be possible. Data sets with three or more replicates lead directly to estimates of the misclassification rates, subject to some simple assumptions. Some more complex problems are also discussed, including data where the response variable has more than two levels, sequential and irregular designs and the effects of assumption violations.

PubMed Disclaimer

Publication types

LinkOut - more resources