Validation data-based adjustments for outcome misclassification in logistic regression: an illustration
- PMID: 21487295
- PMCID: PMC3454464
- DOI: 10.1097/EDE.0b013e3182117c85
Validation data-based adjustments for outcome misclassification in logistic regression: an illustration
Abstract
Misclassification of binary outcome variables is a known source of potentially serious bias when estimating adjusted odds ratios. Although researchers have described frequentist and Bayesian methods for dealing with the problem, these methods have seldom fully bridged the gap between statistical research and epidemiologic practice. In particular, there have been few real-world applications of readily grasped and computationally accessible methods that make direct use of internal validation data to adjust for differential outcome misclassification in logistic regression. In this paper, we illustrate likelihood-based methods for this purpose that can be implemented using standard statistical software. Using main study and internal validation data from the HIV Epidemiology Research Study, we demonstrate how misclassification rates can depend on the values of subject-specific covariates, and we illustrate the importance of accounting for this dependence. Simulation studies confirm the effectiveness of the maximum likelihood approach. We emphasize clear exposition of the likelihood function itself, to permit the reader to easily assimilate appended computer code that facilitates sensitivity analyses as well as the efficient handling of main/external and main/internal validation-study data. These methods are readily applicable under random cross-sectional sampling, and we discuss the extent to which the main/internal analysis remains appropriate under outcome-dependent (case-control) sampling.
Similar articles
-
Binary regression with differentially misclassified response and exposure variables.Stat Med. 2015 Apr 30;34(9):1605-20. doi: 10.1002/sim.6440. Epub 2015 Feb 4. Stat Med. 2015. PMID: 25652841 Free PMC article.
-
Conditional validation sampling for consistent risk estimation with binary outcome data subject to misclassification.Pharmacoepidemiol Drug Saf. 2019 Feb;28(2):227-233. doi: 10.1002/pds.4701. Pharmacoepidemiol Drug Saf. 2019. PMID: 30746841
-
Comparing external and internal validation methods in correcting outcome misclassification bias in logistic regression: A simulation study and application to the case of postsurgical venous thromboembolism following total hip and knee arthroplasty.Pharmacoepidemiol Drug Saf. 2019 Feb;28(2):217-226. doi: 10.1002/pds.4693. Epub 2018 Dec 4. Pharmacoepidemiol Drug Saf. 2019. PMID: 30515908
-
The effect of misclassification on the estimation of association: a review.Int J Methods Psychiatr Res. 2005;14(2):92-101. doi: 10.1002/mpr.20. Int J Methods Psychiatr Res. 2005. PMID: 16175878 Free PMC article. Review.
-
Correcting for exposure misclassification using an alloyed gold standard.Epidemiology. 1996 Jul;7(4):406-10. doi: 10.1097/00001648-199607000-00011. Epidemiology. 1996. PMID: 8793367 Review.
Cited by
-
Missingness in the Setting of Competing Risks: from missing values to missing potential outcomes.Curr Epidemiol Rep. 2018 Jun;5(2):153-159. doi: 10.1007/s40471-018-0142-3. Epub 2018 Mar 19. Curr Epidemiol Rep. 2018. PMID: 30386717 Free PMC article.
-
Overcome the Limitation of Phenome-Wide Association Studies (PheWAS): Extension of PheWAS to Efficient and Robust Large-Scale ICD Codes Analysis.medRxiv [Preprint]. 2024 Apr 19:2024.04.15.24305098. doi: 10.1101/2024.04.15.24305098. medRxiv. 2024. PMID: 38699370 Free PMC article. Preprint.
-
Core concepts in pharmacoepidemiology: Validation of health outcomes of interest within real-world healthcare databases.Pharmacoepidemiol Drug Saf. 2023 Jan;32(1):1-8. doi: 10.1002/pds.5537. Epub 2022 Sep 14. Pharmacoepidemiol Drug Saf. 2023. PMID: 36057777 Free PMC article. Review.
-
Regression Analysis for Differentially Misclassified Correlated Binary Outcomes.J R Stat Soc Ser C Appl Stat. 2015 Apr;64(3):433-449. doi: 10.1111/rssc.12081. J R Stat Soc Ser C Appl Stat. 2015. PMID: 26005223 Free PMC article.
-
Guidance of development, validation, and evaluation of algorithms for populating health status in observational studies of routinely collected data (DEVELOP-RCD).Mil Med Res. 2024 Aug 6;11(1):52. doi: 10.1186/s40779-024-00559-y. Mil Med Res. 2024. PMID: 39107834 Free PMC article.
References
-
- Thomas D, Stram D, Dwyer J. Exposure measurement error: influence on exposure-disease relationships and methods of correction. Ann Rev Public Health. 1993;14:69–93. - PubMed
-
- Bross IDJ. Misclassification in 2×2 tables. Biometrics. 1954;10:478–486.
-
- Barron BA. The effects of misclassification on the estimation of relative risk. Biometrics. 1977;33:414–418. - PubMed
-
- Kleinbaum D, Kupper L, Morgenstern H. Epidemiologic Research: Principles and Quantitative Methods. Lifetime Learning; Belmont, CA: 1982.
-
- Greenland S, Kleinbaum DG. Correcting for misclassification in two-way tables and matched-pair studies. Int J Epidemiol. 1983;12:93–97. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous