Selection and Misclassification Biases in Longitudinal Studies
- PMID: 29892604
- PMCID: PMC5985700
- DOI: 10.3389/fvets.2018.00099
Abstract
Using imperfect tests may lead to biased estimates of disease frequency and of measures of association. Many studies have examined the effect of misclassification on statistical inference. These evaluations were conducted either within a cross-sectional framework, assessing bias in prevalence, or for cohort designs, assessing bias in incidence rate or risk ratio estimates due to misclassification at one of the two time-points (initial assessment or follow-up). In longitudinal studies, however, both observations at risk and incident cases can be wrongly identified, leading to selection and misclassification biases, respectively. The objective of this paper was to evaluate the relative impact of these two biases, acting together, on measures of incidence and risk ratio. To investigate the impact on measures of disease frequency, data sets from a hypothetical cohort study with two samples collected one month apart were simulated and analyzed based on specific test and disease characteristics, with no elimination of disease during the sampling interval and no clustering of observations. The direction and magnitude of selection bias, misclassification bias, and total bias were assessed for diagnostic test sensitivity and specificity ranging from 0.7 to 1.0 and from 0.8 to 1.0, respectively, and for specific disease contexts, i.e., disease prevalences of 5 and 20%, and disease incidences of 0.01, 0.05, and 0.1 cases/animal-month. A hypothetical exposure with known strength of association was also generated. A total of 1,000 cohort studies of 1,000 observations each were simulated for these six disease contexts, with the same diagnostic test used to identify observations at risk at the beginning of the cohort and incident cases at its end. Our results indicated that the departure of the estimates of disease incidence and risk ratio from their true values was mainly a function of test specificity, disease prevalence, and disease incidence.
The combination of the two biases, at baseline and follow-up, revealed the importance of good to excellent specificity, relative to sensitivity, for the diagnostic test. Even small departures from perfect specificity quickly led to overestimation of disease incidence as true prevalence increased and true incidence decreased. Using a highly sensitive test to exclude diseased subjects at baseline mattered less for minimizing bias than using a highly specific one. Near-perfect diagnostic test attributes were even more important for obtaining a measure of association close to the true risk ratio, depending on specific disease characteristics, especially prevalence. Diseases with low prevalence and high incidence led to minimal bias when diagnosed with high sensitivity and close to perfect specificity at baseline and follow-up. For more prevalent diseases, we observed large risk ratio biases towards the null value, even with near-perfect diagnosis.
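The mechanism described above can be illustrated with a short closed-form calculation. This is only a sketch under stated assumptions, not the authors' simulation code: it computes expected values rather than simulating 1,000 cohorts, it assumes test errors at baseline and follow-up are independent given true disease status, it treats the monthly incidence as a one-month risk among truly disease-free observations, and it assumes true prevalence does not differ by exposure. The function and parameter names are hypothetical.

```python
def apparent_incidence(prev, inc, se, sp):
    """Expected apparent incidence risk when the same imperfect test is
    used at baseline and follow-up (assumption: independent test errors,
    no elimination of disease during the interval)."""
    # Observations that test negative at baseline form the apparent at-risk group.
    false_neg = prev * (1 - se)   # truly diseased, wrongly kept (selection bias)
    true_neg = (1 - prev) * sp    # truly disease-free, correctly kept
    at_risk = false_neg + true_neg

    # Baseline false negatives are still diseased at follow-up and
    # test positive with probability se.
    pos_from_false_neg = false_neg * se
    # Truly disease-free observations test positive at follow-up either as
    # true incident cases (inc * se) or as false positives (misclassification bias).
    pos_from_true_neg = true_neg * (inc * se + (1 - inc) * (1 - sp))

    return (pos_from_false_neg + pos_from_true_neg) / at_risk


def apparent_risk_ratio(prev, inc_unexposed, rr, se, sp):
    """Expected apparent risk ratio for an exposure with true risk ratio rr
    (assumption: prevalence is the same in exposed and unexposed groups)."""
    inc_exposed = rr * inc_unexposed
    return (apparent_incidence(prev, inc_exposed, se, sp)
            / apparent_incidence(prev, inc_unexposed, se, sp))


if __name__ == "__main__":
    # The six disease contexts named in the abstract, with one illustrative
    # imperfect test (se and sp values chosen here only as an example).
    for prev in (0.05, 0.20):
        for inc in (0.01, 0.05, 0.10):
            est = apparent_incidence(prev, inc, se=0.9, sp=0.95)
            print(f"prev={prev:.2f} true inc={inc:.2f} apparent inc={est:.4f}")
```

With a perfect test (sensitivity and specificity both 1.0) the expressions reduce to the true incidence and the true risk ratio, which gives a quick sanity check before exploring imperfect tests.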
Keywords: bias (epidemiology); epidemiologic methods; longitudinal study; misclassification; selection bias.