Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Dec 5;24(1):299.
doi: 10.1186/s12874-024-02432-x.

Detecting departures from the conditional independence assumption in diagnostic latent class models: a simulation study

Affiliations

Detecting departures from the conditional independence assumption in diagnostic latent class models: a simulation study

Yasin Okkaoglu et al. BMC Med Res Methodol. .

Abstract

Background: Latent class models can be used to estimate diagnostic accuracy without a gold standard test. Early studies often assumed independence between tests given the true disease state, however this can lead to biased estimates when there are inter-test dependencies. Residual correlation plots and chi-squared statistics have been commonly utilized to assess the validity of the conditional independence assumption and, when it does not hold, identify which test pairs are conditionally dependent. We aimed to assess the performance of these tools with a simulation study covering a wide range of scenarios.

Methods: We generated data sets from a model with four tests and a dependence between tests 1 and 2 within the diseased group. We varied sample size, prevalence, covariance, sensitivity and specificity, with 504 combinations of these in total, and 1000 data sets for each combination. We fitted the conditional independence model in a Bayesian framework, and reported absolute bias, coverage, and how often the residual correlation plots, G 2 and χ 2 statistics indicated lack-of-fit globally or for each test pair.

Results: Across all settings, residual correlation plots, pairwise G 2 and χ 2 detected the correct correlated pair of tests only 12.1%, 10.3%, and 10.3% of the time, respectively, but incorrectly suggested dependence between tests 3 and 4 64.9%, 49.7%, and 49.5% of the time. We observed some variation in this across parameter settings, with these tools appearing to perform more as intended when tests 3 and 4 were both much more accurate than tests 1 and 2. Residual correlation plots, G 2 and χ 2 statistics identified a lack of overall fit in 74.3%, 64.5% and 67.5% of models, respectively. The conditional independence model tended to overestimate the sensitivities of the correlated tests (median bias across all scenarios 0.094, 2.5th and 97.5th percentiles -0.003, 0.397) and underestimate prevalence and the specificities of the uncorrelated tests.

Conclusions: Residual correlation plots and chi-squared statistics cannot be relied upon to identify which tests are conditionally dependent, and also have relatively low power to detect lack of overall fit. This is important since failure to account for conditional dependence can lead to highly biased parameter estimates.

Keywords: Conditional independence; Diagnostic accuracy; Goodness of fit; Latent class model; Model selection; Residual correlation plots.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Bias jitter plots for the combinations with sensitivity = HHHH and varying specificities (π=0.5, ω=0.5, nobs=2000). The grey points represent median bias, and the error bars show 2.5th and 97.5th percentiles. The blue, pink, and green points represent bias in prevalence, sensitivity, and specificity estimates, respectively
Fig. 2
Fig. 2
Bias jitter plots for the combinations with sensitivity = LLHH and varying specificities (π=0.5, ω=0.5, nobs=2000). The grey points represent median bias, and the error bars show 2.5th and 97.5th percentiles. The blue, pink, and green points represent bias in prevalence, sensitivity, and specificity estimates, respectively

Similar articles

References

    1. Hui SL, Walter SD. Estimating the error rates of diagnostic-tests. Biometrics. 1980;36(1):167–71. - PubMed
    1. Young MA. Evaluating diagnostic criteria: a latent class paradigm. J Psychiatr Res. 1982;17(3):285–96. - PubMed
    1. Walter SD, Irwig LM. Estimation of test error rates, disease prevalence and relative risk from misclassified data - a review. J Clin Epidemiol. 1988;41(9):923–37. - PubMed
    1. Dendukuri N, Joseph L. Bayesian approaches to modeling the conditional dependence between multiple diagnostic tests. Biometrics. 2001;57(1):158–67. - PubMed
    1. Vacek PM. The effect of conditional dependence on the evaluation of diagnostic-tests. Biometrics. 1985;41(4):959–68. - PubMed

LinkOut - more resources