Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul 1;25(3):769-785.
doi: 10.1093/biostatistics/kxad020.

An integrative latent class model of heterogeneous data modalities for diagnosing kidney obstruction

Affiliations

An integrative latent class model of heterogeneous data modalities for diagnosing kidney obstruction

Jeong Hoon Jang et al. Biostatistics. .

Erratum in

  • Correction.
    [No authors listed] [No authors listed] Biostatistics. 2024 Dec 31;26(1):kxae029. doi: 10.1093/biostatistics/kxae029. Biostatistics. 2024. PMID: 39186534 Free PMC article. No abstract available.

Abstract

Radionuclide imaging plays a critical role in the diagnosis and management of kidney obstruction. However, most practicing radiologists in US hospitals have insufficient time and resources to acquire training and experience needed to interpret radionuclide images, leading to increased diagnostic errors. To tackle this problem, Emory University embarked on a study that aims to develop a computer-assisted diagnostic (CAD) tool for kidney obstruction by mining and analyzing patient data comprised of renogram curves, ordinal expert ratings on the obstruction status, pharmacokinetic variables, and demographic information. The major challenges here are the heterogeneity in data modes and the lack of gold standard for determining kidney obstruction. In this article, we develop a statistically principled CAD tool based on an integrative latent class model that leverages heterogeneous data modalities available for each patient to provide accurate prediction of kidney obstruction. Our integrative model consists of three sub-models (multilevel functional latent factor regression model, probit scalar-on-function regression model, and Gaussian mixture model), each of which is tailored to the specific data mode and depends on the unknown obstruction status (latent class). An efficient MCMC algorithm is developed to train the model and predict kidney obstruction with associated uncertainty. Extensive simulations are conducted to evaluate the performance of the proposed method. An application to an Emory renal study demonstrates the usefulness of our model as a CAD tool for kidney obstruction.

Keywords: Bayesian prediction; Function-on-scalar regression; Heterogeneous data modalities; Integrative modeling; Latent class model; Scalar-on-function regression.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Fig. 1
Fig. 1
The top panel depicts the mean baseline (left) and mean post-furosemide (right) renogram curves, stratified by the unanimous interpretation of the three nuclear medicine experts in the Emory renal study (black solid line for non-obstructed; red/black dashed line for equivocal; and green/gray solid line for obstructed). The bottom panel shows the baseline (left) and post-furosemide (right) renogram curves of 12 selected kidneys from the Emory renal study. Herein, 4 are unanimously interpreted as non-obstructed, 4 are unanimously interpreted as equivocal, and 4 are unanimously interpreted as obstructed by the three nuclear medicine experts (same coloring and line type as in the top panel).
Fig. 2
Fig. 2
Predictive probabilities of kidney obstruction and intervals representing the 5th and 95th percentiles of the posterior MCMC samples {P^(Cnew=1|D,Dnew,Θ[q]);q=1,,Q} for 50 testing kidneys. Black and green (gray in print) colors, respectively, denote non-obstructed and obstructed kidneys based on the semi-gold standard (review panel of nuclear medicine experts). Labels #6, #12, and #50 are indices of kidneys whose diagnoses differed between the proposed model and the semi-gold standard.

References

    1. Albert J. H., Chib S. (1993). Bayesian analysis of binary and polychotomous response data. Journal of the American Statistical Association 88, 669–679.
    1. Alonzo T. A., Pepe M. S., Lumley T. (2003). Estimating disease prevalence in two-phase studies. Biostatistics 4, 313–326. - PubMed
    1. Bhattacharya A., Dunson D. B. (2011). Sparse Bayesian infinite factor models. Biometrika 98, 291–306. - PMC - PubMed
    1. Chang C., Jang J. H., Manatunga A., Taylor A. T., Long Q. (2020). A Bayesian latent class model to predict kidney obstruction in the absence of gold standard. Journal of the American Statistical Association 115, 1645–1663. - PMC - PubMed
    1. Collins J., Huynh M. (2014). Estimation of diagnostic test accuracy without full verification: a review of latent class methods. Statistics in Medicine 33, 4141–4169. - PMC - PubMed

MeSH terms