Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec 12;24(1):177-192.
doi: 10.1093/biostatistics/kxab021.

Fast approximate inference for multivariate longitudinal data

Affiliations

Fast approximate inference for multivariate longitudinal data

David M Hughes et al. Biostatistics. .

Abstract

Collecting information on multiple longitudinal outcomes is increasingly common in many clinical settings. In many cases, it is desirable to model these outcomes jointly. However, in large data sets, with many outcomes, computational burden often prevents the simultaneous modeling of multiple outcomes within a single model. We develop a mean field variational Bayes algorithm, to jointly model multiple Gaussian, Poisson, or binary longitudinal markers within a multivariate generalized linear mixed model. Through simulation studies and clinical applications (in the fields of sight threatening diabetic retinopathy and primary biliary cirrhosis), we demonstrate substantial computational savings of our approximate approach when compared to a standard Markov Chain Monte Carlo, while maintaining good levels of accuracy of model parameters.

Keywords: Bayesian computing; Generalized linear mixed model; Markov chain Monte Carlo; Mean field variational Bayes; Multivariate mixed models; Repeated measurements.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Accuracy scores for mean field variational Bayes compared to MCMC for simulated data sets with three continuous longitudinal markers (top panel) and three types of markers (bottom panel) in the simulation with formula image individuals.
Fig. 2
Fig. 2
Model results for a 12 marker multivariate generalized linear mixed model in the diabetic retinopathy. Panel (a) shows the accuracy heat maps of the MFVB fixed effects estimates and residual standard deviations (compared to the MCMC estimates), (b) shows the accuracy of the MFVB random effects covariance matrix (compared to the MCMC estimates), and (c) shows the implied matrix of correlations between the 12 longitudinal markers calculated using MFVB.
Fig. 3
Fig. 3
Fitted longitudinal markers for mean field variational Bayes (dashed lines) compared to MCMC (solid lines) for the 12 markers in the diabetic retinopathy data, for three patients. The orange stars, green dots, and blue triangles show the observed values for three different patients, with the respectively colored lines showing the fitted models for each individual. All continuous values, including time, have been scaled prior to analysis and the results plotted here are in terms of the scaled variables. The y-axis of each plot shows the scale version of the variable noted in the title of each panel. The original units for each variable can be found in the description at the start of Section 5.

References

    1. Baltagi, B. (2008). Econometric Analysis of Panel Data. Chichester, UK: Wiley.
    1. Bishop, C. M. (2006). Pattern Recognition and Machine Learning. New York: Springer.
    1. Blei, D. M., Kucukelbir, A. and McAuliffe, J. D. (2017). Variational inference: a review for statisticians. Journal of the American Statistical Association 112, 859–877.
    1. Bürkner, P.-C. (2017). brms: an R package for Bayesian multilevel models using stan. Journal of Statistical Software 80, 1–28.
    1. Diggle, P. J., Heagerty, P., Liang, K.-Y., Heagerty, P. J. and Zeger, S. (2002). Analysis of Longitudinal Data. Oxford, UK: Oxford University Press.

Publication types

LinkOut - more resources