Biometrika. 2011 Jun;98(2):291-306.
doi: 10.1093/biomet/asr013.

Sparse Bayesian infinite factor models

A Bhattacharya et al. Biometrika. 2011 Jun.

Abstract

We focus on sparse modelling of high-dimensional covariance matrices using Bayesian latent factor models. We propose a multiplicative gamma process shrinkage prior on the factor loadings which allows introduction of infinitely many factors, with the loadings increasingly shrunk towards zero as the column index increases. We use our prior on a parameter-expanded loading matrix to avoid the order dependence typical in factor analysis models and develop an efficient Gibbs sampler that scales well as data dimensionality increases. The gain in efficiency is achieved by the joint conjugacy property of the proposed prior, which allows block updating of the loadings matrix. We propose an adaptive Gibbs sampler for automatically truncating the infinite loading matrix through selection of the number of important factors. Theoretical results are provided on the support of the prior and truncation approximation bounds. A fast algorithm is proposed to produce approximate Bayes estimates. Latent factor regression methods are developed for prediction and variable selection in applications with high-dimensional correlated predictors. Operating characteristics are assessed through simulation studies, and the approach is applied to predict survival times from gene expression data.
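
The shrinkage behaviour described in the abstract can be illustrated with a small simulation. The sketch below draws a truncated loading matrix from one common statement of a multiplicative gamma process prior: column precisions are cumulative products of gamma variables, and each loading also has its own local gamma precision. The abstract gives no formulas, so the exact form, the function name `sample_mgp_loadings`, and the hyperparameter values `a1`, `a2` and `nu` are all assumptions made for illustration rather than the authors' stated choices.

```python
import math
import random


def sample_mgp_loadings(p, k, a1=2.0, a2=3.0, nu=3.0, seed=0):
    """Draw a p x k loading matrix from an assumed multiplicative gamma process prior.

    Assumed form (not given in the abstract):
        delta_1 ~ Gamma(a1, 1), delta_l ~ Gamma(a2, 1) for l >= 2,
        tau_h = delta_1 * ... * delta_h            (column precision),
        phi_jh ~ Gamma(nu/2, rate = nu/2)          (local precision, mean 1),
        lambda_jh ~ Normal(0, 1 / (phi_jh * tau_h)).
    """
    rng = random.Random(seed)

    # Column precisions: a running product of gamma draws. When a2 > 1 the
    # factors exceed 1 on average, so tau_h tends to grow with h and later
    # columns tend to be shrunk harder towards zero.
    tau = []
    running = 1.0
    for h in range(k):
        shape = a1 if h == 0 else a2
        running *= rng.gammavariate(shape, 1.0)  # second argument is the scale
        tau.append(running)

    loadings = []
    for _ in range(p):
        row = []
        for h in range(k):
            # Gamma with shape nu/2 and rate nu/2 has scale 2/nu.
            phi = rng.gammavariate(nu / 2.0, 2.0 / nu)
            sd = 1.0 / math.sqrt(phi * tau[h])
            row.append(rng.gauss(0.0, sd))
        loadings.append(row)
    return loadings, tau


if __name__ == "__main__":
    loadings, tau = sample_mgp_loadings(p=6, k=5, seed=1)
    print("column precisions:", [round(t, 2) for t in tau])
    for row in loadings:
        print([round(x, 3) for x in row])
```

Truncating at a fixed `k` here is only a stand-in for the infinite loading matrix; the adaptive truncation mentioned in the abstract is not reproduced.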
