Sparse Bayesian infinite factor models
- PMID: 23049129
- PMCID: PMC3419391
- DOI: 10.1093/biomet/asr013
Sparse Bayesian infinite factor models
Abstract
We focus on sparse modelling of high-dimensional covariance matrices using Bayesian latent factor models. We propose a multiplicative gamma process shrinkage prior on the factor loadings which allows introduction of infinitely many factors, with the loadings increasingly shrunk towards zero as the column index increases. We use our prior on a parameter-expanded loading matrix to avoid the order dependence typical in factor analysis models and develop an efficient Gibbs sampler that scales well as data dimensionality increases. The gain in efficiency is achieved by the joint conjugacy property of the proposed prior, which allows block updating of the loadings matrix. We propose an adaptive Gibbs sampler for automatically truncating the infinite loading matrix through selection of the number of important factors. Theoretical results are provided on the support of the prior and truncation approximation bounds. A fast algorithm is proposed to produce approximate Bayes estimates. Latent factor regression methods are developed for prediction and variable selection in applications with high-dimensional correlated predictors. Operating characteristics are assessed through simulation studies, and the approach is applied to predict survival times from gene expression data.
References
-
- Amengual D, Watson M. Consistent estimation of the number of dynamic factors in a large N and T panel. J Bus Econ Statist. 2007;25:91–6.
-
- Ando T. Bayesian factor analysis with fat-tailed factors and its exact marginal likelihood. J Mult Anal. 2009;100:1717–26.
-
- Arminger G, Muthén B. A Bayesian approach to nonlinear latent variable models using the Gibbs sampler and the Metropolis–Hastings algorithm. Psychometrika. 1998;63:271–300.
-
- Bai J, Ng S. Determining the number of factors in approximate factor models. Econometrica. 2002;70:191–221.
-
- Bickel P, Levina E. Regularized estimation of large covariance matrices. Ann Statist. 2008;36:199–227.
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
