LARGE COVARIANCE ESTIMATION THROUGH ELLIPTICAL FACTOR MODELS
- PMID: 30214095
- PMCID: PMC6133289
- DOI: 10.1214/17-AOS1588
Abstract
We propose a general Principal Orthogonal complEment Thresholding (POET) framework for large-scale covariance matrix estimation based on the approximate factor model. To better understand how POET works, we establish a set of high-level sufficient conditions under which the procedure achieves optimal rates of convergence under different matrix norms. This framework allows us to recover existing results for sub-Gaussian data in a more transparent way that depends only on the concentration properties of the sample covariance matrix. As a new theoretical contribution, the framework allows us, for the first time, to exploit the conditional sparsity structure of the covariance for heavy-tailed data. In particular, for elliptical distributions, we propose a robust estimator based on the marginal and spatial Kendall's tau that satisfies these conditions. In addition, we study the conditional graphical model under the same framework. The technical tools developed in this paper are of general interest for high-dimensional principal component analysis. Thorough numerical results are also provided to back up the developed theory.
Keywords: approximate factor model; conditional graphical model; elliptical distribution; marginal and spatial Kendall’s tau; principal component analysis; sub-Gaussian family.
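As a rough illustration of the marginal Kendall's tau ingredient mentioned in the abstract, the following minimal sketch computes pairwise Kendall's tau and maps it to a correlation estimate through the standard relation rho = sin(pi * tau / 2), which holds for elliptical distributions with continuous margins. The function names (`kendall_tau`, `tau_correlation_matrix`) are hypothetical and chosen for this example only. The sketch covers neither the spatial Kendall's tau step nor the POET thresholding step, and it is not claimed to match the paper's implementation.

```python
import math
from itertools import combinations


def sign(v):
    """Return -1, 0, or 1 according to the sign of v."""
    return (v > 0) - (v < 0)


def kendall_tau(x, y):
    """Marginal Kendall's tau (tau-a) between two equal-length samples."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length, at least 2")
    # Sum of concordance signs over all unordered pairs of observations.
    total = sum(
        sign((x[i] - x[j]) * (y[i] - y[j]))
        for i, j in combinations(range(n), 2)
    )
    return 2.0 * total / (n * (n - 1))


def tau_correlation_matrix(data):
    """Rank-based correlation estimate from an n-by-p list of observations.

    Each off-diagonal entry is sin(pi/2 * tau_jk); the diagonal is 1.
    Assumption: the data are elliptically distributed with continuous
    margins, so that the sine transform recovers the Pearson correlation.
    """
    p = len(data[0])
    cols = [[row[j] for row in data] for j in range(p)]
    R = [[1.0] * p for _ in range(p)]
    for j, k in combinations(range(p), 2):
        r = math.sin(0.5 * math.pi * kendall_tau(cols[j], cols[k]))
        R[j][k] = R[k][j] = r
    return R


if __name__ == "__main__":
    # Toy data: 4 observations of 2 variables (illustrative values only).
    sample = [[1, 2], [2, 4], [3, 7], [4, 5]]
    for row in tau_correlation_matrix(sample):
        print(row)
```

Because the estimate depends on the data only through the signs of pairwise differences, a single extreme observation can change each entry by a bounded amount, which is the intuition behind using it for heavy-tailed data. The quadratic-time pair loop is kept for readability; it is not tuned for large samples.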