Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Aug;46(4):1383-1414.
doi: 10.1214/17-AOS1588. Epub 2018 Jun 27.

LARGE COVARIANCE ESTIMATION THROUGH ELLIPTICAL FACTOR MODELS

Affiliations

LARGE COVARIANCE ESTIMATION THROUGH ELLIPTICAL FACTOR MODELS

Jianqing Fan et al. Ann Stat. 2018 Aug.

Abstract

We propose a general Principal Orthogonal complEment Thresholding (POET) framework for large-scale covariance matrix estimation based on the approximate factor model. A set of high level sufficient conditions for the procedure to achieve optimal rates of convergence under different matrix norms is established to better understand how POET works. Such a framework allows us to recover existing results for sub-Gaussian data in a more transparent way that only depends on the concentration properties of the sample covariance matrix. As a new theoretical contribution, for the first time, such a framework allows us to exploit conditional sparsity covariance structure for the heavy-tailed data. In particular, for the elliptical distribution, we propose a robust estimator based on the marginal and spatial Kendall's tau to satisfy these conditions. In addition, we study conditional graphical model under the same framework. The technical tools developed in this paper are of general interest to high dimensional principal component analysis. Thorough numerical results are also provided to back up the developed theory.

Keywords: approximate factor model; conditional graphical model; elliptical distribution; marginal and spatial Kendall’s tau; principal component analysis; sub-Gaussian family.

PubMed Disclaimer

Figures

Fig 1
Fig 1
Conditional sparse covariance matrix estimation. The 12 plots correspond to logarithms (base 2) of the ratios of average errors of the original and the robust POET estimators, measured in different norms. Data were generated from multivariate t-distribution with degree of freedom ν = 4.2 (black dotted), ν = 7 (blue dashed), ν = ∞ (orange solid) with p from 100 to 1000, n = p/2 and m = 3. 100 simulations were conducted for each p.
Fig 2
Fig 2
Comparison of relative Frobenius norms. The plots correspond to average errors of the original POET (for ν =∞, solid) and the robust POET (for ν = 4.2, dotted). In each setting, we compare ^T (red) and ^uT (blue) in the left panel and compare L1/2(Γ^Λ^Γ^BB)L1/2F (red) and L1/2BBL1/2F (blue) in the right panel.
Fig 3
Fig 3
Conditional graphical model estimation. The plots correspond to log ratios (base 2) of average errors of the original and the robust POET estimators for Ω^u and Ω^, measured in spectral norms. Data were generated from multivariate t-distribution with degree of freedom ν = 4.2 (black dotted), ν = 7 (blue dashed), ν = ∞ (orange solid) with p from 50 to 500, n = 0.6p and m = 3. 100 simulations were conducted for each p.

Similar articles

Cited by

References

    1. Agarwal A, Negahban S, Wainwright MJ. Noisy matrix decomposition via convex relaxation: Optimal rates in high dimensions. The Annals of Statistics. 2012;40:1171–1197.
    1. Amini AA, Wainwright MJ. Information Theory, 2008 ISIT 2008 IEEE International Symposium on 2454–2458. IEEE; 2008. High-dimensional analysis of semidefinite relaxations for sparse principal components.
    1. Antoniadis A, Fan J. Regularization of wavelet approximations. Journal of the American Statistical Association. 2001;96
    1. Bai J, Li K. Statistical analysis of factor models of high dimension. The Annals of Statistics. 2012;40:436–465.
    1. Bai J, Ng S. Determining the number of factors in approximate factor models. Econometrica. 2002;70:191–221.

LinkOut - more resources