Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016;111(514):834-845.
doi: 10.1080/01621459.2015.1058265. Epub 2016 Aug 18.

Convex Banding of the Covariance Matrix

Affiliations

Convex Banding of the Covariance Matrix

Jacob Bien et al. J Am Stat Assoc. 2016.

Abstract

We introduce a new sparse estimator of the covariance matrix for high-dimensional models in which the variables have a known ordering. Our estimator, which is the solution to a convex optimization problem, is equivalently expressed as an estimator which tapers the sample covariance matrix by a Toeplitz, sparsely-banded, data-adaptive matrix. As a result of this adaptivity, the convex banding estimator enjoys theoretical optimality properties not attained by previous banding or tapered estimators. In particular, our convex banding estimator is minimax rate adaptive in Frobenius and operator norms, up to log factors, over commonly-studied classes of covariance matrices, and over more general classes. Furthermore, it correctly recovers the bandwidth when the true covariance is exactly banded. Our convex formulation admits a simple and efficient algorithm. Empirical studies demonstrate its practical effectiveness and illustrate that our exactly-banded estimator works well even when the true covariance matrix is only close to a banded matrix, confirming our theoretical results. Our method compares favorably with all existing methods, in terms of accuracy and speed. We illustrate the practical merits of the convex banding estimator by showing that it can be used to improve the performance of discriminant analysis for classifying sound recordings.

Keywords: High-dimensional; adaptive; hierarchical group lasso; structured sparsity.

PubMed Disclaimer

Figures

Figure 1
Figure 1
(Left) The group g3; (Middle) the nested groups in the penalty (2.1); (Right) a K-banded matrix where K = 2 and p = 6. Note that L = p − K − 1 = 2 counts the number of subdiagonals that are zero. This can be expressed as gL=0 or s=0 for 1 ≤ ℓ ≤ L.
Figure 2
Figure 2
Convergence in Frobenius norm. Monte Carlo estimate of (Left) E^F2/p and (Right) E^F2/(plogp) as a function of K, the bandwidth of Σ*
Figure 3
Figure 3
Comparison of our procedure with simple and general weights.
Figure 4
Figure 4
(Left:) For large n, expected E^F2/p is seen to decay like n−1 as can be seen from −1 slope of log-log plot (gray lines are of slope −1). (Right:) Scaling this quantity by log p aligns the curves for large n. Both of these phenomena are suggested by Corollary 2.
Figure 5
Figure 5
Comparison of methods in three settings in terms of (Top) squared Frobenius norm, relative to SF2 and (Bottom) operator norm relative to ‖S − Σop.
Figure 6
Figure 6
Comparison of methods in terms of (Left) squared Frobenius norm and (Right) operator norm in a setting where ‖Σop is increasing.
Figure 7
Figure 7
Test set misclassification error rates of discriminant analysis of phoneme data. For both QDA and LDA, using a regularized covariance matrix improves the misclassification error rate.

References

    1. Bach F, Jenatton R, Mairal J, Obozinski G. Structured sparsity through convex optimization. Statistical Science. 2012;27(4):450–468.
    1. Bickel PJ, Levina E. Regularized estimation of large covariance matrices. The Annals of Statistics. 2008:199–227.
    1. Bien J, Taylor J, Tibshirani R. A lasso for hierarchical interactions. The Annals of Statistics. 2013;41(3):1111–1141. - PMC - PubMed
    1. Bunea F, Xiao L. On the sample covariance matrix estimator of reduced ef fective rank population matrices, with applications to fpca. To appear in Bernoulli. 2014 arXiv:1212.5321.
    1. Cai TT, Yuan M. Adaptive covariance matrix estimation through block thresh-olding. The Annals of Statistics. 2012;40(4):2014–2042.