Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021;63(5):626-649.
doi: 10.1007/s10851-021-01019-1. Epub 2021 Feb 15.

Robust PCA via Regularized Reaper with a Matrix-Free Proximal Algorithm

Affiliations

Robust PCA via Regularized Reaper with a Matrix-Free Proximal Algorithm

Robert Beinert et al. J Math Imaging Vis. 2021.

Abstract

Principal component analysis (PCA) is known to be sensitive to outliers, so that various robust PCA variants were proposed in the literature. A recent model, called reaper, aims to find the principal components by solving a convex optimization problem. Usually the number of principal components must be determined in advance and the minimization is performed over symmetric positive semi-definite matrices having the size of the data, although the number of principal components is substantially smaller. This prohibits its use if the dimension of the data is large which is often the case in image processing. In this paper, we propose a regularized version of reaper which enforces the sparsity of the number of principal components by penalizing the nuclear norm of the corresponding orthogonal projector. If only an upper bound on the number of principal components is available, our approach can be combined with the L-curve method to reconstruct the appropriate subspace. Our second contribution is a matrix-free algorithm to find a minimizer of the regularized reaper which is also suited for high-dimensional data. The algorithm couples a primal-dual minimization approach with a thick-restarted Lanczos process. This appears to be the first efficient convex variational method for robust PCA that can handle high-dimensional data. As a side result, we discuss the topic of the bias in robust PCA. Numerical examples demonstrate the performance of our algorithm.

Keywords: Matrix-free PCA; PCA offset; Regularized reaper; Robust PCA; Thick-restarted Lanczos algorithm.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Performance of rreaper with (2,1)-norm and Frobenius norm in the data fidelity term, respectively. The first one appears to be more robust against outliers
Fig. 2
Fig. 2
Performance of rreaper for different upper dimension estimators d=10,100 and regularization parameters α=14,12,34,1,2 (top down for each d)
Fig. 3
Fig. 3
Mean reconstruction error ||Π^-ΠL||tr for varying α and d. For every parameter choice, the experiment has been repeated 25 times
Fig. 4
Fig. 4
The L-curve for d=25 of the data set in Sect. 8.2 consisting of 100 inliers (σin=1) and 25 outliers (σout=1) with noise σnoise=0.1
Fig. 5
Fig. 5
The two components of the L-curves in Fig. 4 in dependence of the regularization parameter α
Fig. 6
Fig. 6
The used data set based on the Extended Yale Face Data Set B and its projections onto the determined subspace using matrix-free rreaper
Fig. 7
Fig. 7
Restoration of the first sample by projecting to the principle components determined by several robust PCA’s
Fig. 8
Fig. 8
Restoration of the artifact in the first sample by projecting to the principle components determined by several robust PCA’s

References

    1. Beck A. First-Order Methods in Optimization. Philadelphia: Society for Industrial and Applied Mathematics (SIAM), Mathematical Optimization Society; 2017.
    1. Beck A, Sabach S. Weiszfeld’s method: old and new results. J. Optim. Theory Appl. 2015;164(1):1–40. doi: 10.1007/s10957-014-0586-7. - DOI
    1. Burger M, Sawatzky A, Steidl G. First order algorithms in variational image processing. In: Glowinski R, Osher SJ, Yin W, editors. Splitting Methods in Communication, Imaging, Science, and Engineering. Cham: Springer; 2016. pp. 345–407.
    1. Candes EJ, Li X, Ma Y, Wright J. Robust principal component analysis? J. ACM. 2011;58(3):11. doi: 10.1145/1970392.1970395. - DOI
    1. Chambolle A, Pock T. A first-order primal-dual algorithm for convex problems with applications to imaging. J. Math. Imaging Vis. 2011;40(1):120–145. doi: 10.1007/s10851-010-0251-1. - DOI

LinkOut - more resources