Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Mar;100(1):75-89.
doi: 10.1093/biomet/ass068.

Efficient Gaussian process regression for large datasets

Affiliations

Efficient Gaussian process regression for large datasets

Anjishnu Banerjee et al. Biometrika. 2013 Mar.

Abstract

Gaussian processes are widely used in nonparametric regression, classification and spatiotemporal modelling, facilitated in part by a rich literature on their theoretical properties. However, one of their practical limitations is expensive computation, typically on the order of n3 where n is the number of data points, in performing the necessary matrix inversions. For large datasets, storage and processing also lead to computational bottlenecks, and numerical stability of the estimates and predicted values degrades with increasing n. Various methods have been proposed to address these problems, including predictive processes in spatial data analysis and the subset-of-regressors technique in machine learning. The idea underlying these approaches is to use a subset of the data, but this raises questions concerning sensitivity to the choice of subset and limitations in estimating fine-scale structure in regions that are not well covered by the subset. Motivated by the literature on compressive sensing, we propose an alternative approach that involves linear projection of all the data points onto a lower-dimensional subspace. We demonstrate the superiority of this approach from a theoretical perspective and through simulated and real data examples.

Keywords: Bayesian regression; Compressive sensing; Dimensionality reduction; Gaussian process; Random projection.

PubMed Disclaimer

References

    1. Adler RJ. IMS Lecture Notes–Monograph Series. vol. 12. Institute of Mathematical Statistics; Hayward: 1990. An Introduction to Continuity, Extrema, and Related Topics for General Gaussian Processes; pp. 75–6.
    1. Banerjee S, Gelfand AE, Finley AO, Sang H. Gaussian predictive process models for large spatial data sets. J. R. Statist. Soc. B. 2008;70:825–48. - PMC - PubMed
    1. Bhatia R. Matrix Analysis. Springer; New York: 1997.
    1. Candès EJ, Romberg J, Tao T. Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Trans. Info. Theory. 2006;52:489–509.
    1. Choi T, Schervish MJ. On posterior consistency in nonparametric regression problems. J. Mult. Anal. 2007;98:1969–87.