Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jun;12(2):583-614.
doi: 10.1214/17-BA1056R. Epub 2017 May 16.

High-Dimensional Bayesian Geostatistics

Affiliations

High-Dimensional Bayesian Geostatistics

Sudipto Banerjee. Bayesian Anal. 2017 Jun.

Abstract

With the growing capabilities of Geographic Information Systems (GIS) and user-friendly software, statisticians today routinely encounter geographically referenced data containing observations from a large number of spatial locations and time points. Over the last decade, hierarchical spatiotemporal process models have become widely deployed statistical tools for researchers to better understand the complex nature of spatial and temporal variability. However, fitting hierarchical spatiotemporal models often involves expensive matrix computations with complexity increasing in cubic order for the number of spatial locations and temporal points. This renders such models unfeasible for large data sets. This article offers a focused review of two methods for constructing well-defined highly scalable spatiotemporal stochastic processes. Both these processes can be used as "priors" for spatiotemporal random fields. The first approach constructs a low-rank process operating on a lower-dimensional subspace. The second approach constructs a Nearest-Neighbor Gaussian Process (NNGP) that ensures sparse precision matrices for its finite realizations. Both processes can be exploited as a scalable prior embedded within a rich hierarchical modeling framework to deliver full Bayesian inference. These approaches can be described as model-based solutions for big spatiotemporal datasets. The models ensure that the algorithmic complexity has ~ n floating point operations (flops), where n the number of spatial locations (per iteration). We compare these methods and provide some insight into their methodological underpinnings.

Keywords: Bayesian statistics; Gaussian process; Nearest Neighbor Gaussian process (NNGP); low rank Gaussian process; predictive process; sparse Gaussian process; spatiotemporal statistics.

PubMed Disclaimer

Figures

Figure 1
Figure 1
95% credible intervals for the nugget for 40 different low-rank radial-basis models with knots varying between 5 and 200 in steps of 5. The horizontal line at τ2 = 5 denotes the true value of τ2 with which the data was simulated.
Figure 2
Figure 2
Comparing estimates of a simulated random field using a full Gaussian Process (Full GP) and a Gaussian Predictive process (PPGP) with 64 knots. The oversmoothing by the low-rank predictive process is evident.
Figure 3
Figure 3
Sparsity using directed acyclic graphs.
Figure 4
Figure 4
Structure of the factors making up the sparse Kθ-1 matrix.
Figure 5
Figure 5
95% credible intervals for the effective spatial range from an NNGP model with m = 10 and a full GP model fitted to 10 different simulated datasets with true effective range fixed at values between 0.1 and 1.0 in increments of 0.1.

References

    1. Anderson TW. An Introduction to Multivariate Statistical Analysis. 3 New York, NY: Wiley; 2003.
    1. Banerjee S, Carlin BP, Gelfand AE. Hierarchical Modeling and Analysis for Spatial Data. 2 Boca Raton, FL: Chapman & Hall/CRC; 2014.
    1. Banerjee S, Finley AO, Waldmann P, Ericcson T. Hierarchical spatial process models for multiple traits in large genetic trials. Journal of the American Statistical Association. 2010;105:506–521. doi: http://dx.doi.org/10.1198/jasa.2009.ap09068. - DOI - PMC - PubMed
    1. Banerjee S, Gelfand AE, Finley AO, Sang H. Gaussian predictive process models for large spatial datasets. Journal of the Royal Statistical Society, Series B. 2008;70:825–848. - PMC - PubMed
    1. Barry R, Ver Hoef J. Blackbox kriging: Spatial prediction without specifying variogram models. Journal of Agricultural, Biological and Environmental Statistics. 1996;1:297–322.