Neural networks for geospatial data

Wentao Zhan¹, Abhirup Datta¹

Affiliations

PMID: 40255678
PMCID: PMC12007808 (available on 2026-01-01)
DOI: 10.1080/01621459.2024.2356293

Neural networks for geospatial data

Wentao Zhan et al. J Am Stat Assoc. 2025.

. 2025;120(549):535-547.

doi: 10.1080/01621459.2024.2356293. Epub 2024 Jun 24.

Authors

Wentao Zhan¹, Abhirup Datta¹

Affiliation

¹ Department of Biostatistics, Johns Hopkins University.

PMID: 40255678
PMCID: PMC12007808 (available on 2026-01-01)
DOI: 10.1080/01621459.2024.2356293

Abstract

Analysis of geospatial data has traditionally been model-based, with a mean model, customarily specified as a linear regression on the covariates, and a Gaussian process covariance model, encoding the spatial dependence. While nonlinear machine learning algorithms like neural networks are increasingly being used for spatial analysis, current approaches depart from the model-based setup and cannot explicitly incorporate spatial covariance. We propose NN-GLS, embedding neural networks directly within the traditional Gaussian process (GP) geostatistical model to accommodate non-linear mean functions while retaining all other advantages of GP, like explicit modeling of the spatial covariance and predicting at new locations via kriging. In NN-GLS, estimation of the neural network parameters for the non-linear mean of the Gaussian Process explicitly accounts for the spatial covariance through use of the generalized least squares (GLS) loss, thus extending the linear case. We show that NN-GLS admits a representation as a special type of graph neural network (GNN). This connection facilitates the use of standard neural network computational techniques for irregular geospatial data, enabling novel and scalable mini-batching, backpropagation, and kriging schemes. We provide methodology to obtain uncertainty bounds for estimation and predictions from NN-GLS. Theoretically, we show that NN-GLS will be consistent for irregularly observed spatially correlated data processes. We also provide a finite sample concentration rate, which quantifies the need to accurately model the spatial covariance in neural networks for dependent data. To our knowledge, these are the first large-sample results for any neural network algorithm for irregular spatial data. We demonstrate the methodology through numerous simulations and an application to air pollution modeling. We develop a software implementation of NN-GLS in the Python package geospaNN.

Keywords: Gaussian process; consistency; geostatistics; graph neural networks; kriging; machine learning; neural networks.

PubMed Disclaimer

References

1. Abramowitz M and Stegun IA (1948), Handbook of mathematical functions with formulas, graphs, and mathematical tables, Vol. 55, US Government printing office.
1. Banerjee S, Carlin BP and Gelfand AE (2014), Hierarchical modeling and analysis for spatial data, CRC press.
1. Breiman L. (2001), ‘Random forests’, Machine learning 45(1), 5–32.
1. Chen W, Li Y, Reich BJ and Sun Y (2024), ‘Deepkriging: Spatially dependent deep neural networks for spatial prediction’, Statistica Sinica 34, 291–311.
1. Cressie N and Wikle CK (2015), Statistics for spatio-temporal data, John Wiley & Sons.

Grants and funding

R01 ES033739/ES/NIEHS NIH HHS/United States

LinkOut - more resources

Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Neural networks for geospatial data

Affiliation

Neural networks for geospatial data

Authors

Affiliation

Abstract

References

Grants and funding

LinkOut - more resources

Miscellaneous