Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Sep;68(3):837-48.
doi: 10.1111/j.1541-0420.2011.01725.x. Epub 2011 Dec 29.

Space-time data fusion under error in computer model output: an application to modeling air quality

Affiliations

Space-time data fusion under error in computer model output: an application to modeling air quality

Veronica J Berrocal et al. Biometrics. 2012 Sep.

Abstract

We provide methods that can be used to obtain more accurate environmental exposure assessment. In particular, we propose two modeling approaches to combine monitoring data at point level with numerical model output at grid cell level, yielding improved prediction of ambient exposure at point level. Extending our earlier downscaler model (Berrocal, V. J., Gelfand, A. E., and Holland, D. M. (2010b). A spatio-temporal downscaler for outputs from numerical models. Journal of Agricultural, Biological and Environmental Statistics 15, 176-197), these new models are intended to address two potential concerns with the model output. One recognizes that there may be useful information in the outputs for grid cells that are neighbors of the one in which the location lies. The second acknowledges potential spatial misalignment between a station and its putatively associated grid cell. The first model is a Gaussian Markov random field smoothed downscaler that relates monitoring station data and computer model output via the introduction of a latent Gaussian Markov random field linked to both sources of data. The second model is a smoothed downscaler with spatially varying random weights defined through a latent Gaussian process and an exponential kernel function, that yields, at each site, a new variable on which the monitoring station data is regressed with a spatial linear model. We applied both methods to daily ozone concentration data for the Eastern US during the summer months of June, July and August 2001, obtaining, respectively, a 5% and a 15% predictive gain in overall predictive mean square error over our earlier downscaler model (Berrocal et al., 2010b). Perhaps more importantly, the predictive gain is greater at hold-out sites that are far from monitoring sites.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Training and validation sites used to fit and assess the out-of-sample predictive performance of the ordinary kriging model, the downscaler, the GMRF smoothed downscaler and the smoothed downscaler using spatially varying random weights.
Figure 2
Figure 2
(a) Daily mean (filled circles) and standard deviation (empty circles) of square root of observed ozone concentration at all the 800 monitoring sites. (b) Daily correlation betweeen square root of observed ozone concentration and square root of CMAQ output of ozone concentration. In both plots, the three days for which we will present results in Section 5 are surrounded by a box. They are, respectively, July 4, July 20, and August 9, 2001.
Figure 3
Figure 3
Spatial maps of: (a) the square root of the CMAQ output, x(B, t), (b) the posterior mean of (B, t), and (c) the posterior mean of (B, t), for July 4, 2001 for a subregion in the Northeast.
Figure 4
Figure 4
(a)–(c) Observed ozone concentration (ppb) on August 9, 2001 in two subregions of the Eastern US. (b)–(d) Posterior predictive mean of ozone concentration on August 9, 2001 as yielded by the smoothed downscaler with spatially varying random weights.
Figure 5
Figure 5
(a) Location of the four sites for which we are displaying the posterior predictive mean of the spatially varying random weights wk (s, t). (b)–(e) Posterior predictive mean of the spatially varying random weights wk (s, t) for sites: (b) s1; (c) s2; (d) s3; and (e) s4 on July 4, 2001.

References

    1. Banerjee S, Carlin BP, Gelfand AE. Hierarchical Modeling and Analysis for Spatial Data. Boca Raton, FL: Chapman & Hall/CRC; 2004.
    1. Banerjee S, Gelfand AE, Finley AO, Sang H. Gaussian predictive process models for large spatial datasets. Journal of the Royal Statistical Society, Series B. 2008;70:825–848. - PMC - PubMed
    1. Berrocal VJ, Gelfand AE, Holland DM. A bivariate spatio-temporal downscaler under space and time misalignment. Annals of Applied Statistics. 2010a;4:1942–1975. - PMC - PubMed
    1. Berrocal VJ, Gelfand AE, Holland DM. A spatio-temporal downscaler for outputs from numerical models. Journal of Agricultural, Biological and Environmental Statistics. 2010b;15:176–197. - PMC - PubMed
    1. Besag J. Spatial interaction and the statistical analysis of lattice systems. Journal of the Royal Statistical Society, Series B. 1974;136:192–236.