Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jan;66(1):3-28.
doi: 10.1111/rssc.12148. Epub 2016 Apr 6.

A novel principal component analysis for spatially misaligned multivariate air pollution data

Affiliations

A novel principal component analysis for spatially misaligned multivariate air pollution data

Roman A Jandarov et al. J R Stat Soc Ser C Appl Stat. 2017 Jan.

Abstract

We propose novel methods for predictive (sparse) PCA with spatially misaligned data. These methods identify principal component loading vectors that explain as much variability in the observed data as possible, while also ensuring the corresponding principal component scores can be predicted accurately by means of spatial statistics at locations where air pollution measurements are not available. This will make it possible to identify important mixtures of air pollutants and to quantify their health effects in cohort studies, where currently available methods cannot be used. We demonstrate the utility of predictive (sparse) PCA in simulated data and apply the approach to annual averages of particulate matter speciation data from national Environmental Protection Agency (EPA) regulatory monitors.

Keywords: Air pollution; Dimension reduction; Principal component analysis; Spatial misalignment; Universal kriging.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Scatterplots of the predictions of the scores without a penalty. These are obtained using cross-validation and UK
Fig. 2
Fig. 2
The principal component loadings from two PCA approaches without penalty
Fig. 3
Fig. 3
The principal component loadings from two PCA approaches with penalty
Fig. 4
Fig. 4
Scatterplots of the predictions of the scores with a penalty to maximize predictability of the pollutants
Fig. 5
Fig. 5
Heatmaps based on the predictions of the scores from predictive sparse PCA

Similar articles

Cited by

References

    1. Abdi H. Partial least squares regression (pls-regression) Encyclopedia for research methods for the social sciences. 2003:792–795.
    1. Analitis A, Michelozzi P, DIppoliti D, deDonato F, Menne B, Matthies F, Atkinson RW, Iñiguez C, Basagaña X, Schneider A, et al. Effects of heat waves on mortality: effect modification and confounding by air pollutants. Epidemiology. 2014;25(1):15–22. - PubMed
    1. Anderson TW. An Introduction to Multivariate Statistical Analysis. 2003. (Wiley Series in Probability and Statistics).
    1. Bell ML, Davis DL. Reassessment of the lethal London fog of 1952: novel indicators of acute and chronic consequences of acute exposure to air pollution. Environmental health perspectives. 2001;109(Suppl 3):389. - PMC - PubMed
    1. Bergen S, Sheppard L, Sampson PD, Kim S-Y, Richards M, Vedal S, Kaufman JD, Szpiro AA. A National Prediction Model for PM2.5 Component Exposures and Measurement Error–Corrected Health Effect Inference. Environmental health perspectives. 2013;121(9):1017. - PMC - PubMed

Publication types

LinkOut - more resources