Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jun 17;16(12):2137.
doi: 10.3390/ijerph16122137.

Machine Learning-Based Integration of High-Resolution Wildfire Smoke Simulations and Observations for Regional Health Impact Assessment

Affiliations

Machine Learning-Based Integration of High-Resolution Wildfire Smoke Simulations and Observations for Regional Health Impact Assessment

Yufei Zou et al. Int J Environ Res Public Health. .

Abstract

Large wildfires are an increasing threat to the western U.S. In the 2017 fire season, extensive wildfires occurred across the Pacific Northwest (PNW). To evaluate public health impacts of wildfire smoke, we integrated numerical simulations and observations for regional fire events during August-September of 2017. A one-way coupled Weather Research and Forecasting and Community Multiscale Air Quality modeling system was used to simulate fire smoke transport and dispersion. To reduce modeling bias in fine particulate matter (PM2.5) and to optimize smoke exposure estimates, we integrated modeling results with the high-resolution Multi-Angle Implementation of Atmospheric Correction satellite aerosol optical depth and the U.S. Environmental Protection Agency AirNow ground-level monitoring PM2.5 concentrations. Three machine learning-based data fusion algorithms were applied: An ordinary multi-linear regression method, a generalized boosting method, and a random forest (RF) method. 10-Fold cross-validation found improved surface PM2.5 estimation after data integration and bias correction, especially with the RF method. Lastly, to assess transient health effects of fire smoke, we applied the optimized high-resolution PM2.5 exposure estimate in a short-term exposure-response function. Total estimated regional mortality attributable to PM2.5 exposure during the smoke episode was 183 (95% confidence interval: 0, 432), with 85% of the PM2.5 pollution and 95% of the consequent multiple-cause mortality contributed by fire emissions. This application demonstrates both the profound health impacts of fire smoke over the PNW and the need for a high-performance fire smoke forecasting and reanalysis system to reduce public health risks of smoke hazards in fire-prone regions.

Keywords: PM2.5 air pollution; fire smoke modeling; health impact assessment; machine learning-based data fusion.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Figures

Figure 1
Figure 1
Fire hotspots and smoke over the PNW region on 9/5/2017. (a) satellite imagery detected by the NASA/NOAA Suomi NPP satellite at 20:36 UTC (13:36 PDT) (credit: Image by the NASA’s Land, Atmosphere Near real-time Capability for EOS (LANCE/EOSDIS) Rapid Response team). (b) WRF-CMAQ PM2.5 simulation based on the SENS experiment with all sources. PNW: Pacific Northwest. NASA: National Aeronautics and Space Administration. NOAA: National Oceanic and Atmospheric Administration. NPP: National Polar-orbiting Partnership. WRF: Weather Research and Forecasting model. CMAQ: Community Multiscale Air Quality model. PM2.5: Fine particulate matter. SENS: sensitivity experiment. UTC: Coordinated Universal Time; PDT: Pacific Daylight Time; EOS: Earth Observing System.
Figure 2
Figure 2
Comparison of the MAIAC AOD at 550 nm and model simulated AOD results at 20:15 UTC (13:15 PDT) on 9/7/2017. (a) the MAIAC AOD product onboard the MODIS Aqua satellite; (b) CMAQ simulated AOD; (c) bias-adjusted AOD by the RF method; (d) same as (c) but by the GBM method. The r-values on top-left corners of the subplots (b)–(d) denote spatial correlation coefficients between each model result and corresponding MAIAC AOD product. MAIAC: Multi-Angle Implementation of Atmospheric Correction. AOD: aerosol optical depth. MODIS: Moderate Resolution Imaging Spectroradiometer. CMAQ: Community Multiscale Air Quality model. RF: Random forest. GBM: Generalized boosting model.
Figure 3
Figure 3
Comparisons of the aerosol vertical distribution at 11:30 UTC (04:30 PDT) on 9/7/2017 between the CATS satellite retrievals and the CMAQ simulation. (a) CATS total attenuated backscatter (unit: km−1 sr−1) at 1064 nm; (b) CATS aerosol types; (c) 2-D cross section of PM2.5 concentrations (unit: μg m3) from the WRF-CMAQ SENS experiment along the CATS satellite track. Please note that the X-axis range in subplot (c) is different due to the smaller model domain. (credit: The CATS products in subplots (a) and (b) were produced and distributed by NASA Goddard Space Flight Center). CATS: Cloud-Aerosol Transport System. SENS: sensitivity experiment.
Figure 4
Figure 4
Statistical evaluation results of the raw WRF-CMAQ SENS simulated PM2.5 surface concentrations during 8/15–9/14/2017: (a) monthly averaged PM2.5 surface concentrations (unit: μg m−3). The black circles and red numbers denote 16 selected ground sites for the time series evaluation in Figure 5; (b) temporal correlations (unitless) between the SENS simulation and AirNow observations at each ground site; (c) fractional biases (unit: 100%) based on the SENS simulation and AirNow observations; (d) RMSE (unit: μg m−3) based on the SENS simulation and AirNow observations. RMSE: root mean squared error.
Figure 5
Figure 5
Time series of daily PM2.5 concentrations at the 16 selected AirNow ground sites in Figure 4a. The red solid lines denote AirNow ground measurements. The blue dashed lines denote CMAQ simulation results. The green dashed lines denote the MLR results. The golden dashed lines denote the RF results. The purple dashed lines denote the GBM results. The numbers on top-left corners of each subplot denote temporal correlations between these model results and observations. MLR: Multi-linear regression.
Figure 6
Figure 6
Comparisons of daily PM2.5 model simulation before and after data integration with AirNow ground observations during the fire smoke pollution episode. (a) a scatter plot of PM2.5 concentrations on a log scale based on the raw WRF-CMAQ SENS simulation and AirNow observations; (b) same as (a) but based on the MLR data integration method; (c) same as (a) but based on the RF method; (d) same as (a) but based on the GBM method. The color shading in all subplots denotes the PM2.5 data density in terms of sample counts.
Figure 7
Figure 7
Comparison of PM2.5 concentrations and speciation between the IMPROVE ground measurement network (left pie charts with site numbers) and the WRF-CMAQ SENS model simulation (right pie charts without site numbers). The colors in each pie chart denote PM2.5 chemical compositions and the size of each pie chart denotes bulk PM2.5 concentrations as shown by the reference numbers in the legend.
Figure 8
Figure 8
PM2.5 pollution exposure evaluation and attribution. (a) multiple-cause mortality (unit: #) at the county level over the PNW in August-December of 2017; (b) county-level deaths (unit: #) attributed to the PM2.5 exposure during the 2017 fire pollution episode; (c) PM2.5 concentrations (unit: μg m−3) contributed by fire sources (CMAQ_RF minus CMAQ_CTRL); (d) PM2.5 concentrations (unit: μg m−3) contributed by non-fire sources (CMAQ_CTRL).

References

    1. Johnston F.H., Henderson S.B., Chen Y., Randerson J.T., Marlier M., DeFries R.S., Kinney P., Bowman D.M.J.S., Brauer M. Estimated global mortality attributable to smoke from landscape fires. Environ. Health Persp. 2012;120:695–701. doi: 10.1289/ehp.1104422. - DOI - PMC - PubMed
    1. Fann N., Alman B., Broome R.A., Morgan G.G., Johnston F.H., Pouliot G., Rappold A.G. The health impacts and economic value of wildland fire episodes in the U.S.: 2008–2012. Sci. Total Environ. 2018;610–611:802–809. doi: 10.1016/j.scitotenv.2017.08.024. - DOI - PMC - PubMed
    1. Liu Y.Q. Variability of wildland fire emissions across the contiguous United States. Atmos. Environ. 2004;38:3489–3499. doi: 10.1016/j.atmosenv.2004.02.004. - DOI
    1. Reid C.E., Brauer M., Johnston F.H., Jerrett M., Balmes J.R., Elliott C.T. Critical review of health impacts of wildfire smoke exposure. Environ. Health Persp. 2016;124:1334–1343. doi: 10.1289/ehp.1409277. - DOI - PMC - PubMed
    1. Kondo M.C., De Roos A.J., White L.S., Heilman W.E., Mockrin M.H., Gross-Davis C.A., Burstyn I. Meta-analysis of heterogeneity in the effects of wildfire smoke exposure on respiratory health in North America. Int. J. Environ. Res. Public Health. 2019;16:960. doi: 10.3390/ijerph16060960. - DOI - PMC - PubMed

Publication types