Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 16;15(1):17087.
doi: 10.1038/s41598-025-00911-9.

Novel hybrid data-driven modeling based on feature space reconstruction and multihead self-attention gated recurrent unit: applied to PM2.5 concentrations prediction

Affiliations

Novel hybrid data-driven modeling based on feature space reconstruction and multihead self-attention gated recurrent unit: applied to PM2.5 concentrations prediction

Xiaoxin Yue et al. Sci Rep. .

Abstract

In response to the problem of neglecting the periodic and global characteristics of sequence data when predicting PM2.5 concentrations via machine learning models, a PM2.5 concentrations prediction model based on feature space reconstruction and multihead self-attention gated recurrent unit (FSR-MSAGRU) is proposed in this study. First, the raw sequence data are subjected to frequency spectrum analysis to determine the period value of the PM2.5 sequence data. Subsequently, the seasonal trend decomposition procedure based on loess (STL) is employed to capture the periodicity and trend information in the PM2.5 sequence data. Then, the feature space of the PM2.5 sequence data is reconstructed using the raw PM2.5 sequence data, decomposed seasonal components, trend components, and residual components. Finally, the reconstructed feature data are input into multihead self-attention gated recurrent unit (MSAGRU) with the ability to capture global feature information to predict PM2.5 concentrations. Favorable prediction results were attained by the proposed FSR-MSAGRU model across 6 distinct experimental datasets, with a PCC exceeding 0.98 and a decrease in the prediction accuracy metric SMAPE of at least 68% compared to that of the GRU model. Comparative experimental results with 13 reference models demonstrate that the proposed model exhibits better prediction performances and stronger generalization abilities.

Keywords: Feature space reconstruction; Gated recurrent unit; Machine learning; Multihead Self-attention; PM2.5 concentration prediction.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Locations of cities for data collection.
Fig. 2
Fig. 2
Distribution of the dataset samples.
Fig. 3
Fig. 3
Flowchart of the data processing.
Fig. 4
Fig. 4
Frequency spectrum of PM2.5 in different datasets.
Fig. 5
Fig. 5
Reconstructed feature spaces in the different datasets.
Fig. 6
Fig. 6
Framework of the FSR-MSAGRU model.
Fig. 7
Fig. 7
Comparison curves of the ablation experiment prediction results.
Fig. 8
Fig. 8
Comparison curves and scatter plots of the single models, hybrid models, and FSR-MSAGRU model prediction results (Lanzhou).
Fig. 9
Fig. 9
Performance metrics of the different models on various datasets.
Fig. 10
Fig. 10
Taylor diagram of the prediction results of the different models on the various datasets.

Similar articles

References

    1. Wu, Z. et al. Prediction of air pollutant concentrations based on the long short-term memory neural network. J. Hazard. Mater.465, 133099. 10.1016/j.jhazmat.2023.133099 (2024). - PubMed
    1. Bryan, L. & Landrigan, P. PM(2.5) pollution in Texas: a Geospatial analysis of health impact functions. Front. Public. Health. 11, 1286755. 10.3389/fpubh.2023.1286755 (2023). - PMC - PubMed
    1. Kim, K. H., Kabir, E. & Kabir, S. A review on the human health impact of airborne particulate matter. Environ. Int.74, 136–143. 10.1016/j.envint.2014.10.005 (2015). - PubMed
    1. Hill, W. et al. Lung adenocarcinoma promotion by air pollutants. Nature616, 159–167. 10.1038/s41586-023-05874-3 (2023). - PMC - PubMed
    1. Manisalidis, I., Stavropoulou, E., Stavropoulos, A. & Bezirtzoglou, E. Environmental and health impacts of air pollution: A review. Front. Public. Health. 8, 14. 10.3389/fpubh.2020.00014 (2020). - PMC - PubMed

LinkOut - more resources