Machine learning for air quality index (AQI) forecasting: shallow learning or deep learning?

Elham Kalantari¹, Hamid Gholami², Hossein Malakooti³, Ali Reza Nafarzadegan¹, Vahid Moosavi⁴

Affiliations

¹ Department of Natural Resources Engineering, University of Hormozgan, Bandar-Abbas, Hormozgan, Iran.
² Department of Natural Resources Engineering, University of Hormozgan, Bandar-Abbas, Hormozgan, Iran. hgholami@hormozgan.ac.ir.
³ Department of Marine and Atmospheric Science (Non-Biologic), Faculty of Marine Science and Technology, University of Hormozgan, Bandar Abbas, Iran.
⁴ Department of Watershed Management Engineering, Tarbiat Modares University, Noor, Mazandaran, Iran.

PMID: 39467867
DOI: 10.1007/s11356-024-35404-1

Machine learning for air quality index (AQI) forecasting: shallow learning or deep learning?

Elham Kalantari et al. Environ Sci Pollut Res Int. 2024 Nov.

. 2024 Nov;31(54):62962-62982.

doi: 10.1007/s11356-024-35404-1. Epub 2024 Oct 28.

Authors

Elham Kalantari¹, Hamid Gholami², Hossein Malakooti³, Ali Reza Nafarzadegan¹, Vahid Moosavi⁴

Affiliations

¹ Department of Natural Resources Engineering, University of Hormozgan, Bandar-Abbas, Hormozgan, Iran.
² Department of Natural Resources Engineering, University of Hormozgan, Bandar-Abbas, Hormozgan, Iran. hgholami@hormozgan.ac.ir.
³ Department of Marine and Atmospheric Science (Non-Biologic), Faculty of Marine Science and Technology, University of Hormozgan, Bandar Abbas, Iran.
⁴ Department of Watershed Management Engineering, Tarbiat Modares University, Noor, Mazandaran, Iran.

PMID: 39467867
DOI: 10.1007/s11356-024-35404-1

Abstract

In this study, several machine learning (ML) models consisting of shallow learning (SL) models (e.g., random forest (RF), K-nearest neighbor (KNN), weighted K-nearest neighbor (WKNN), support vector machine (SVM), artificial neural network (ANN), and deep learning (DL) models (e.g., long short-term memory (LSTM), gated recurrent unit (GRU), recurrent neural network (RNN), and convolutional neural network (CNN)) have been employed for predicting air pollution and its classification. The models were selected based on factors such as prediction accuracy, model generalization, model complexity, and training time. Our study focuses on analyzing and predicting the air quality index (AQI) using daily PM₁₀ concentration as natural pollutants and nine meteorological parameters from March 2013 to February 2022 in Zabol. We also utilized the information gain (IG) method for feature selection. Several measures including accuracy, F1 score, precision, recall, and the area under the curve (AUC), are computed to assess model performance. This study demonstrates the efficacy of DL models, particularly CNN, in predicting the AQI with remarkable accuracy. Our findings reveal that all models effectively classify air quality levels, with an AUC of 0.95 for the good class in both DL and ANN models, significantly outperforming SL models. The AUC values for the hazardous and moderate classes of DL models were also impressive, at 0.90 and 0.83, respectively, underscoring their effectiveness in critical classifications. In terms of performance, CNN achieved an accuracy of 0.60, leading the models, while RF followed closely at 0.58. RNN, GRU, ANN, and SVM each reached an accuracy of 0.57, demonstrating a competitive edge. LSTM and WKNN recorded an accuracy of 0.55, and KNN was slightly lower at 0.53. These results highlight the superior capabilities of DL models in addressing complex air quality classifications, providing invaluable insights for policymakers. By leveraging these advanced techniques, stakeholders can implement more effective strategies to combat air pollution and safeguard public health. It is worth noting that irregular monitoring of air quality data may affect the robustness of our predictions, highlighting the need for more consistent data collection to ensure an accurate representation of pollution levels.

Keywords: Air quality index; Deep learning; Feature selection; Machine learning; Zabol.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not applicable. Consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.

References

1. Aksangür İ, Eren B, Erden C (2022) Evaluation of data preprocessing and feature selection process for prediction of hourly PM10 concentration using long short-term memory models. Environ Pollut 311:119973 - DOI
1. Al-Hemoud A, Al-Dousari A, Al-Shatti A, Al-Khayat A, Behbehani W, Malak M (2018) Health impact assessment associated with exposure to PM10 and dust storms in Kuwait. Atmosphere 9(1):6 - DOI
1. Alizadeh-Choobari O, Zawar-Reza P, Sturman A (2014) The “wind of 120 days” and dust storm activity over the Sistan Basin. Atmos Res 143:328–341 - DOI
1. Almaliki AH, Derdour A, Ali E (2023) Air Quality Index (AQI) Prediction in Holy Makkah based on machine learning methods. Sustainability 15(17):13168 - DOI
1. Ameer S, Shah MA, Khan A, Song H, Maple C, Islam SU, Asghar MN (2019) Comparative analysis of machine learning techniques for predicting air quality in smart cities. IEEE Access 7:128325–128338. https://doi.org/10.1109/ACCESS.2019.2925082 - DOI

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions

LinkOut - more resources

Full Text Sources
- Springer
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Machine learning for air quality index (AQI) forecasting: shallow learning or deep learning?

Affiliations

Machine learning for air quality index (AQI) forecasting: shallow learning or deep learning?

Authors

Affiliations

Abstract

Conflict of interest statement

References

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Medical