Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar 21;14(1):6795.
doi: 10.1038/s41598-024-54807-1.

Optimized machine learning model for air quality index prediction in major cities in India

Affiliations

Optimized machine learning model for air quality index prediction in major cities in India

Suresh Kumar Natarajan et al. Sci Rep. .

Abstract

Industrial advancements and utilization of large amount of fossil fuels, vehicle pollution, and other calamities increases the Air Quality Index (AQI) of major cities in a drastic manner. Major cities AQI analysis is essential so that the government can take proper preventive, proactive measures to reduce air pollution. This research incorporates artificial intelligence in AQI prediction based on air pollution data. An optimized machine learning model which combines Grey Wolf Optimization (GWO) with the Decision Tree (DT) algorithm for accurate prediction of AQI in major cities of India. Air quality data available in the Kaggle repository is used for experimentation, and major cities like Delhi, Hyderabad, Kolkata, Bangalore, Visakhapatnam, and Chennai are considered for analysis. The proposed model performance is experimentally verified through metrics like R-Square, RMSE, MSE, MAE, and accuracy. Existing machine learning models, like k-nearest Neighbor, Random Forest regressor, and Support vector regressor, are compared with the proposed model. The proposed model attains better prediction performance compared to traditional machine learning algorithms with maximum accuracy of 88.98% for New Delhi city, 91.49% for Bangalore city, 94.48% for Kolkata, 97.66% for Hyderabad, 95.22% for Chennai and 97.68% for Visakhapatnam city.

Keywords: Air pollution; Air quality index; Decision tree regression; Grey-wolf optimization; Machine learning; Optimization algorithm.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
AQI categories for different pollutants.
Figure 2
Figure 2
Sample data in air quality dataset for Delhi City.
Figure 3
Figure 3
Sample data in air quality dataset for Kolkata city.
Figure 4
Figure 4
Data balancing using SMOTE.
Figure 5
Figure 5
Grey wolf optimization algorithm.
Figure 6
Figure 6
Position update in grey wolf optimization algorithm.
Figure 7
Figure 7
Proposed prediction model.
Figure 8
Figure 8
Details of New Delhi city before and after preprocessing and data balancing.
Figure 9
Figure 9
Details of Bangalore city before and after preprocessing and data balancing.
Figure 10
Figure 10
Details of Chennai city before and after preprocessing and data balancing.
Figure 11
Figure 11
Details of Kolkata city before and after preprocessing and data balancing.
Figure 12
Figure 12
Details of Hyderabad city before and after preprocessing and data balancing.
Figure 13
Figure 13
Details of Visakhapatnam city before and after preprocessing and data balancing.
Figure 14
Figure 14
R-square analysis.
Figure 15
Figure 15
Error analysis.
Figure 16
Figure 16
Accuracy analysis.

Similar articles

Cited by

References

    1. Yuan Y, Yang Q, Ren J, Fan J, Shen Q, Wang X, Zhao Y. Learning-imitation strategy-assisted alpine skiing optimization for the boom of offshore drilling platform. Ocean Eng. 2023;278:114317. doi: 10.1016/j.oceaneng.2023.114317. - DOI
    1. Yuan Y, Wang S, Lv L, Song X. An adaptive resistance and stamina strategy-based dragonfly algorithm for solving engineering optimization problems. Eng. Comput. 2022;38(5):2228–2251. doi: 10.1108/EC-08-2019-0362. - DOI
    1. Yuan Y, Xiaokai Mu, Shao X, Ren J, Zhao Y, Wang Z. Optimization of an auto drum fashioned brake using the elite opposition-based learning and chaotic k-best gravitational search strategy based grey wolf optimizer algorithm. Appl. Soft Comput. 2022;123:10897. doi: 10.1016/j.asoc.2022.108947. - DOI
    1. Gladkova E, Saychenko L. Applying machine learning techniques in air quality prediction. Transport. Res. Proc. 2022;63:1999–2006. doi: 10.1016/j.trpro.2022.06.222. - DOI
    1. Zhou Y, De S, Ewa G, Perera C, Moessner K. Data-driven air quality characterization for urban environments: A case study. IEEE Access. 2018;6:77996–78006. doi: 10.1109/ACCESS.2018.2884647. - DOI