A land use regression model using machine learning and locally developed low cost particulate matter sensors in Uganda
- PMID: 34043968
- DOI: 10.1016/j.envres.2021.111352
A land use regression model using machine learning and locally developed low cost particulate matter sensors in Uganda
Abstract
The application of land use regression (LUR) modeling for estimating air pollution exposure has been used only rarely in sub-Saharan Africa (SSA). This is generally due to a lack of air quality monitoring networks in the region. Low cost air quality sensors developed locally in sub-Saharan Africa presents a sustainable operating mechanism that may help generate the air monitoring data needed for exposure estimation of air pollution with LUR models. The primary objective of our study is to investigate whether a network of locally developed low-cost air quality sensors can be used in LUR modeling for accurately predicting monthly ambient fine particulate matter (PM2.5) air pollution in urban areas of central and eastern Uganda. Secondarily, we aimed to explore whether the application of machine learning (ML) can improve LUR predictions compared to ordinary least squares (OLS) regression. We used data for the entire year of 2020 from a network of 23 PM2.5 low-cost sensors located in urban municipalities of eastern and central Uganda. Between January 1, 2020 and December 31, 2020, these sensors collected highly time-resolved measurement data of PM2.5 air concentrations. We used monthly-averaged PM2.5 concentration data for LUR prediction modeling of monthly PM2.5 concentrations. We used eight different ML base-learner algorithms as well as ensemble modeling. We applied 5-fold cross validation (80% training/20% test random splits) to evaluate the models with resampling and Root mean squared error (RMSE). The relative explanatory power and accuracy of the ML algorithms were evaluated by comparing coefficient of determination (R2) and RMSE, using OLS as the reference approach. The overall average PM2.5 concentration during the study period was 52.22 μg/m3 (IQR: 38.11, 62.84 μg/m3)-well above World Health Organization PM2.5 ambient air guidelines. From the base-learner and ensemble models, RMSE and R2 values ranged between 7.65 μg/m3 - 16.85 μg/m3 and 0.24-0.84, respectively. Extreme gradient boosting (xgbTree) performed best out of the base learner algorithms (R2 = 0.84; RMSE = 7.65 μg/m3). Model performance from ensemble modeling with Lasso and Elastic-Net Regularized Generalized Linear Models (glmnet) did not outperform xgbTree, but prediction performance was comparable to that of xgbTree. The most important temporal and spatial predictors of monthly PM2.5 levels were monthly precipitation, percent of the population using solid fuels for cooking, distance to Lake Victoria, and greenspace (NDVI) within a 500-m buffer of air monitors. In conclusion, data from locally developed low-cost PM sensors provide evidence that they can be used for spatio-temporal prediction modeling of air pollution exposures in Uganda. Moreover, the non-parametric ML and ensemble approaches to LUR modeling clearly outperformed OLS regression algorithm for the prediction of monthly PM2.5 concentrations. Deploying low-cost air quality sensors in concert with implementation of data quality control measures, can help address the critical need for expanding and improving air quality monitoring in resource-constrained settings of sub-Saharan Africa. These low-cost sensors, in conjunction with non-parametric ML algorithms, may provide a rapid path forward for PM2.5 exposure assessment and to spur air pollution epidemiology research in the region.
Keywords: Land use regression; Low-cost sensors; Machine learning; Particulate matter.
Copyright © 2021 Elsevier Inc. All rights reserved.
Similar articles
-
Comparison of Long-Term Air Pollution Exposure from Mobile and Routine Monitoring, Low-Cost Sensors, and Dispersion Models.Res Rep Health Eff Inst. 2025 Mar;2025(226):1-101. Res Rep Health Eff Inst. 2025. PMID: 40405483 Free PMC article.
-
Evaluating heterogeneity in indoor and outdoor air pollution using land-use regression and constrained factor analysis.Res Rep Health Eff Inst. 2010 Dec;(152):5-80; discussion 81-91. Res Rep Health Eff Inst. 2010. PMID: 21409949
-
Assessment and statistical modeling of the relationship between remotely sensed aerosol optical depth and PM2.5 in the eastern United States.Res Rep Health Eff Inst. 2012 May;(167):5-83; discussion 85-91. Res Rep Health Eff Inst. 2012. PMID: 22838153
-
A comprehensive review of the development of land use regression approaches for modeling spatiotemporal variations of ambient air pollution: A perspective from 2011 to 2023.Environ Int. 2024 Jan;183:108430. doi: 10.1016/j.envint.2024.108430. Epub 2024 Jan 7. Environ Int. 2024. PMID: 38219544 Review.
-
Low-Cost Particulate Matter Mass Sensors: Review of the Status, Challenges, and Opportunities for Single-Instrument and Network Calibration.ACS Sens. 2025 May 23;10(5):3207-3221. doi: 10.1021/acssensors.4c03293. Epub 2025 May 7. ACS Sens. 2025. PMID: 40331533 Review.
Cited by
-
High-Resolution Urban Air Quality Mapping for Multiple Pollutants Based on Dense Monitoring Data and Machine Learning.Int J Environ Res Public Health. 2022 Jun 29;19(13):8005. doi: 10.3390/ijerph19138005. Int J Environ Res Public Health. 2022. PMID: 35805664 Free PMC article.
-
Use of biomass fuels predicts indoor particulate matter and carbon monoxide concentrations; evidence from an informal urban settlement in Fort Portal city, Uganda.BMC Public Health. 2022 Sep 12;22(1):1723. doi: 10.1186/s12889-022-14015-w. BMC Public Health. 2022. PMID: 36089579 Free PMC article.
-
Performance Assessment of Two Low-Cost PM2.5 and PM10 Monitoring Networks in the Padana Plain (Italy).Sensors (Basel). 2024 Jun 18;24(12):3946. doi: 10.3390/s24123946. Sensors (Basel). 2024. PMID: 38931730 Free PMC article.
-
Ambient PM2.5 temporal variation and source apportionment in Mbarara, Uganda.Aerosol Air Qual Res. 2024 Apr;24:230203. doi: 10.4209/aaqr.230203. Epub 2024 Jan 5. Aerosol Air Qual Res. 2024. PMID: 38947180 Free PMC article.
-
Air pollution and mobility patterns in two Ugandan cities during COVID-19 mobility restrictions suggest the validity of air quality data as a measure for human mobility.Environ Sci Pollut Res Int. 2023 Mar;30(12):34856-34871. doi: 10.1007/s11356-022-24605-1. Epub 2022 Dec 15. Environ Sci Pollut Res Int. 2023. PMID: 36520281 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Research Materials