Identifying the key factors of mercury exposure in residents of southwestern Iran using machine learning algorithms
- PMID: 40450190
- DOI: 10.1007/s10653-025-02533-6
Identifying the key factors of mercury exposure in residents of southwestern Iran using machine learning algorithms
Abstract
It is necessary to predict hair mercury (Hg) levels and specify the related effective factors to develop preventive strategies to reduce Hg exposure in different regions. This study is the first effort to investigate the effectiveness of eight machine learning (ML) models (including multiple linear regression, decision tree regression, least absolute shrinkage and selection operator, multivariate adaptive regression splines, random forest, extreme gradient boosting, K-nearest neighbor, and Gaussian process) for predicting hair Hg levels and identifying the most important factors affecting them in residents of southwestern Iran. All ML models were trained with 70% of the dataset and their performance was evaluated using the determination coefficient (R2), root mean square error (RMSE), and mean absolute error (MAE) based on the remaining dataset. Finally, the Permutation Feature Importance (PFI) method was used to determine the relative importance (RI) of influencing factors. Mean hair Hg (3.31 µg g⁻1) was higher than the United States Environmental Protection Agency (US EPA) and World Health Organization (WHO) limits. It was indicated a high exposure risk for some people in this region. The extreme gradient boosting (XGB) model outperformed other algorithms in modeling hair Hg levels, with R2 = 0.61, RMSE = 2.2, and MAE = 1.25. According to the PFI analysis, weight (RI: 43.4%) and geographic place (RI: 41.8%) were found as the most important demographic factors influencing Hg variation in the study population. Additionally, occupation (RI: 46.1%) and the frequency of fish and canned fish consumption (RI: 22%) were identified as the most significant exposure factors controlling hair Hg variability in southwestern Iran. These findings can be useful for formulating appropriate strategies to reduce the health risk of Hg exposure and improve human health.
Keywords: Exposure factors; Hair mercury; Health risk; Machine learning; Southwest of Iran.
© 2025. The Author(s), under exclusive licence to Springer Nature B.V.
Conflict of interest statement
Declarations. Conflict of interest: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. Ethical approval: All procedures performed in this study involving human participants were in accordance with the ethical standards of the institutional research committee and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards. Consent to participate: Before the collection of hair samples, the objectives and protocol of the research were explained to participants and informed consent was obtained from all individual participants included in the study. Consent for publication: All authors have given their permission to publish this work.
References
-
- Abdolvand, S., Esfahani, S. K., & Dmirchi, S. (2014). Mercury (Hg) and methyl mercury (MMHg) bioaccumulation in three fish species (sea food) from Persian Gulf. Toxicology and Environmental Health Sciences, 6, 192–198. - DOI
-
- Abed, M. S., Kadhim, F. J., Almusawi, J. K., Imran, H., Bernardo, L. F. A., & Henedy, S. N. (2023). Utilizing multivariate adaptive regression splines (MARS) for precise estimation of soil compaction parameters. Applied Sciences, 13, 11634. - DOI
-
- Abood, H. G., & Salman, G. A. (2021). Utilizing variance inflation factor for electricity demand forecasting. In Proceedings of Symposium on Power Electronic and Renewable Energy Systems Control: PERESC 2020 (pp. 365-375). Springer.
-
- Alghazzawi, D. M., Alquraishee, A. G. A., Badri, S. K., & Hasan, S. H. (2023). ERF-XGB: Ensemble random forest-based XG boost for accurate prediction and classification of e-commerce product review. Sustainability, 15(9), 7076. - DOI
-
- Alin, A. (2010). Multicollinearity. Wiley Interdisciplinary Reviews: Computational Statistics, 2, 370–374. - DOI
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
Research Materials