Ensemble multiclassification model for aquatic toxicity of organic compounds
- PMID: 36587517
- DOI: 10.1016/j.aquatox.2022.106379
Ensemble multiclassification model for aquatic toxicity of organic compounds
Abstract
With environmental pollution becoming increasingly serious, organic compounds have become the main hazard of environmental pollution and exert substantial negative impacts on aquatic organisms. In research pertaining to the acute toxicity of organic compounds, traditional biological experimental methods are time-consuming and expensive. In addition, computer-aided binary classification models cannot accurately classify acute toxicity. Therefore, the multiclassication model is necessary for more accurate classification of acute toxicity. In this study, median lethal concentrations of 373 organic compounds in the environmental toxicology datasets ECOTOX and EAT5 were used. These chemicals were classified into four categories based on the European Economic Community criteria. Then the random forest, support vector machine, extreme gradient boosting, adaptive gradient boosting, and C5.0 decision tree algorithms and eight molecular fingerprints were used to build a multiclassification base model for the acute toxicity of organic compounds. The base models were repeated 100 times with fivefold cross-validation and external validation. The ensemble model was obtained by the voting method. The best base classifier was ExtendFP-C5.0, which had an accuracy, sensitivity and specificity values of 87.30%, 87.32% and 95.76% for external validation, and the voting ensemble model performance of 96.92%, 96.93% and 98.97%, respectively. The ensemble model achieved a higher accuracy than previously reported studies. Our study will help to further classify the acute toxicity of organic compounds to aquatic organisms and predict the hazard classes of organic compounds.
Keywords: Multiclassification model; Organic compounds, Aquatic toxicity, Ensemble machine learning approach, Ecological risk assessment.
Copyright © 2022. Published by Elsevier B.V.
Conflict of interest statement
Declaration of Competing Interest The authors declare that there is no conflict of interests regarding the publication of this article.
Similar articles
-
Multi-task aquatic toxicity prediction model based on multi-level features fusion.J Adv Res. 2025 Feb;68:477-489. doi: 10.1016/j.jare.2024.06.002. Epub 2024 Jun 4. J Adv Res. 2025. PMID: 38844122 Free PMC article.
-
Ensemble multiclassification model for predicting developmental toxicity in zebrafish.Aquat Toxicol. 2024 Jun;271:106936. doi: 10.1016/j.aquatox.2024.106936. Epub 2024 May 3. Aquat Toxicol. 2024. PMID: 38723470
-
QSAR modelling study of the bioconcentration factor and toxicity of organic compounds to aquatic organisms using machine learning and ensemble methods.Ecotoxicol Environ Saf. 2019 Sep 15;179:71-78. doi: 10.1016/j.ecoenv.2019.04.035. Epub 2019 Apr 23. Ecotoxicol Environ Saf. 2019. PMID: 31026752
-
Recommendations for Improving Methods and Models for Aquatic Hazard Assessment of Ionizable Organic Chemicals.Environ Toxicol Chem. 2020 Feb;39(2):269-286. doi: 10.1002/etc.4602. Environ Toxicol Chem. 2020. PMID: 31569266 Review.
-
Sources, pathways, and relative risks of contaminants in surface water and groundwater: a perspective prepared for the Walkerton inquiry.J Toxicol Environ Health A. 2002 Jan 11;65(1):1-142. doi: 10.1080/152873902753338572. J Toxicol Environ Health A. 2002. PMID: 11809004 Review.
Cited by
-
Progress, applications, and challenges in high-throughput effect-directed analysis for toxicity driver identification - is it time for HT-EDA?Anal Bioanal Chem. 2025 Jan;417(3):451-472. doi: 10.1007/s00216-024-05424-4. Epub 2024 Jul 12. Anal Bioanal Chem. 2025. PMID: 38992177 Free PMC article. Review.
-
Multi-task aquatic toxicity prediction model based on multi-level features fusion.J Adv Res. 2025 Feb;68:477-489. doi: 10.1016/j.jare.2024.06.002. Epub 2024 Jun 4. J Adv Res. 2025. PMID: 38844122 Free PMC article.
-
MLinvitroTox reloaded for high-throughput hazard-based prioritization of high-resolution mass spectrometry data.J Cheminform. 2025 Jan 31;17(1):14. doi: 10.1186/s13321-025-00950-4. J Cheminform. 2025. PMID: 39891244 Free PMC article.
-
Development of a Predictive Model for N-Dealkylation of Amine Contaminants Based on Machine Learning Methods.Toxics. 2024 Dec 22;12(12):931. doi: 10.3390/toxics12120931. Toxics. 2024. PMID: 39771146 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Miscellaneous