Application of random forest approach to QSAR prediction of aquatic toxicity
- PMID: 19860412
- DOI: 10.1021/ci900203n
Abstract
This work applies the random forest (RF) approach to QSAR analysis of the aquatic toxicity of chemical compounds tested on Tetrahymena pyriformis. The simplex representation of molecular structure approach, implemented in HiT QSAR Software, was used to generate descriptors at the two-dimensional level. Adequate models based on simplex descriptors and the RF statistical approach were obtained on a modeling set of 644 compounds. Model predictivity was validated on two external test sets of 339 and 110 compounds. Lipophilicity and polarizability of the investigated compounds were found to have a high impact on toxicity. RF models were shown to be tolerant both to the insertion of irrelevant descriptors and to the randomization of part of the toxicity values, which represented "noise". A fast procedure for optimizing the number of trees in the random forest is proposed. The discussed RF model had comparable or better statistical characteristics than the corresponding partial least squares (PLS) or k-nearest neighbors (KNN) models.
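The abstract does not spell out how the number of trees is optimized, so the following is only an illustrative sketch of one common way to do it, not the authors' procedure. It assumes that trees are added one at a time, that the out-of-bag (OOB) R² is tracked after each addition, and that the smallest forest whose OOB R² lies within a tolerance of the best value is selected. The data are synthetic (one informative descriptor plus three irrelevant ones), and the function names, tree depth, median-based splitting rule, and tolerance are all hypothetical choices made for brevity.

```python
import random
import statistics


def sse(y, rows):
    """Sum of squared deviations of y from its mean over the given rows."""
    mean = sum(y[i] for i in rows) / len(rows)
    return sum((y[i] - mean) ** 2 for i in rows)


def grow_tree(X, y, rows, depth, rng, n_try):
    """Grow a small regression tree; a leaf is a float, a split is a tuple."""
    mean = sum(y[i] for i in rows) / len(rows)
    if depth == 0 or len(rows) < 4:
        return mean
    best = None
    for j in rng.sample(range(len(X[0])), n_try):  # random descriptor subset
        # Simplification (assumption): only the median is tried as a cut point.
        cut = statistics.median(X[i][j] for i in rows)
        left = [i for i in rows if X[i][j] <= cut]
        right = [i for i in rows if X[i][j] > cut]
        if not left or not right:
            continue
        cost = sse(y, left) + sse(y, right)
        if best is None or cost < best[0]:
            best = (cost, j, cut, left, right)
    if best is None:
        return mean
    _, j, cut, left, right = best
    return (j, cut,
            grow_tree(X, y, left, depth - 1, rng, n_try),
            grow_tree(X, y, right, depth - 1, rng, n_try))


def predict_tree(node, x):
    """Walk from the root to a leaf and return the leaf value."""
    while isinstance(node, tuple):
        j, cut, left, right = node
        node = left if x[j] <= cut else right
    return node


def oob_r2_curve(X, y, max_trees, depth=4, n_try=3, seed=0):
    """Return the OOB R^2 after each added tree (list of length max_trees)."""
    rng = random.Random(seed)
    n = len(X)
    oob_sum, oob_cnt, curve = [0.0] * n, [0] * n, []
    for _ in range(max_trees):
        boot = [rng.randrange(n) for _ in range(n)]  # bootstrap sample
        in_bag = set(boot)
        tree = grow_tree(X, y, boot, depth, rng, n_try)
        for i in range(n):
            if i not in in_bag:  # compound was not used to grow this tree
                oob_sum[i] += predict_tree(tree, X[i])
                oob_cnt[i] += 1
        covered = [i for i in range(n) if oob_cnt[i]]
        ss_res = sum((y[i] - oob_sum[i] / oob_cnt[i]) ** 2 for i in covered)
        ss_tot = sse(y, covered)
        curve.append(1.0 - ss_res / ss_tot if ss_tot > 0 else 0.0)
    return curve


def pick_n_trees(curve, tol=0.01):
    """Smallest number of trees whose OOB R^2 is within tol of the best."""
    best = max(curve)
    for k, r2 in enumerate(curve, start=1):
        if r2 >= best - tol:
            return k


# Synthetic stand-in data: descriptor 0 drives the response, the rest are noise.
data_rng = random.Random(1)
X = [[data_rng.random() for _ in range(4)] for _ in range(200)]
y = [3.0 * row[0] + data_rng.gauss(0.0, 0.1) for row in X]

curve = oob_r2_curve(X, y, max_trees=40)
n_trees = pick_n_trees(curve)
print(n_trees, round(curve[-1], 3))
```

Because the OOB estimate is a by-product of bagging, this kind of search needs no separate validation split, which is one plausible reason a tree-count optimization can be made fast. A production workflow would more likely rely on an established RF implementation than on the toy tree learner above.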