Three new consensus QSAR models for the prediction of Ames genotoxicity
- PMID: 15388809
- DOI: 10.1093/mutage/geh043
Three new consensus QSAR models for the prediction of Ames genotoxicity
Abstract
Three QSAR methods, artificial neural net (ANN), k-nearest neighbors (kNN), and Decision Forest (DF), were applied to 3363 diverse compounds tested for their Ames genotoxicity. The ratio of mutagens to non-mutagens was 60/40 for this dataset. This group of compounds includes >300 therapeutic drugs. All models were developed using the same initial set of 148 topological indices: molecular connectivity chi indices and electrotopological state indices (atom-type, bond-type and group-type E-state), as well as binary indicators. While previous studies have found logP to be a determining factor in genotoxicity, it was not found to be important by any modeling method employed in this study. The three models yielded an average training/test concordance value of 88%, with a low percentage of false positives and false negatives. External validation testing on 400 compounds not used for QSAR model development gave an average concordance of 82%. This value increased to 92% upon removal of less reliable outcomes, as determined by a reliability criterion used within each model. The ANN model showed the best performance in predicting drug compounds, yielding 97% concordance (34/35 drugs) after the removal of less reliable predictions. The appreciable commonality found among the top 10 ranked descriptors from each model is of particular interest because of the diversity in the learning algorithms and descriptor selection techniques employed in this study. Forty percent of the most important descriptors in any one model are found in one or two other models. Fourteen of the most important descriptors relate directly to known toxicophores involved in potent genotoxic responses in Salmonella typhimurium. A comparison of the validation results with those of MULTICASE and DEREK indicated that the new models presented in this work perform substantially better than the former models in predicting genotoxicity of therapeutic drugs. Substantially higher specificity was achieved with these new models as compared with MULTICASE or DEREK with comparable sensitivities among all models.
Similar articles
-
Assessment of the sensitivity of the computational programs DEREK, TOPKAT, and MCASE in the prediction of the genotoxicity of pharmaceutical molecules.Environ Mol Mutagen. 2004;43(3):143-58. doi: 10.1002/em.20013. Environ Mol Mutagen. 2004. PMID: 15065202
-
Predicting the genotoxicity of secondary and aromatic amines using data subsetting to generate a model ensemble.J Chem Inf Comput Sci. 2003 May-Jun;43(3):949-63. doi: 10.1021/ci034013i. J Chem Inf Comput Sci. 2003. PMID: 12767154
-
In silico screening of chemicals for bacterial mutagenicity using electrotopological E-state indices and MDL QSAR software.Regul Toxicol Pharmacol. 2005 Dec;43(3):313-23. doi: 10.1016/j.yrtph.2005.09.001. Epub 2005 Oct 19. Regul Toxicol Pharmacol. 2005. PMID: 16242226
-
QSAR modeling based on structure-information for properties of interest in human health.SAR QSAR Environ Res. 2005 Feb-Apr;16(1-2):13-41. doi: 10.1080/10629360412331319853. SAR QSAR Environ Res. 2005. PMID: 15844441 Review.
-
Neural networks in building QSAR models.Methods Mol Biol. 2008;458:137-58. Methods Mol Biol. 2008. PMID: 19065809 Review.
Cited by
-
Using MD Simulations To Calculate How Solvents Modulate Solubility.J Chem Theory Comput. 2016 Apr 12;12(4):1930-41. doi: 10.1021/acs.jctc.5b00934. Epub 2016 Mar 2. J Chem Theory Comput. 2016. PMID: 26878198 Free PMC article.
-
Prospecting for Cressa cretica to treat COVID-19 via in silico molecular docking models of the SARS-CoV-2.J Biomol Struct Dyn. 2022 Aug;40(12):5643-5652. doi: 10.1080/07391102.2021.1872419. Epub 2021 Jan 15. J Biomol Struct Dyn. 2022. PMID: 33446077 Free PMC article.
-
Application 2D Descriptors and Artificial Neural Networks for Beta-Glucosidase Inhibitors Screening.Molecules. 2020 Dec 15;25(24):5942. doi: 10.3390/molecules25245942. Molecules. 2020. PMID: 33333961 Free PMC article.
-
Prediction of acute mammalian toxicity using QSAR methods: a case study of sulfur mustard and its breakdown products.Molecules. 2012 Jul 27;17(8):8982-9001. doi: 10.3390/molecules17088982. Molecules. 2012. PMID: 22842643 Free PMC article.
-
QSPRmodeler - An open source application for molecular predictive analytics.Front Bioinform. 2024 Sep 23;4:1441024. doi: 10.3389/fbinf.2024.1441024. eCollection 2024. Front Bioinform. 2024. PMID: 39391332 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous