Utilizing high throughput screening data for predictive toxicology models: protocols and application to MLSCN assays
- PMID: 18283419
- DOI: 10.1007/s10822-008-9192-9
Abstract
Computational toxicology is emerging as an encouraging alternative to experimental testing. The Molecular Libraries Screening Center Network (MLSCN), as part of the NIH Molecular Libraries Roadmap, has recently started generating large and diverse screening datasets, which are publicly available in PubChem. In this report, we investigate various aspects of developing computational models to predict cell toxicity based on cell proliferation screening data generated in the MLSCN. By capturing feature-based information in those datasets, such predictive models would be useful in evaluating cell-based screening results in general (for example, from reporter assays) and could be used as an aid to identify and eliminate potentially undesired compounds. Specifically, we present the results of random forest ensemble models developed using different cell proliferation datasets and highlight protocols that take into account their extremely imbalanced nature. Depending on the nature of the datasets and the descriptors employed, we were able to achieve percentage correct classification rates between 70% and 85% on the prediction set, though the accuracy rate dropped significantly when the models were applied to in vivo data. In this context, we also compare the MLSCN cell proliferation results with animal acute toxicity data to investigate to what extent animal toxicity can be correlated with, and potentially predicted by, proliferation results. Finally, we present a visualization technique that allows one to compare a new dataset to the training set of the models to decide whether the new dataset may be reliably predicted.
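The abstract does not spell out how the imbalance was handled, so the following is only a minimal sketch of one common protocol, not the authors' method: undersample the majority class so that each ensemble member trains on a balanced subset, combine the members by majority vote, and report accuracy per class instead of overall. The function names `balanced_subsets`, `majority_vote`, and `per_class_accuracy`, and the 950/50 class split, are hypothetical and chosen for illustration.

```python
import random
from collections import Counter


def balanced_subsets(labels, n_subsets, seed=0):
    """Return index lists in which every class is undersampled
    to the size of the rarest class (illustrative protocol only)."""
    rng = random.Random(seed)
    by_class = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)
    minority_size = min(len(indices) for indices in by_class.values())
    subsets = []
    for _ in range(n_subsets):
        subset = []
        for indices in by_class.values():
            # Sample without replacement within each class.
            subset.extend(rng.sample(indices, minority_size))
        rng.shuffle(subset)
        subsets.append(subset)
    return subsets


def majority_vote(votes_per_model):
    """Combine per-model prediction lists into one label per compound."""
    return [Counter(column).most_common(1)[0][0]
            for column in zip(*votes_per_model)]


def per_class_accuracy(y_true, y_pred):
    """Fraction of correctly classified compounds within each class."""
    totals = Counter(y_true)
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {label: correct[label] / totals[label] for label in totals}


# Hypothetical screen: 950 non-toxic (0) and 50 toxic (1) compounds.
labels = [0] * 950 + [1] * 50
subsets = balanced_subsets(labels, n_subsets=5)

# A model that always predicts "non-toxic" looks accurate overall
# but never identifies a toxic compound.
naive = [0] * len(labels)
print(per_class_accuracy(labels, naive))
```

In this toy setup the naive predictor is correct for 95% of compounds while missing every toxic one, which is why per-class rates are more informative than a single overall rate on imbalanced screening data. Each index list in `subsets` could be used to train one ensemble member (for example, one random forest), with `majority_vote` combining their predictions.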