Predicting the Skin Sensitization Potential of Small Molecules with Machine Learning Models Trained on Biologically Meaningful Descriptors
- PMID: 34451887
- PMCID: PMC8402010
- DOI: 10.3390/ph14080790
Abstract
In recent years, a number of machine learning models for predicting the skin sensitization potential of small organic molecules have been reported and made available. These models generally perform well within their applicability domains, but because they rely on molecular fingerprints and other non-intuitive descriptors, their interpretability is limited. The aim of this work is to develop a strategy for replacing the non-intuitive features with predicted outcomes of bioassays. We show that such a replacement is indeed possible and that as few as ten interpretable, predicted bioactivities are sufficient to reach competitive performance. On a holdout data set of 257 compounds, the best model ("Skin Doctor CP:Bio") obtained an efficiency of 0.82 and an MCC of 0.52 (at a significance level of 0.20). Skin Doctor CP:Bio is available free of charge for academic research. The modeling strategies explored in this work are easily transferable and could be adopted to develop more interpretable machine learning models for predicting the bioactivity and toxicity of small organic compounds.
Keywords: bioactivity descriptors; conformal prediction; in silico prediction; machine learning; random forest; skin sensitization; toxicity prediction.
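The abstract reports efficiency and MCC at a significance level of 0.20, both of which are metrics for a conformal predictor. The following is a minimal sketch of how such figures are commonly computed for a binary classifier, not the authors' implementation. It assumes that efficiency is the fraction of compounds that receive a single-label prediction set and that MCC is evaluated on those single-label predictions only. The function names and the toy p-values are hypothetical.

```python
import math


def prediction_set(p_values, significance):
    """Return the labels whose conformal p-value exceeds the significance level."""
    return {label for label, p in p_values.items() if p > significance}


def efficiency_and_mcc(p_value_rows, true_labels, significance=0.20):
    """Compute efficiency and MCC for a binary conformal predictor.

    Assumptions (not taken from the source):
    - efficiency = share of compounds with exactly one label in the set
    - MCC is computed only on those single-label predictions
    """
    tp = tn = fp = fn = 0
    for p_values, truth in zip(p_value_rows, true_labels):
        labels = prediction_set(p_values, significance)
        if len(labels) != 1:
            continue  # empty or two-label sets count as inefficient
        predicted = next(iter(labels))
        if predicted == 1 and truth == 1:
            tp += 1
        elif predicted == 0 and truth == 0:
            tn += 1
        elif predicted == 1 and truth == 0:
            fp += 1
        else:
            fn += 1

    n_single = tp + tn + fp + fn
    efficiency = n_single / len(true_labels) if true_labels else 0.0

    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denominator if denominator else 0.0
    return efficiency, mcc


# Hypothetical p-values for eight compounds (0 = non-sensitizer, 1 = sensitizer).
toy_p_values = [
    {0: 0.05, 1: 0.90},
    {0: 0.80, 1: 0.10},
    {0: 0.50, 1: 0.40},  # both labels retained
    {0: 0.10, 1: 0.15},  # no label retained
    {0: 0.03, 1: 0.70},
    {0: 0.60, 1: 0.02},
    {0: 0.10, 1: 0.95},
    {0: 0.90, 1: 0.05},
]
toy_truth = [1, 0, 1, 0, 0, 0, 1, 0]

toy_efficiency, toy_mcc = efficiency_and_mcc(toy_p_values, toy_truth, significance=0.20)
print(f"efficiency={toy_efficiency:.2f}, MCC={toy_mcc:.2f}")
```

Lowering the significance level in this sketch keeps more labels in each prediction set, which typically reduces efficiency while lowering the tolerated error rate.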
Conflict of interest statement
A.W. is funded by Beiersdorf AG through HITeC e.V., and J.K. (Jochen Kühnl) is employed at Beiersdorf AG.