Comparing Multiple Machine Learning Algorithms and Metrics for Estrogen Receptor Binding Prediction

Mol Pharm. 2018 Oct 1;15(10):4361-4370.

doi: 10.1021/acs.molpharmaceut.8b00546. Epub 2018 Aug 28.

Daniel P Russo¹ ², Kimberley M Zorn¹, Alex M Clark³, Hao Zhu², Sean Ekins¹

Affiliations

¹ Collaborations Pharmaceuticals, Inc., 840 Main Campus Drive, Lab 3510, Raleigh, North Carolina 27606, United States.
² The Rutgers Center for Computational and Integrative Biology, Camden, New Jersey 08102, United States.
³ Molecular Materials Informatics, Inc., Montreal, Quebec H3J 2S1, Canada.

PMID: 30114914
PMCID: PMC6181119
DOI: 10.1021/acs.molpharmaceut.8b00546

Daniel P Russo et al. Mol Pharm. 2018.

Abstract

Many chemicals that disrupt endocrine function have been linked to a variety of adverse biological outcomes. However, screening for endocrine disruption using in vitro or in vivo approaches is costly and time-consuming. Computational methods, e.g., quantitative structure-activity relationship models, have become more reliable due to larger training sets, increased computing power, and advanced machine learning algorithms, such as multilayered artificial neural networks. Machine learning models can be used to predict whether compounds have endocrine-disrupting capabilities, such as binding to the estrogen receptor (ER), allowing compounds to be prioritized for further testing. In this work, an exhaustive comparison of multiple machine learning algorithms, chemical spaces, and evaluation metrics for ER binding was performed on public data sets curated using in-house cheminformatics software (Assay Central). Chemical features utilized in modeling consisted of binary fingerprints (ECFP6, FCFP6, ToxPrint, or MACCS keys) and continuous molecular descriptors from RDKit. Each feature set was subjected to classic machine learning algorithms (Bernoulli Naive Bayes, AdaBoost Decision Tree, Random Forest, Support Vector Machine) and Deep Neural Networks (DNN). Models were evaluated using a variety of metrics: recall, precision, F1-score, accuracy, area under the receiver operating characteristic curve, Cohen's Kappa, and Matthews correlation coefficient. For predicting compounds within the training set, DNN had higher accuracy than the other methods; however, in 5-fold cross validation and external test set predictions, DNN and most classic machine learning models performed similarly regardless of the data set or molecular descriptors used. We also used rank normalized scores as a performance criterion for each machine learning method, and Random Forest performed best on the validation set when ranked by metric or by data set. These results suggest classic machine learning algorithms may be sufficient to develop high-quality predictive models of ER activity.
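The evaluation metrics named in the abstract all have standard textbook definitions, so a small sketch may help make them concrete. The code below is not the authors' implementation: the function names `classification_metrics` and `roc_auc` and the example counts are assumptions made for illustration. It computes the threshold-based metrics from binary confusion-matrix counts, and the area under the ROC curve from predicted scores using the pairwise (Mann-Whitney) formulation.

```python
import math


def classification_metrics(tp, fp, tn, fn):
    """Threshold-based metrics from binary confusion-matrix counts.

    Hypothetical helper for illustration; uses the standard definitions.
    """
    n = tp + fp + tn + fn
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    accuracy = (tp + tn) / n

    # Cohen's kappa: observed agreement corrected for chance agreement.
    p_chance = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / (n * n)
    kappa = (accuracy - p_chance) / (1 - p_chance) if p_chance != 1 else 0.0

    # Matthews correlation coefficient; defined as 0 when a margin is empty.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0

    return {
        "recall": recall,
        "precision": precision,
        "f1": f1,
        "accuracy": accuracy,
        "kappa": kappa,
        "mcc": mcc,
    }


def roc_auc(labels, scores):
    """Area under the ROC curve as the fraction of (active, inactive)
    pairs in which the active compound receives the higher score.
    Ties count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one active and one inactive label")
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Made-up counts, not data from the study.
    print(classification_metrics(tp=40, fp=10, tn=45, fn=5))
    print(roc_auc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1]))
```

Unlike the other metrics, Cohen's Kappa and the Matthews correlation coefficient range from -1 to 1, which matters when metrics are compared on a common scale.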

Keywords: Bayesian; deep learning; estrogen receptor; machine learning; support vector machine.

Conflict of interest statement

Competing interests:

S.E. is the owner, D.P.R. and K.M.Z. are employees, and A.M.C. is a consultant of Collaborations Pharmaceuticals, Inc.

Figures

**Figure 1.**
Five-fold cross validation statistics for all eight datasets in this study organized by datasets (rows) and descriptors (columns). The green bar represents the best performing algorithm, measured by ACC, for a particular dataset-chemical space pair. All metrics were range scaled to [0, 1]. AdaBoost (ADA); Bernoulli Naïve-Bayes (BNB); Random Forest (RF); support vector classification (SVC); Deep Neural Networks (DNN).
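The caption states that all metrics were range scaled to [0, 1] but does not say how. A minimal sketch of one way to do this follows, assuming each metric is min-max scaled using its theoretical bounds, so that Cohen's Kappa and the Matthews correlation coefficient are mapped from [-1, 1] onto [0, 1]. The names `range_scale`, `scale_metrics`, and `THEORETICAL_RANGES` are hypothetical, and the study may instead have scaled against observed minima and maxima.

```python
# Assumed theoretical bounds for each metric (not stated in the caption).
THEORETICAL_RANGES = {
    "recall": (0.0, 1.0),
    "precision": (0.0, 1.0),
    "f1": (0.0, 1.0),
    "accuracy": (0.0, 1.0),
    "auc": (0.0, 1.0),
    "kappa": (-1.0, 1.0),
    "mcc": (-1.0, 1.0),
}


def range_scale(value, lower, upper):
    """Min-max scale a value from [lower, upper] onto [0, 1]."""
    if upper <= lower:
        raise ValueError("upper bound must exceed lower bound")
    return (value - lower) / (upper - lower)


def scale_metrics(metrics):
    """Scale every metric in a dict using its assumed theoretical range."""
    return {
        name: range_scale(value, *THEORETICAL_RANGES[name])
        for name, value in metrics.items()
    }


if __name__ == "__main__":
    # Made-up values, not results from the study.
    print(scale_metrics({"accuracy": 0.85, "kappa": 0.7, "mcc": -1.0}))
```

Under this assumption, metrics that already lie in [0, 1] are unchanged, while a kappa of 0.7 becomes 0.85.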

**Figure 2.**
Validation set statistics for all eight datasets in this study. AdaBoost (ADA); Bernoulli Naive Bayes (BNB); Random Forest (RF); support vector classification (SVC); Deep Neural Networks (DNN).

**Figure 3.**
Comparison of Assay Central model with other machine learning algorithms using a rank normalized score for the evaluation set. Assay Central (AC); AdaBoost (ADA); Bernoulli Naïve-Bayes (BNB); Random Forest (RF); support vector classification (SVC); Deep Neural Networks (DNN).
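The caption does not define the rank normalized score. One plausible reading, sketched below purely as an assumption, is that each model's metrics are first scaled to [0, 1], then averaged into a single score, and models are ordered by that score. The function names, the model labels `model_a` to `model_c`, and all numbers are hypothetical placeholders, not results from the study.

```python
from statistics import mean


def rank_normalized_score(scaled_metrics):
    """Assumed definition: the mean of a model's [0, 1]-scaled metrics."""
    return mean(scaled_metrics.values())


def rank_models(results):
    """Return (model, score) pairs sorted from best to worst score."""
    scores = {
        name: rank_normalized_score(metrics)
        for name, metrics in results.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Placeholder values, already scaled to [0, 1].
    results = {
        "model_a": {"accuracy": 0.80, "auc": 0.90, "mcc": 0.70},
        "model_b": {"accuracy": 0.78, "auc": 0.88, "mcc": 0.68},
        "model_c": {"accuracy": 0.70, "auc": 0.75, "mcc": 0.65},
    }
    for name, score in rank_models(results):
        print(f"{name}: {score:.3f}")
```

Averaging several scaled metrics gives a single number per model, which makes it easier to compare algorithms across datasets than reading each metric separately.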

Cited by

Machine Learning Models Identify New Inhibitors for Human OATP1B1.
Lane TR, Urbina F, Zhang X, Fye M, Gerlach J, Wright SH, Ekins S. Mol Pharm. 2022 Nov 7;19(11):4320-4332. doi: 10.1021/acs.molpharmaceut.2c00662. Epub 2022 Oct 21. PMID: 36269563. Free PMC article.
Predictive modeling of estrogen receptor agonism, antagonism, and binding activities using machine- and deep-learning approaches.
Ciallella HL, Russo DP, Aleksunes LM, Grimm FA, Zhu H. Lab Invest. 2021 Apr;101(4):490-502. doi: 10.1038/s41374-020-00477-2. Epub 2020 Aug 10. PMID: 32778734. Free PMC article.
Staying Ahead of the Game: How SARS-CoV-2 has Accelerated the Application of Machine Learning in Pandemic Management.
Williams AH, Zhan CG. BioDrugs. 2023 Sep;37(5):649-674. doi: 10.1007/s40259-023-00611-8. Epub 2023 Jul 18. PMID: 37464099. Review.
Déjà vu: Stimulating open drug discovery for SARS-CoV-2.
Ekins S, Mottin M, Ramos PRPS, Sousa BKP, Neves BJ, Foil DH, Zorn KM, Braga RC, Coffee M, Southan C, Puhl AC, Andrade CH. Drug Discov Today. 2020 May;25(5):928-941. doi: 10.1016/j.drudis.2020.03.019. Epub 2020 Apr 19. PMID: 32320852. Free PMC article. Review.
Comparing Machine Learning Algorithms for Predicting Drug-Induced Liver Injury (DILI).
Minerali E, Foil DH, Zorn KM, Lane TR, Ekins S. Mol Pharm. 2020 Jul 6;17(7):2628-2637. doi: 10.1021/acs.molpharmaceut.0c00326. Epub 2020 Jun 8. PMID: 32422053. Free PMC article.
