Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Nov 19:14:329.
doi: 10.1186/1471-2105-14-329.

Cheminformatic models based on machine learning for pyruvate kinase inhibitors of Leishmania mexicana

Affiliations

Cheminformatic models based on machine learning for pyruvate kinase inhibitors of Leishmania mexicana

Salma Jamal et al. BMC Bioinformatics. .

Abstract

Background: Leishmaniasis is a neglected tropical disease which affects approx. 12 million individuals worldwide and caused by parasite Leishmania. The current drugs used in the treatment of Leishmaniasis are highly toxic and has seen widespread emergence of drug resistant strains which necessitates the need for the development of new therapeutic options. The high throughput screen data available has made it possible to generate computational predictive models which have the ability to assess the active scaffolds in a chemical library followed by its ADME/toxicity properties in the biological trials.

Results: In the present study, we have used publicly available, high-throughput screen datasets of chemical moieties which have been adjudged to target the pyruvate kinase enzyme of L. mexicana (LmPK). The machine learning approach was used to create computational models capable of predicting the biological activity of novel antileishmanial compounds. Further, we evaluated the molecules using the substructure based approach to identify the common substructures contributing to their activity.

Conclusion: We generated computational models based on machine learning methods and evaluated the performance of these models based on various statistical figures of merit. Random forest based approach was determined to be the most sensitive, better accuracy as well as ROC. We further added a substructure based approach to analyze the molecules to identify potentially enriched substructures in the active dataset. We believe that the models developed in the present study would lead to reduction in cost and length of clinical studies and hence newer drugs would appear faster in the market providing better healthcare options to the patients.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Comparison of Accuracy and Balanced Classification Rate of the models generated in the present study.
Figure 2
Figure 2
Plot of Sensitivity and Specificity of models generated based on molecular descriptors.
Figure 3
Figure 3
ROC plot showing significant AUC values for Random Forest, Naive Bayes, J48 and SMO classifiers.
Figure 4
Figure 4
Molecular alignment of the 7 [[1]-[7]] enriched substructures (dark green) over the top 20 molecules of the active (1087) dataset obtained from PubChem (AID 1721).

Similar articles

Cited by

References

    1. World Health Organization. http://www.who.int/leishmaniasis/burden/magnitude/burden_magnitude/en/in....
    1. Alvar J, Yactayo S, Bern C. Leishmaniasis and poverty. Trends Parasitol. 2006;22(12):552. doi: 10.1016/j.pt.2006.09.004. - DOI - PubMed
    1. Yamey G, Torreele E. The world’s most neglected diseases. Br Med J. 2002;325:176–177. doi: 10.1136/bmj.325.7357.176. - DOI - PMC - PubMed
    1. Mathers CD, Ezzati M, Lopez AD. Measuring the burden of neglected tropical diseases: the global burden of disease framework. PLoS Negl Trop Dis. 2007;1:e114. doi: 10.1371/journal.pntd.0000114. - DOI - PMC - PubMed
    1. Alvar J, Ve’lez ID, Bern C, Herrero M, Desjeux P. Leishmaniasis worldwide and global estimates of its incidence. PLoS ONE. 2012;7(5):e35671. doi: 10.1371/journal.pone.0035671. - DOI - PMC - PubMed

Publication types

MeSH terms