A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery
- PMID: 36228001
- PMCID: PMC9624395
- DOI: 10.1371/journal.pcbi.1010613
A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery
Abstract
Screening for novel antibacterial compounds in small molecule libraries has a low success rate. We applied machine learning (ML)-based virtual screening for antibacterial activity and evaluated its predictive power by experimental validation. We first binarized 29,537 compounds according to their growth inhibitory activity (hit rate 0.87%) against the antibiotic-resistant bacterium Burkholderia cenocepacia and described their molecular features with a directed-message passing neural network (D-MPNN). Then, we used the data to train an ML model that achieved a receiver operating characteristic (ROC) score of 0.823 on the test set. Finally, we predicted antibacterial activity in virtual libraries corresponding to 1,614 compounds from the Food and Drug Administration (FDA)-approved list and 224,205 natural products. Hit rates of 26% and 12%, respectively, were obtained when we tested the top-ranked predicted compounds for growth inhibitory activity against B. cenocepacia, which represents at least a 14-fold increase from the previous hit rate. In addition, more than 51% of the predicted antibacterial natural compounds inhibited ESKAPE pathogens showing that predictions expand beyond the organism-specific dataset to a broad range of bacteria. Overall, the developed ML approach can be used for compound prioritization before screening, increasing the typical hit rate of drug discovery.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures






References
-
- CDC. Antibiotic Resistance Threats in the United States. Atlanta, GA: U.S. Department of Health and Human Services, CDC; 2019.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources