Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Dec 13:6:38860.
doi: 10.1038/srep38860.

Improved genome-scale multi-target virtual screening via a novel collaborative filtering approach to cold-start problem

Affiliations

Improved genome-scale multi-target virtual screening via a novel collaborative filtering approach to cold-start problem

Hansaim Lim et al. Sci Rep. .

Abstract

Conventional one-drug-one-gene approach has been of limited success in modern drug discovery. Polypharmacology, which focuses on searching for multi-targeted drugs to perturb disease-causing networks instead of designing selective ligands to target individual proteins, has emerged as a new drug discovery paradigm. Although many methods for single-target virtual screening have been developed to improve the efficiency of drug discovery, few of these algorithms are designed for polypharmacology. Here, we present a novel theoretical framework and a corresponding algorithm for genome-scale multi-target virtual screening based on the one-class collaborative filtering technique. Our method overcomes the sparseness of the protein-chemical interaction data by means of interaction matrix weighting and dual regularization from both chemicals and proteins. While the statistical foundation behind our method is general enough to encompass genome-wide drug off-target prediction, the program is specifically tailored to find protein targets for new chemicals with little to no available interaction data. We extensively evaluate our method using a number of the most widely accepted gene-specific and cross-gene family benchmarks and demonstrate that our method outperforms other state-of-the-art algorithms for predicting the interaction of new chemicals with multiple proteins. Thus, the proposed algorithm may provide a powerful tool for multi-target drug design.

PubMed Disclaimer

Figures

Figure 1
Figure 1. The components of the loss function.
INPUT: the sum of the interaction and impute matrices R + Q; the weight matrix W; the protein similarity matrix M; the chemical similarity matrix N. OUTPUT: the matrix of protein latent preferences F; the matrix of chemical latent preferences G; the matrix of predicted interaction probabilities formula image.
Figure 2
Figure 2. ZINC benchmark MCS.
The True Positive Rate (TPR) at top r% predictions formula image with varying number of (maximal) chemical structural similarity (MCS).
Figure 3
Figure 3. ZINC benchmark LT.
The True Positive Rate (TPR) at top r% predictions formula image with varying number of ligands per target (LT).
Figure 4
Figure 4. Accuracy over iterations.
The accuracy of COSINE (TPR at top 1%; y-axis) as a function of the number of iterations (x-axis) in different subsets of the ZINC benchmark (1–5, 6–10, 11–15, 26–20, and >20 ligands per target).
Figure 5
Figure 5. Accuracy as a function of noise.
The accuracy of COSINE in the 10-fold CV Yam08 benchmark as a function of the amount of missing interaction data (left) and as a function of the amount of incorrect interaction data (right).

References

    1. Xie L., Xie L., Kinnings S. L. & Bourne P. E. Novel computational approaches to polypharmacology as a means to define responses to individual drugs. Annu. Rev. Pharmacol. Toxicol. 52, 361–379 (2012). - PubMed
    1. Xie L. et al.. Towards structural systems pharmacology to study complex disease and personalized medicine. PLoS Comput. Biol. 10, e1003554 (2014). - PMC - PubMed
    1. Hart T. & Xie L. Providing data science support for systems pharmacology and its implications to drug discovery. Expert Opin. Drug Discov. 11, 241–256 (2016). - PMC - PubMed
    1. Xie L., Li J., Xie L. & Bourne P. E. Drug discovery using chemical systems biology: identification of the protein-ligand binding network to explain the side effects of CETP inhibitors. PLoS Comput. Biol. 5, e1000387 (2009). - PMC - PubMed
    1. Chang R. L., Xie L., Xie L., Bourne P. E. & Palsson B. Drug off-target effects predicted using structural analysis in the context of a metabolic network model. PLoS Comput. Biol. 6, e1000938 (2010). - PMC - PubMed

Publication types

LinkOut - more resources