Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul-Sep;105(3):368504221109215.
doi: 10.1177/00368504221109215.

Integration of various protein similarities using random forest technique to infer augmented drug-protein matrix for enhancing drug-disease association prediction

Affiliations

Integration of various protein similarities using random forest technique to infer augmented drug-protein matrix for enhancing drug-disease association prediction

Satanat Kitsiranuwat et al. Sci Prog. 2022 Jul-Sep.

Abstract

Identifying new therapeutic indications for existing drugs is a major challenge in drug repositioning. Most computational drug repositioning methods focus on known targets. Analyzing multiple aspects of various protein associations provides an opportunity to discover underlying drug-associated proteins that can be used to improve the performance of the drug repositioning approaches. In this study, machine learning models were developed based on the similarities of diversified biological features, including protein interaction, topological network, sequence alignment, and biological function to predict protein pairs associating with the same drugs. The crucial set of features was identified, and the high performances of protein pair predictions were achieved with an area under the curve (AUC) value of more than 93%. Based on drug chemical structures, the drug similarity levels of the promising protein pairs were used to quantify the inferred drug-associated proteins. Furthermore, these proteins were employed to establish an augmented drug-protein matrix to enhance the efficiency of three existing drug repositioning techniques: a similarity constrained matrix factorization for the drug-disease associations (SCMFDD), an ensemble meta-paths and singular value decomposition (EMP-SVD) model, and a topology similarity and singular value decomposition (TS-SVD) technique. The results showed that the augmented matrix helped to improve the performance up to 4% more in comparison to the original matrix for SCMFDD and EMP-SVD, and about 1% more for TS-SVD. In summary, inferring new protein pairs related to the same drugs increase the opportunity to reveal missing drug-associated proteins that are important for drug development via the drug repositioning technique.

Keywords: Protein-protein interaction network; drug repositioning; drug repurposing; drug-protein association; machine learning.

PubMed Disclaimer

Conflict of interest statement

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Figures

Figure 1.
Figure 1.
Overview of this study.
Figure 2.
Figure 2.
Framework to achieve crucial features.
Figure 3.
Figure 3.
Performance of 10 experiments using the forward selection technique.
Figure 4.
Figure 4.
The number of experiments in which the observing feature was found.
Figure 5.
Figure 5.
ROC curve for five iterations.
Figure 6.
Figure 6.
Pr curve for five iterations.
Figure 7.
Figure 7.
The frequency of drugs and their associated proteins are based on direct target protein information from all approved drugs.
Figure 8.
Figure 8.
Frequency of drugs and their associated proteins based on our prediction results of protein pairs related to the same drugs.

Similar articles

Cited by

References

    1. Gottlieb A, Stein GY, Ruppin E, et al.PREDICT: a method for inferring novel drug indications with application to personalized medicine. Mol Syst Biol 2011; 7: 1–9. - PMC - PubMed
    1. Blockeel H, Kersting K, Nijssen S, et al.Machine Learning and Knowledge Discovery in Databases. 2013.
    1. Ashburn TT, Thor KB. Drug repositioning: identifying and developing new uses for existing drugs. Nat Rev Drug Discovery 2004; 3: 673–683. - PubMed
    1. Sleigh SH, Barton CL. Repurposing strategies for therapeutics. Pharmaceut Med 2010; 24: 151–159.
    1. Roses AD. Pharmacogenetics in drug discovery and development: a translational perspective. Nat Rev Drug Discovery 2008; 7: 807–817. - PubMed

LinkOut - more resources