Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Apr 12:17:160.
doi: 10.1186/s12859-016-1005-x.

Predicting drug target interactions using meta-path-based semantic network analysis

Affiliations

Predicting drug target interactions using meta-path-based semantic network analysis

Gang Fu et al. BMC Bioinformatics. .

Abstract

Background: In the context of drug discovery, drug target interactions (DTIs) can be predicted based on observed topological features of a semantic network across the chemical and biological space. In a semantic network, the types of the nodes and links are different. In order to take into account the heterogeneity of the semantic network, meta-path-based topological patterns were investigated for link prediction.

Results: Supervised machine learning models were constructed based on meta-path topological features of an enriched semantic network, which was derived from Chem2Bio2RDF, and was expanded by adding compound and protein similarity neighboring links obtained from the PubChem databases. The additional semantic links significantly improved the predictive performance of the supervised learning models. The binary classification model built upon the enriched feature space using the Random Forest algorithm significantly outperformed an existing semantic link prediction algorithm, Semantic Link Association Prediction (SLAP), to predict unknown links between compounds and protein targets in an evolving network. In addition to link prediction, Random Forest also has an intrinsic feature ranking algorithm, which can be used to select the important topological features that contribute to link prediction.

Conclusions: The proposed framework has been demonstrated as a powerful alternative to SLAP in order to predict DTIs using the semantic network that integrates chemical, pharmacological, genomic, biological, functional, and biomedical information into a unified framework. It offers the flexibility to enrich the feature space by using different normalization processes on the topological features, and it can perform model construction and feature selection at the same time.

Keywords: Link prediction; Machine learning; Meta-path topological feature; Random forest; Semantic network analysis.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Schematic representation of calculations of commuting matrix C15 through multiplying A2, A11, and A12
Fig. 2
Fig. 2
Receiver operating characteristic curves (a) and precision/recall curves (b) for the six models using two machine learning algorithms to build binary classification models upon three topological feature spaces. RF means Random Forest, SVM means support vector machine, FI means feature set I, FII means feature set II, and FIII means feature set III
Fig. 3
Fig. 3
ROC curves for the Random Forest model built upon feature set III and SLAP. RF means Random Forest and FIII means feature set III
Fig. 4
Fig. 4
Variable importance for Random Forest model built with feature set II. The color code for feature importance according to mean decrease accuracy: red (>70), blue (>45 and <70), green (<45); the color code for feature importance according to mean decrease Gini index: red (>240), blue (>240 and <100), green (<100)
Fig. 5
Fig. 5
Box plot for the variable importance varying in 1 000 Random Forest models

References

    1. Bredel M, Jacoby E. Chemogenomics: an emerging strategy for rapid target and drug discovery. Nat Rev Genet. 2004;5(4):262–275. doi: 10.1038/nrg1317. - DOI - PubMed
    1. Yamanishi Y. Chemogenomic approaches to infer drug-target interaction networks. Methods Mol Biol. 2013;939:97–113. doi: 10.1007/978-1-62703-107-3_9. - DOI - PubMed
    1. Oprea TI, May EE, Leitao A, Tropsha A. Computational systems chemical biology. Methods Mol Biol. 2011;672:459–488. doi: 10.1007/978-1-60761-839-3_18. - DOI - PMC - PubMed
    1. Oprea TI, Tropsha A, Faulon JL, Rintoul MD. Systems chemical biology. Nat Chem Biol. 2007;3(8):447–450. doi: 10.1038/nchembio0807-447. - DOI - PMC - PubMed
    1. Lee S, Park K, Kim D. Building a drug-target network and its applications. Expert Opin Drug Discov. 2009;4(11):1177–1189. doi: 10.1517/17460440903322234. - DOI - PubMed

Publication types