Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun 16;23(1):234.
doi: 10.1186/s12859-022-04766-z.

Multi-view heterogeneous molecular network representation learning for protein-protein interaction prediction

Affiliations

Multi-view heterogeneous molecular network representation learning for protein-protein interaction prediction

Xiao-Rui Su et al. BMC Bioinformatics. .

Abstract

Background: Protein-protein interaction (PPI) plays an important role in regulating cells and signals. Despite the ongoing efforts of the bioassay group, continued incomplete data limits our ability to understand the molecular roots of human disease. Therefore, it is urgent to develop a computational method to predict PPIs from the perspective of molecular system.

Methods: In this paper, a highly efficient computational model, MTV-PPI, is proposed for PPI prediction based on a heterogeneous molecular network by learning inter-view protein sequences and intra-view interactions between molecules simultaneously. On the one hand, the inter-view feature is extracted from the protein sequence by k-mer method. On the other hand, we use a popular embedding method LINE to encode the heterogeneous molecular network to obtain the intra-view feature. Thus, the protein representation used in MTV-PPI is constructed by the aggregation of its inter-view feature and intra-view feature. Finally, random forest is integrated to predict potential PPIs.

Results: To prove the effectiveness of MTV-PPI, we conduct extensive experiments on a collected heterogeneous molecular network with the accuracy of 86.55%, sensitivity of 82.49%, precision of 89.79%, AUC of 0.9301 and AUPR of 0.9308. Further comparison experiments are performed with various protein representations and classifiers to indicate the effectiveness of MTV-PPI in predicting PPIs based on a complex network.

Conclusion: The achieved experimental results illustrate that MTV-PPI is a promising tool for PPI prediction, which may provide a new perspective for the future interactions prediction researches based on heterogeneous molecular network.

Keywords: Heterogeneous molecular network; LINE; Network representation learning; Protein sequence; Protein–protein interaction.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
The overview of MTV-PPI
Fig. 2
Fig. 2
An illustration of the process of extracting inter-view feature
Fig. 3
Fig. 3
An illustration of first-order proximity and second-order proximity in LINE
Fig. 4
Fig. 4
ROC and PR curves obtained by MTV-PPI and all baseline algorithms
Fig. 5
Fig. 5
Predictive performances with two different aggregators
Fig. 6
Fig. 6
Results with different network embedding algorithms
Fig. 7
Fig. 7
ROC and PR curves obtained by various features
Fig. 8
Fig. 8
ROC and PR curves obtained by various classifiers

Similar articles

Cited by

References

    1. Kotlyar M, Pastrello C, Pivetta F, Sardo AL, Cumbaa C, Li H, Naranian T, Niu Y, Ding Z, Vafaee F, et al. In silico prediction of physical protein interactions and characterization of interactome orphans. Nat Methods. 2015;12(1):79–84. doi: 10.1038/nmeth.3178. - DOI - PubMed
    1. Fields S, Song O-k. A novel genetic system to detect protein–protein interactions. Nature. 1989;340(6230):245–246. doi: 10.1038/340245a0. - DOI - PubMed
    1. Gavin A-C, Bösche M, Krause R, Grandi P, Marzioch M, Bauer A, Schultz J, Rick JM, Michon A-M, Cruciat C-M, et al. Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature. 2002;415(6868):141–147. doi: 10.1038/415141a. - DOI - PubMed
    1. Ho Y, Gruhler A, Heilbut A, Bader GD, Moore L, Adams S-L, Millar A, Taylor P, Bennett K, Boutilier K, et al. Systematic identification of protein complexes in saccharomyces cerevisiae by mass spectrometry. Nature. 2002;415(6868):180–183. doi: 10.1038/415180a. - DOI - PubMed
    1. Luo X, Ming Z, You Z, Li S, Xia Y, Leung H. Improving network topology-based protein interactome mapping via collaborative filtering. Knowl Based Syst. 2015;90:23–32. doi: 10.1016/j.knosys.2015.10.003. - DOI

LinkOut - more resources