Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 May 3;6(5):e19349.
doi: 10.1371/journal.pone.0019349.

Global geometric affinity for revealing high fidelity protein interaction network

Affiliations

Global geometric affinity for revealing high fidelity protein interaction network

Yi Fang et al. PLoS One. .

Abstract

Protein-protein interaction (PPI) network analysis presents an essential role in understanding the functional relationship among proteins in a living biological system. Despite the success of current approaches for understanding the PPI network, the large fraction of missing and spurious PPIs and a low coverage of complete PPI network are the sources of major concern. In this paper, based on the diffusion process, we propose a new concept of global geometric affinity and an accompanying computational scheme to filter the uncertain PPIs, namely, reduce the spurious PPIs and recover the missing PPIs in the network. The main concept defines a diffusion process in which all proteins simultaneously participate to define a similarity metric (global geometric affinity (GGA)) to robustly reflect the internal connectivity among proteins. The robustness of the GGA is attributed to propagating the local connectivity to a global representation of similarity among proteins in a diffusion process. The propagation process is extremely fast as only simple matrix products are required in this computation process and thus our method is geared toward applications in high-throughput PPI networks. Furthermore, we proposed two new approaches that determine the optimal geometric scale of the PPI network and the optimal threshold for assigning the PPI from the GGA matrix. Our approach is tested with three protein-protein interaction networks and performs well with significant random noises of deletions and insertions in true PPIs. Our approach has the potential to benefit biological experiments, to better characterize network data sets, and to drive new discoveries.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. The illustration of the diffusion process based inference.
(A) displays a spurious PPI (in green color solid line) and a missing PPI (in red color dash line). (B) shows that the spurious PPI (in green color dash link) is removed and the missing PPI (in red color solid line) is recovered in the network.
Figure 2
Figure 2. Geometric Representation of PPI network.
(A) displays a PPI network. formula image denotes the PPI and formula image denotes NPPI. formula image denotes the high dimensional feature vector of the point formula image and formula image denotes the high dimensional feature vector of the point formula image (a blue point and a red point). The formula image denotes the global geometric affinity between the point formula image and point formula image.
Figure 3
Figure 3. Determination of optimal propagation step.
This figure illustrates the process of determination of the optimal step. The horizontal axis denotes the propagation step, and the vertical axis denotes the AUC based probability value. Based on the observation, probability value increases initially and monotonically decreases after obtaining its maximum at step around formula image.
Figure 4
Figure 4. Determination of the threshold for PPI assignment by OLF.
This figure illustrates the procedure of computing optimal threshold for PPI assignment. Figure (A) displays the histogram of global geometric affinity (GGA). Figure (B) displays the value of match function at different threshold. The optimal threshold for GGA corresponds the one with the maxima of match function.
Figure 5
Figure 5. Comparison GGA-method and MDS-method by ROC curve.
This figure plots the ROC curve for the comparison between GGA-method and MDS-method using the CS PPI network data. The vertical axis denotes the sensitivity, and the horizontal axis denotes 1-specificity.
Figure 6
Figure 6. Comparison GGA-method and MDS-method by PR curve.
This Figure plots the PR curve for the comparison between GGA-method and MDS-method using the CS PPI network data. The vertical axis denotes the precision, and the horizontal axis denotes recall.

References

    1. Kuchaiev O, Rasajski M, Higham D, Przulj N. Geometric de-noising of protein-protein interaction networks. Plos Computational Biology. 2009;5 - PMC - PubMed
    1. Pawson T, Gish G, Nash P. Sh2 domains, interaction modules and cellular wiring. Trends in Cell Biology. 2001;11:504–511. - PubMed
    1. Ito T, Tashiro K, Muta S, Ozawa R, Chiba T, et al. Toward a protein-protein interaction map of the budding yeast: A comprehensive system to examine two-hybrid interactions in all possible combinations between the yeast proteins. Proceedings of the National Academy of Sciences of the United States of America. 2000;97:1143–1147. - PMC - PubMed
    1. Uetz P, Giot L, Cagney G, Mansfield T, Judson R, et al. A comprehensive analysis of protein-protein interactions in saccharomyces cerevisiae. Nature. 2000;403:623–627. - PubMed
    1. Giot L, Bader J, Brouwer C, Chaudhuri A, Kuang B, et al. A protein interaction map of drosophila melanogaster. Science. 2003;302:1727–1736. - PubMed

Publication types

LinkOut - more resources