Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Nov 29:3:112.
doi: 10.1186/1752-0509-3-112.

Spectral affinity in protein networks

Affiliations

Spectral affinity in protein networks

Konstantin Voevodski et al. BMC Syst Biol. .

Abstract

Background: Protein-protein interaction (PPI) networks enable us to better understand the functional organization of the proteome. We can learn a lot about a particular protein by querying its neighborhood in a PPI network to find proteins with similar function. A spectral approach that considers random walks between nodes of interest is particularly useful in evaluating closeness in PPI networks. Spectral measures of closeness are more robust to noise in the data and are more precise than simpler methods based on edge density and shortest path length.

Results: We develop a novel affinity measure for pairs of proteins in PPI networks, which uses personalized PageRank, a random walk based method used in context-sensitive search on the Web. Our measure of closeness, which we call PageRank Affinity, is proportional to the number of times the smaller-degree protein is visited in a random walk that restarts at the larger-degree protein. PageRank considers paths of all lengths in a network, therefore PageRank Affinity is a precise measure that is robust to noise in the data. PageRank Affinity is also provably related to cluster co-membership, making it a meaningful measure. In our experiments on protein networks we find that our measure is better at predicting co-complex membership and finding functionally related proteins than other commonly used measures of closeness. Moreover, our experiments indicate that PageRank Affinity is very resilient to noise in the network. In addition, based on our method we build a tool that quickly finds nodes closest to a queried protein in any protein network, and easily scales to much larger biological networks.

Conclusion: We define a meaningful way to assess the closeness of two proteins in a PPI network, and show that our closeness measure is more biologically significant than other commonly used methods. We also develop a tool, accessible at http://xialab.bu.edu/resources/pnns, that allows the user to quickly find nodes closest to a queried vertex in any protein network available from BioGRID or specified by the user.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A view of the protein complexes in one of the PPI networks. The AC-Western network, with proteins in the same complex labeled with the same color.
Figure 2
Figure 2
Which measure of closeness is best at predicting co-complex membership? The number of co-complex pairs (as a fraction of the total number of co-complex pairs in the network) among pairs in the top percentile of each closeness ranking is displayed. Higher values indicate measures that are more biologically meaningful. (a) The results for the AC-Western network. (b) The results for the AC-MS network. (c) The results for the Two-Hybrid network.
Figure 3
Figure 3
Which measure of closeness best correlates with functional distance? The average functional distance of pairs in the top percentile of each closeness ranking is displayed. Lower values indicate measures that are more biologically meaningful. (a) The results for the AC-Western network. (b) The results for the AC-MS network. (c) The results for the Two-Hybrid network. The average functional distance of two proteins in the genome is 5.8.
Figure 4
Figure 4
How robust is PageRank Affinity to noise? We add noise to a network represented by a graph G = (V, E) by randomly choosing r1 * |E| of the true positive interactions and removing them, and randomly choosing r2 * |E| of the true negative interactions and adding them. We calculate the robustness score of PageRank Affinity by computing the PageRank Affinity of each pair of vertices in the noisy network, and counting how often true positive interactions score higher than true negative interactions. As a base for comparison, we also compute the robustness score of shortest path closeness (with ties in distance broken randomly). (a) The results for r1 = 0.1 and varying values of r2. (b) The results for r1 = 0.2 and varying values of r2. (c) The results for r1 = 0.3 and varying values of r2. (d) The results for r1 = 0.4 and varying values of r2.

Similar articles

Cited by

References

    1. Jeong H, Mason S, Barabasi A, Oltvai Z. Lethality and centrality in protein networks. Nature. 2001;411:41–42. doi: 10.1038/35075138. - DOI - PubMed
    1. Hahn M, Kern A. Comparative Genomics of Centrality and Essentiality in Three Eukaryotic Protein-Interaction Networks. Mol Biol Evol. 2005;22:803–806. doi: 10.1093/molbev/msi072. - DOI - PubMed
    1. Fowler J. Legislative cosponsorship networks in the US House and Senate. Social Networks. 2006;28:454–465. doi: 10.1016/j.socnet.2005.11.003. - DOI
    1. Gibson D, Kleinberg J, Raghavan P. Inferring Web communities from link topology. Proc ACM Conf on Hypertext and Hypermedia. 1998. pp. 225–234.
    1. Kumar R, Raghavan P, Rajagopalan S, Tomkins A. Trawling the Web for emerging cyber-communities. Computer Networks. 1999;31:1481–1493. doi: 10.1016/S1389-1286(99)00040-7. - DOI

Publication types

LinkOut - more resources