Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Feb 10;11 Suppl 1(Suppl 1):S3.
doi: 10.1186/1471-2164-11-S1-S3.

Computational approaches for detecting protein complexes from protein interaction networks: a survey

Affiliations

Computational approaches for detecting protein complexes from protein interaction networks: a survey

Xiaoli Li et al. BMC Genomics. .

Abstract

Background: Most proteins form macromolecular complexes to perform their biological functions. However, experimentally determined protein complex data, especially of those involving more than two protein partners, are relatively limited in the current state-of-the-art high-throughput experimental techniques. Nevertheless, many techniques (such as yeast-two-hybrid) have enabled systematic screening of pairwise protein-protein interactions en masse. Thus computational approaches for detecting protein complexes from protein interaction data are useful complements to the limited experimental methods. They can be used together with the experimental methods for mapping the interactions of proteins to understand how different proteins are organized into higher-level substructures to perform various cellular functions.

Results: Given the abundance of pairwise protein interaction data from high-throughput genome-wide experimental screenings, a protein interaction network can be constructed from protein interaction data by considering individual proteins as the nodes, and the existence of a physical interaction between a pair of proteins as a link. This binary protein interaction graph can then be used for detecting protein complexes using graph clustering techniques. In this paper, we review and evaluate the state-of-the-art techniques for computational detection of protein complexes, and discuss some promising research directions in this field.

Conclusions: Experimental results with yeast protein interaction data show that the interaction subgraphs discovered by various computational methods matched well with actual protein complexes. In addition, the computational approaches have also improved in performance over the years. Further improvements could be achieved if the quality of the underlying protein interaction data can be considered adequately to minimize the undesirable effects from the irrelevant and noisy sources, and the various biological evidences can be better incorporated into the detection process to maximize the exploitation of the increasing wealth of biological knowledge available.

PubMed Disclaimer

Figures

Figure 1
Figure 1
An example of how MCODE detects protein complexes from a small sample graph of protein-protein interactions.
Figure 2
Figure 2
CODEC to detect protein complexes from TAP data which are modeled as a bipartite graph.
Figure 3
Figure 3
COACH method detects core-attachment complexes with overlapping core structures.
Figure 4
Figure 4
Comparative performance of existing methods in terms of various evaluation metrics for DIP data. The methods are ordered chronologically in the years in which they were published.
Figure 5
Figure 5
Comparative performance of existing methods in terms of various evaluation metrics for the Krogan data. The methods are ordered chronologically in the years in which they were published.
Figure 6
Figure 6
Proportion of statistically significant complexes predicted by various methods in terms of P-values.

References

    1. Alberts B. The cell as a collection of protein machines: preparing the next generation of molecular biologists. Cell. 1998;92(3):291–294. doi: 10.1016/S0092-8674(00)80922-8. - DOI - PubMed
    1. Gavin AC. Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature. 2002;415(6868):141–147. doi: 10.1038/415141a. - DOI - PubMed
    1. Gavin A, Aloy P, Grandi P, Krause R, Boesche M. Proteome survey reveals modularity of the yeast cell machinery. Nature. 2006;440(7084):631–636. doi: 10.1038/nature04532. - DOI - PubMed
    1. F Cullis A, Muirhead H, Perutz MF, Rossmann MG, North ACT. Structure of Hamoglobin: A Three-Dimensional Fourier Synthesis at 5.5-A. Resolution, Obtained by X-Ray Analysis. Nature. 1960;185(4711):416–422. doi: 10.1038/185416a0. - DOI - PubMed
    1. Downward J. The ins and outs of signaling. Nature. 2001;411(6839):759–762. doi: 10.1038/35081138. - DOI - PubMed

Publication types