Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Jan 1;28(1):84-90.
doi: 10.1093/bioinformatics/btr621. Epub 2011 Nov 10.

Multifunctional proteins revealed by overlapping clustering in protein interaction network

Affiliations

Multifunctional proteins revealed by overlapping clustering in protein interaction network

Emmanuelle Becker et al. Bioinformatics. .

Abstract

Motivation: Multifunctional proteins perform several functions. They are expected to interact specifically with distinct sets of partners, simultaneously or not, depending on the function performed. Current graph clustering methods usually allow a protein to belong to only one cluster, therefore impeding a realistic assignment of multifunctional proteins to clusters.

Results: Here, we present Overlapping Cluster Generator (OCG), a novel clustering method which decomposes a network into overlapping clusters and which is, therefore, capable of correct assignment of multifunctional proteins. The principle of OCG is to cover the graph with initial overlapping classes that are iteratively fused into a hierarchy according to an extension of Newman's modularity function. By applying OCG to a human protein-protein interaction network, we show that multifunctional proteins are revealed at the intersection of clusters and demonstrate that the method outperforms other existing methods on simulated graphs and PPI networks.

Availability: This software can be downloaded from http://tagc.univ-mrs.fr/welcome/spip.php?rubrique197

Contact: brun@tagc.univ-mrs.fr

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Flowchart of the OCG algorithm. The graph is covered by an initial overlapping class system. The initial classes are hierarchically fused optimizing the modularity of the partition, leading to overlapping clusters.
Fig. 2.
Fig. 2.
(A) Theoretical partition composed of 200 vertices. The four overlapping regions contain 10 vertices each. (B) Comparison of OCG performance, according to the initial class system chosen, when applied to simulated graphs with different edge probabilities (Pi=0.15, 0.20, 0.25, from light grey to black). Results are represented in ROC space.
Fig. 3.
Fig. 3.
(A) Topological and functional features of multi- versus monoclustered proteins. For each feature, the distributions of mono- and multiclustered proteins are represented by boxplots (line = median; dot = mean). (B) Gene Ontology terms overrepresented among multiclustered proteins [Image created by SimCT (Hermann et al., 2009)].
Fig. 4.
Fig. 4.
Comparison of the performances of CFinder, Link Communities and OCG when applied to different random graphs (A) Erdös-Rényi; (B) Random spanning trees; (C) Geo3D with different edge probabilities (pi=0.20, 0.25,…, 0.45, 0.50, from light grey to black). Results are represented in ROC space.

References

    1. Adamcsek B., et al. CFinder: locating cliques and overlapping modules in biological networks. Bioinformatics. 2006;22:1021–1023. - PubMed
    1. Ahn Y.-Y., et al. Link communities reveal multiscale complexity in networks. Nature. 2010;466:761–764. - PubMed
    1. Aittokallio T., Schwikowski B. Graph-based methods for analysing networks in cell biology. Brief. Bioinform. 2006;7:243–255. - PubMed
    1. Angelelli J.B., Reboul L. Proceedings of JOBIM 2008. Lille: University Press; 2008. Network modularity optimization by a fusion-fission process and application to protein-protein interactions networks; pp. 105–110.
    1. Ahn J., et al. Integrative gene network construction for predicting a set of complementary prostate cancer genes. Bioinformatics. 2011;27:1846–1853. - PubMed

Publication types