Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2006 Oct 17:7:457.
doi: 10.1186/1471-2105-7-457.

Comparison of protein interaction networks reveals species conservation and divergence

Affiliations
Comparative Study

Comparison of protein interaction networks reveals species conservation and divergence

Zhi Liang et al. BMC Bioinformatics. .

Abstract

Background: Recent progresses in high-throughput proteomics have provided us with a first chance to characterize protein interaction networks (PINs), but also raised new challenges in interpreting the accumulating data.

Results: Motivated by the need of analyzing and interpreting the fast-growing data in the field of proteomics, we propose a comparative strategy to carry out global analysis of PINs. We compare two PINs by combining interaction topology and sequence similarity to identify conserved network substructures (CoNSs). Using this approach we perform twenty-one pairwise comparisons among the seven recently available PINs of E.coli, H.pylori, S.cerevisiae, C.elegans, D.melanogaster, M.musculus and H.sapiens. In spite of the incompleteness of data, PIN comparison discloses species conservation at the network level and the identified CoNSs are also functionally conserved and involve in basic cellular functions. We investigate the yeast CoNSs and find that many of them correspond to known complexes. We also find that different species harbor many conserved interaction regions that are topologically identical and these regions can constitute larger interaction regions that are topologically different but similar in framework. Based on the species-to-species difference in CoNSs, we infer potential species divergence. It seems that different species organize orthologs in similar but not necessarily the same topology to achieve similar or the same function. This attributes much to duplication and divergence of genes and their associated interactions. Finally, as the application of CoNSs, we predict 101 protein-protein interactions (PPIs), annotate 339 new protein functions and deduce 170 pairs of orthologs.

Conclusion: Our result demonstrates that the cross-species comparison strategy we adopt is powerful for the exploration of biological problems from the perspective of networks.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Schematic of pairwise network comparison in NetAlign. The comparison between two PINs is accomplished by a fast subgraph isomorphism algorithm and the resulting s-CoNSs are connected maximal common subgraphs (MCS) and exact matches of the two networks. The s-CoNSs are further merged by a clustering rule to produce c-CoNSs that allow inexact match among homologous regions of interaction in the two networks. The identified s-CoNSs and c-CoNSs are scored on the basis of their interaction topologies and evaluated by statistical significance.
Figure 2
Figure 2
Representative s-CoNSs. Each pair of matched conserved proteins from two different species is shown in one node with their identifiers delimited by a slash; black edges are conserved PPIs existed in both PINs and constitute the s-CoNS, while red/green edges are discrepant PPIs observed only in the species on the left/right of the slash, respectively. a-d. These s-CoNSs corresponds to the RNA polymerase (RNAP) of prokaryotes and are identified from the PIN comparison between E.coli (left) and H.pylori (right) (figure 10). e. This s-CoNS is from the NetAlign analysis between H.sapien and M.musculus. It is a part of the system of fibroblast growth factors (FGF) and FGF receptors (figure 8). Gene duplication present in this system results in great redundancy for the identified s-CoNSs that 151 very similar s-CoNSs are identified. f. This is an s-CoNS harbored by the PINs of S.cerevisiae and C.elegans, and it is a part of E2F/DP transcription factor complex (figure 4). Based on the discrepant red edge, we predict that F11A10.2 interacts with T13H5.4 and this prediction is also present in the interolog database [32]. g-k. These s-CoNSs constitute the complex of replication factor C (figure 3) and are derived from the comparison between the PINs of S.cerevisiae and H.sapien.
Figure 3
Figure 3
Representative c-CoNS: the complex of replication factor C (RFC). Figure 3-10 are representative c-CoNSs. Each c-CoNS is shown in two separate panels each for a species; orthologous and transitively orthologous proteins are shown in the same horizontal level in each panel. Black edges are conserved PPIs existed in both PINs and constitute c-CoNSs, while red/green edges are discrepant PPIs observed only in the species on the left/right panel, respectively.
Figure 4
Figure 4
Representative c-CoNS: E2F/DP transcription factor complex.
Figure 5
Figure 5
Representative c-CoNS: the general transcription and DNA repair factor IIH (TFIIH) complex.
Figure 6
Figure 6
Representative c-CoNS: ATP synthase.
Figure 7
Figure 7
Representative c-CoNS: synaptosomal neurotransmitter release.
Figure 8
Figure 8
Representative c-CoNS: system of fibroblast growth factors (FGFs) and FGF receptors.
Figure 9
Figure 9
Representative c-CoNS: nucleocytoplasmic transport.
Figure 10
Figure 10
Representative c-CoNS: bacterial RNA polymerase (RNAP).
Figure 11
Figure 11
Function distribution of the identified CoNSs. Each pie chart represents the distribution of the ten functional categories of the CoNSs derived from a pairwise PIN comparison. The area of each pie chart is approximately scaled according to the number of conserved proteins involved in the CoNSs.

Similar articles

Cited by

References

    1. Barabási AL, Oltvai ZN. Network biology: understanding the cell's functional organization. Nat Rev Genet. 2004;5:101–113. doi: 10.1038/nrg1272. - DOI - PubMed
    1. Fields S, Song O. A novel genetic system to detect protein-protein interactions. Nature. 1989;340:245–246. doi: 10.1038/340245a0. - DOI - PubMed
    1. Ho Y, Gruhler A, Heilbut A, Bader GD, Moore L, Adams SL, Millar A, Taylor P, Bennett K, Boutilier K, Yang L, Wolting C, Wolting C, Donaldson I, Schandorff S, Shewnarane J, Vo M, Taggart J, Goudreault M, Muskat B, Alfarano C, Dewar D, Lin Z, Michalickova K, Willems AR, Sassi H, Nielsen PA, Rasmussen KJ, Andersen JR, Johansen LE, Hansen LH, Jespersen H, Podtelejnikov A, Nielsen E, Crawford J, Poulsen V, SØrensen BD, Matthiesen J, Hendrickson RC, Gleeson F, Pawson T, Moran MF, Durocher D, Mann M, Hogue CWV, Figeys D, Tyers M. Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry. Nature. 2002;415:180–183. doi: 10.1038/415180a. - DOI - PubMed
    1. Zhu H, Bilgin M, Bangham R, Hall D, Casamayor A, Bertone P, Lan N, Jansen R, Bidlingmaier S, Houfek T, Mitchell T, Miller P, Dean RA, Gerstein M, Snyder M. Global analysis of protein activities using proteome chips. Science. 2001;293:2101–2105. doi: 10.1126/science.1062191. - DOI - PubMed
    1. Tong AHY, Drees B, Nardelli G, Bader GD, Brannetti B, Castagnoli L, Evangelista M, Ferracuti S, Nelson B, Paoluzi S, Quondam M, Zucconi A, Hogue CWV, Fields S, Boone C, Cesareni G. A combined experimental and computational strategy to define protein interaction networks for peptide recognition modules. Science. 2002;295:321–324. doi: 10.1126/science.1064987. - DOI - PubMed

Publication types

LinkOut - more resources