Genome evolution reveals biochemical networks and functional modules
- PMID: 14673105
- PMCID: PMC307584
- DOI: 10.1073/pnas.2136809100
Genome evolution reveals biochemical networks and functional modules
Abstract
The analysis of completely sequenced genomes uncovers an astonishing variability between species in terms of gene content and order. During genome history, the genes are frequently rear-ranged, duplicated, lost, or transferred horizontally between genomes. These events appear to be stochastic, yet they are under selective constraints resulting from the functional interactions between genes. These genomic constraints form the basis for a variety of techniques that employ systematic genome comparisons to predict functional associations among genes. The most powerful techniques to date are based on conserved gene neighborhood, gene fusion events, and common phylogenetic distributions of gene families. Here we show that these techniques, if integrated quantitatively and applied to a sufficiently large number of genomes, have reached a resolution which allows the characterization of function at a higher level than that of the individual gene: global modularity becomes detectable in a functional protein network. In Escherichia coli, the predicted modules can be bench-marked by comparison to known metabolic pathways. We found as many as 74% of the known metabolic enzymes clustering together in modules, with an average pathway specificity of at least 84%. The modules extend beyond metabolism, and have led to hundreds of reliable functional predictions both at the protein and pathway level. The results indicate that modularity in protein networks is intrinsically encoded in present-day genomes.
Figures




Similar articles
-
Prediction of functional modules based on comparative genome analysis and Gene Ontology application.Nucleic Acids Res. 2005 May 18;33(9):2822-37. doi: 10.1093/nar/gki573. Print 2005. Nucleic Acids Res. 2005. PMID: 15901854 Free PMC article.
-
Evaluation of physical and functional protein-protein interaction prediction methods for detecting biological pathways.PLoS One. 2013;8(1):e54325. doi: 10.1371/journal.pone.0054325. Epub 2013 Jan 17. PLoS One. 2013. PMID: 23349851 Free PMC article.
-
Gene fusions and gene duplications: relevance to genomic annotation and functional analysis.BMC Genomics. 2005 Mar 9;6:33. doi: 10.1186/1471-2164-6-33. BMC Genomics. 2005. PMID: 15757509 Free PMC article.
-
Detecting hierarchical modularity in biological networks.Methods Mol Biol. 2009;541:145-60. doi: 10.1007/978-1-59745-243-4_7. Methods Mol Biol. 2009. PMID: 19381526 Review.
-
Prophages and bacterial genomics: what have we learned so far?Mol Microbiol. 2003 Jul;49(2):277-300. doi: 10.1046/j.1365-2958.2003.03580.x. Mol Microbiol. 2003. PMID: 12886937 Review.
Cited by
-
Global probabilistic annotation of metabolic networks enables enzyme discovery.Nat Chem Biol. 2012 Oct;8(10):848-54. doi: 10.1038/nchembio.1063. Nat Chem Biol. 2012. PMID: 22960854 Free PMC article.
-
Identification and analysis of evolutionarily cohesive functional modules in protein networks.Genome Res. 2006 Mar;16(3):374-82. doi: 10.1101/gr.4336406. Epub 2006 Jan 31. Genome Res. 2006. PMID: 16449501 Free PMC article.
-
Projection of gene-protein networks to the functional space of the proteome and its application to analysis of organism complexity.BMC Genomics. 2010 Feb 10;11 Suppl 1(Suppl 1):S4. doi: 10.1186/1471-2164-11-S1-S4. BMC Genomics. 2010. PMID: 20158875 Free PMC article.
-
How and when should interactome-derived clusters be used to predict functional modules and protein function?Bioinformatics. 2009 Dec 1;25(23):3143-50. doi: 10.1093/bioinformatics/btp551. Epub 2009 Sep 21. Bioinformatics. 2009. PMID: 19770263 Free PMC article.
-
Module detection in complex networks using integer optimisation.Algorithms Mol Biol. 2010 Nov 12;5:36. doi: 10.1186/1748-7188-5-36. Algorithms Mol Biol. 2010. PMID: 21073720 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources