Identifying metabolic enzymes with multiple types of association evidence
- PMID: 16571130
- PMCID: PMC1450304
- DOI: 10.1186/1471-2105-7-177
Abstract
Background: Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions that cannot be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding genes.
Results: We present a novel method for identifying genes that encode a specific metabolic function, based on the local structure of the metabolic network and multiple types of functional association evidence, including clustering of genes on the chromosome, similarity of phylogenetic profiles, gene expression, protein fusion events and others. Using the E. coli and S. cerevisiae metabolic networks, we illustrate the predictive ability of each individual type of association evidence and show that significantly better predictions can be obtained by combining all of the data. In this way, our method predicts 60% of the enzyme-encoding genes of E. coli metabolism within the top 10 (out of 3551) candidates for their enzymatic function, and as the top candidate in 43% of cases.
Conclusion: We illustrate that a combination of genome context and other functional association evidence is effective in predicting genes encoding metabolic enzymes. Our approach does not rely on direct sequence homology to known enzyme-encoding genes, and can be used in conjunction with traditional homology-based metabolic reconstruction methods. The method can also be used to target orphan metabolic activities.
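The ranking idea summarized in the Results can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say how the evidence types are combined, so the weighted sum used here, the gene names, the score values, the weights, and the function names (`pair_score`, `candidate_score`, `rank_candidates`, `top_k_rate`) are all hypothetical. Each candidate gene is scored by its mean association with the genes already assigned to neighboring reactions in the metabolic network, the per-evidence scores are combined, and candidates are ranked.

```python
from typing import Dict, List, Tuple

# Hypothetical layout: evidence type -> {(gene_a, gene_b): association score in [0, 1]}
Evidence = Dict[str, Dict[Tuple[str, str], float]]


def pair_score(table: Dict[Tuple[str, str], float], a: str, b: str) -> float:
    """Look up a symmetric pairwise score; pairs with no evidence count as 0."""
    return table.get((a, b), table.get((b, a), 0.0))


def candidate_score(candidate: str, neighbor_genes: List[str],
                    evidence: Evidence, weights: Dict[str, float]) -> float:
    """Weighted sum, over evidence types, of the candidate's mean association
    with the genes of neighboring reactions (an assumed combination rule)."""
    if not neighbor_genes:
        return 0.0
    total = 0.0
    for name, table in evidence.items():
        mean_assoc = sum(pair_score(table, candidate, g)
                         for g in neighbor_genes) / len(neighbor_genes)
        total += weights.get(name, 0.0) * mean_assoc
    return total


def rank_candidates(candidates: List[str], neighbor_genes: List[str],
                    evidence: Evidence, weights: Dict[str, float]) -> List[str]:
    """Rank candidates (excluding the neighbors themselves), best first."""
    pool = [c for c in candidates if c not in neighbor_genes]
    return sorted(pool,
                  key=lambda c: candidate_score(c, neighbor_genes, evidence, weights),
                  reverse=True)


def top_k_rate(true_gene_ranks: List[int], k: int) -> float:
    """Fraction of test cases whose true gene has a 1-based rank of at most k."""
    return sum(1 for r in true_gene_ranks if r <= k) / len(true_gene_ranks)


# Toy data, invented for illustration only.
evidence: Evidence = {
    "chromosome_clustering": {("geneA", "geneX"): 0.9, ("geneB", "geneX"): 0.2},
    "phylogenetic_profile": {("geneA", "geneY"): 0.8, ("geneC", "geneX"): 0.4},
    "coexpression": {("geneB", "geneY"): 0.5},
}
weights = {"chromosome_clustering": 1.0, "phylogenetic_profile": 1.0, "coexpression": 1.0}
neighbors = ["geneX", "geneY"]          # genes of reactions adjacent to the gap
candidates = ["geneA", "geneB", "geneC"]

ranking = rank_candidates(candidates, neighbors, evidence, weights)
print(ranking)
print(top_k_rate([1, 3, 12, 7], k=10))
```

A top-k rate of this kind is how figures such as "within the top 10 candidates" can be computed once the rank of each known enzyme-encoding gene has been recorded in a leave-one-out style test; the ranks passed to `top_k_rate` above are made up.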
Similar articles
- Implementation of homology based and non-homology based computational methods for the identification and annotation of orphan enzymes: using Mycobacterium tuberculosis H37Rv as a case study. BMC Bioinformatics. 2020 Oct 19;21(1):466. doi: 10.1186/s12859-020-03794-x. PMID: 33076816. Free PMC article.
- Filling gaps in a metabolic network using expression information. Bioinformatics. 2004 Aug 4;20 Suppl 1:i178-85. doi: 10.1093/bioinformatics/bth930. PMID: 15262797.
- The CanOE strategy: integrating genomic and metabolic contexts across multiple prokaryote genomes to find candidate genes for orphan enzymes. PLoS Comput Biol. 2012 May;8(5):e1002540. doi: 10.1371/journal.pcbi.1002540. Epub 2012 May 31. PMID: 22693442. Free PMC article.
- Profiling the orphan enzymes. Biol Direct. 2014 Jun 6;9:10. doi: 10.1186/1745-6150-9-10. PMID: 24906382. Free PMC article. Review.
- 'Unknown' proteins and 'orphan' enzymes: the missing half of the engineering parts list--and how to find it. Biochem J. 2009 Dec 14;425(1):1-11. doi: 10.1042/BJ20091328. PMID: 20001958. Free PMC article. Review.
Cited by
- Classification of microarray data using gene networks. BMC Bioinformatics. 2007 Feb 1;8:35. doi: 10.1186/1471-2105-8-35. PMID: 17270037. Free PMC article.
- Synergistic use of plant-prokaryote comparative genomics for functional annotations. BMC Genomics. 2011 Jun 15;12 Suppl 1(Suppl 1):S2. doi: 10.1186/1471-2164-12-S1-S2. PMID: 21810204. Free PMC article.
- MIRAGE: a functional genomics-based approach for metabolic network model reconstruction and its application to cyanobacteria networks. Genome Biol. 2012 Nov 29;13(11):R111. doi: 10.1186/gb-2012-13-11-r111. PMID: 23194418. Free PMC article.
- Semi-automated curation of metabolic models via flux balance analysis: a case study with Mycoplasma gallisepticum. PLoS Comput Biol. 2013;9(9):e1003208. doi: 10.1371/journal.pcbi.1003208. Epub 2013 Sep 5. PMID: 24039564. Free PMC article.
- ModEnzA: Accurate Identification of Metabolic Enzymes Using Function Specific Profile HMMs with Optimised Discrimination Threshold and Modified Emission Probabilities. Adv Bioinformatics. 2011;2011:743782. doi: 10.1155/2011/743782. Epub 2011 Mar 29. PMID: 21541071. Free PMC article.