Mapping of orthologous genes in the context of biological pathways: An application of integer programming
- PMID: 16373500
- PMCID: PMC1325003
- DOI: 10.1073/pnas.0509737102
Abstract
Mapping biological pathways across microbial genomes is an important technique in functional studies of biological systems. Existing methods rely mainly on sequence-based orthologous gene mapping, which often yields suboptimal results because sequence similarity alone is insufficient to identify orthology relationships accurately. Here we present an algorithm for pathway mapping across microbial genomes. The algorithm takes into account both sequence similarity and genomic structure information such as operons and regulons. One basic premise of our approach is that a microbial pathway can generally be decomposed into a few operons or regulons. We formulate pathway mapping as the problem of mapping genes across genomes so as to maximize their sequence similarity, subject to the constraint that the mapped genes be grouped into a few operons, preferably coregulated, in the target genome. We have developed an integer-programming algorithm for solving this constrained optimization problem and implemented it as a software program, p-map. We have tested p-map on a number of known homologous pathways. We conclude that using genomic structure information as constraints could greatly improve pathway-mapping accuracy over methods that use sequence-similarity information alone.
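The constrained optimization described in the abstract can be illustrated with a toy example. The sketch below is not p-map and does not reproduce the paper's integer-programming formulation; it is a brute-force stand-in that assumes a simple objective (total sequence similarity minus a fixed penalty for each target operon used). The function name `map_pathway`, the `operon_penalty` parameter, and all gene names and scores are hypothetical and chosen only for illustration.

```python
from itertools import product

# Hypothetical similarity scores: query pathway gene -> {target gene: score}.
SIM = {
    "a": {"t1": 0.9},
    "b": {"t2": 0.6, "t4": 0.7},  # t4 is the best hit by sequence alone
    "c": {"t3": 0.8},
}

# Hypothetical operon membership of each target gene.
OPERON = {"t1": "O1", "t2": "O1", "t3": "O1", "t4": "O2"}


def map_pathway(similarity, operon_of, operon_penalty=0.5):
    """Exhaustively search one-to-one gene mappings.

    Score = sum of similarities - operon_penalty * (number of operons used).
    This is an assumed toy objective, not the formulation used by p-map.
    Returns (best_score, mapping).
    """
    queries = sorted(similarity)
    candidates = [sorted(similarity[q]) for q in queries]
    best_score, best_mapping = float("-inf"), None
    for choice in product(*candidates):
        if len(set(choice)) < len(choice):
            continue  # enforce a one-to-one mapping
        total = sum(similarity[q][t] for q, t in zip(queries, choice))
        operons_used = {operon_of[t] for t in choice}
        score = total - operon_penalty * len(operons_used)
        if score > best_score:
            best_score, best_mapping = score, dict(zip(queries, choice))
    return best_score, best_mapping


if __name__ == "__main__":
    print(map_pathway(SIM, OPERON, operon_penalty=0.0))  # sequence similarity only
    print(map_pathway(SIM, OPERON, operon_penalty=0.5))  # with operon grouping
```

With the penalty set to zero, gene `b` maps to its top sequence hit `t4`, which lies in a separate operon. With the penalty enabled, `b` maps to `t2` instead, keeping all three mapped genes in one operon. Exhaustive search is only feasible for tiny inputs; a real implementation would pass an equivalent formulation to an integer-programming solver.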