Phylogenetic distances are encoded in networks of interacting pathways
- PMID: 18820265
- PMCID: PMC2579716
- DOI: 10.1093/bioinformatics/btn503
Phylogenetic distances are encoded in networks of interacting pathways
Abstract
Motivation: Although metabolic reactions are unquestionably shaped by evolutionary processes, the degree to which the overall structure and complexity of their interconnections are linked to the phylogeny of species has not been evaluated in depth. Here, we apply an original metabolome representation, termed Network of Interacting Pathways or NIP, with a combination of graph theoretical and machine learning strategies, to address this question. NIPs compress the information of the metabolic network exhibited by a species into much smaller networks of overlapping metabolic pathways, where nodes are pathways and links are the metabolites they exchange.
Results: Our analysis shows that a small set of descriptors of the structure and complexity of the NIPs combined into regression models reproduce very accurately reference phylogenetic distances derived from 16S rRNA sequences (10-fold cross-validation correlation coefficient higher than 0.9). Our method also showed better scores than previous work on metabolism-based phylogenetic reconstructions, as assessed by branch distances score, topological similarity and second cousins score. Thus, our metabolome representation as network of overlapping metabolic pathways captures sufficient information about the underlying evolutionary events leading to the formation of metabolic networks and species phylogeny. It is important to note that precise knowledge of all of the reactions in these pathways is not required for these reconstructions. These observations underscore the potential for the use of abstract, modular representations of metabolic reactions as tools in studying the evolution of species.
Supplementary information: Supplementary data are available at Bioinformatics online.
Figures




Similar articles
-
Phylogenetic investigations of Antarctic notothenioid fishes (Perciformes: Notothenioidei) using complete gene sequences of the mitochondrial encoded 16S rRNA.Mol Phylogenet Evol. 2004 Sep;32(3):881-91. doi: 10.1016/j.ympev.2004.01.002. Mol Phylogenet Evol. 2004. PMID: 15288063
-
RNA polymerase beta subunit (rpoB) gene and the 16S-23S rRNA intergenic transcribed spacer region (ITS) as complementary molecular markers in addition to the 16S rRNA gene for phylogenetic analysis and identification of the species of the family Mycoplasmataceae.Mol Phylogenet Evol. 2012 Jan;62(1):515-28. doi: 10.1016/j.ympev.2011.11.002. Epub 2011 Nov 17. Mol Phylogenet Evol. 2012. PMID: 22115576
-
Metabolic pathfinding using RPAIR annotation.J Mol Biol. 2009 May 1;388(2):390-414. doi: 10.1016/j.jmb.2009.03.006. Epub 2009 Mar 10. J Mol Biol. 2009. PMID: 19281817
-
Analyzing methods for path mining with applications in metabolomics.Gene. 2014 Jan 25;534(2):125-38. doi: 10.1016/j.gene.2013.10.056. Epub 2013 Nov 12. Gene. 2014. PMID: 24230973 Review.
-
Prediction of metabolic pathways from genome-scale metabolic networks.Biosystems. 2011 Aug;105(2):109-21. doi: 10.1016/j.biosystems.2011.05.004. Epub 2011 May 27. Biosystems. 2011. PMID: 21645586 Review.
Cited by
-
A protein network descriptor server and its use in studying protein, disease, metabolic and drug targeted networks.Brief Bioinform. 2017 Nov 1;18(6):1057-1070. doi: 10.1093/bib/bbw071. Brief Bioinform. 2017. PMID: 27542402 Free PMC article.
-
Metabolic classification of microbial genomes using functional probes.BMC Genomics. 2012 Apr 27;13:157. doi: 10.1186/1471-2164-13-157. BMC Genomics. 2012. PMID: 22537274 Free PMC article.
-
Optimized ancestral state reconstruction using Sankoff parsimony.BMC Bioinformatics. 2009 Feb 7;10:51. doi: 10.1186/1471-2105-10-51. BMC Bioinformatics. 2009. PMID: 19200389 Free PMC article.
-
Topological assessment of metabolic networks reveals evolutionary information.Sci Rep. 2018 Oct 29;8(1):15918. doi: 10.1038/s41598-018-34163-7. Sci Rep. 2018. PMID: 30374088 Free PMC article.
-
Evolution of metabolic network organization.BMC Syst Biol. 2010 May 11;4:59. doi: 10.1186/1752-0509-4-59. BMC Syst Biol. 2010. PMID: 20459825 Free PMC article.
References
-
- Aittokallio T, Schwikowski B. Graph-based methods for analysing networks in cell biology. Brief. Bioinform. 2006;7:243–255. - PubMed
-
- Barabasi A, Albert R. Emergence of scaling in random networks. Science. 1999;286:509–512. - PubMed
-
- Bonchev D, Buck G. Quantitative measures of network complexity. In: Bonchev D, Rouvray D, editors. Complexity in Chemistry, Biology, and Ecology. New York: Springer; 2005. pp. 191–235.