Ancestral state reconstruction of metabolic pathways across pangenome ensembles
- PMID: 32924924
- PMCID: PMC7725326
- DOI: 10.1099/mgen.0.000429
Ancestral state reconstruction of metabolic pathways across pangenome ensembles
Abstract
As genome sequencing efforts are unveiling the genetic diversity of the biosphere with an unprecedented speed, there is a need to accurately describe the structural and functional properties of groups of extant species whose genomes have been sequenced, as well as their inferred ancestors, at any given taxonomic level of their phylogeny. Elaborate approaches for the reconstruction of ancestral states at the sequence level have been developed, subsequently augmented by methods based on gene content. While these approaches of sequence or gene-content reconstruction have been successfully deployed, there has been less progress on the explicit inference of functional properties of ancestral genomes, in terms of metabolic pathways and other cellular processes. Herein, we describe PathTrace, an efficient algorithm for parsimony-based reconstructions of the evolutionary history of individual metabolic pathways, pivotal representations of key functional modules of cellular function. The algorithm is implemented as a five-step process through which pathways are represented as fuzzy vectors, where each enzyme is associated with a taxonomic conservation value derived from the phylogenetic profile of its protein sequence. The method is evaluated with a selected benchmark set of pathways against collections of genome sequences from key data resources. By deploying a pangenome-driven approach for pathway sets, we demonstrate that the inferred patterns are largely insensitive to noise, as opposed to gene-content reconstruction methods. In addition, the resulting reconstructions are closely correlated with the evolutionary distance of the taxa under study, suggesting that a diligent selection of target pangenomes is essential for maintaining cohesiveness of the method and consistency of the inference, serving as an internal control for an arbitrary selection of queries. The PathTrace method is a first step towards the large-scale analysis of metabolic pathway evolution and our deeper understanding of functional relationships reflected in emerging pangenome collections.
Keywords: ancestral reconstruction; comparative genomics; metabolic pathways; parsimony method; phylogenetic profiling.
Conflict of interest statement
The authors declare that there are no conflicts of interest.
Figures










Similar articles
-
RegPrecise 3.0--a resource for genome-scale exploration of transcriptional regulation in bacteria.BMC Genomics. 2013 Nov 1;14:745. doi: 10.1186/1471-2164-14-745. BMC Genomics. 2013. PMID: 24175918 Free PMC article.
-
Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam.BMC Evol Biol. 2014 Oct 9;14:207. doi: 10.1186/s12862-014-0207-y. BMC Evol Biol. 2014. PMID: 25293379 Free PMC article.
-
MapGL: inferring evolutionary gain and loss of short genomic sequence features by phylogenetic maximum parsimony.BMC Bioinformatics. 2020 Sep 22;21(1):416. doi: 10.1186/s12859-020-03742-9. BMC Bioinformatics. 2020. PMID: 32962625 Free PMC article.
-
Software platforms to facilitate reconstructing genome-scale metabolic networks.Environ Microbiol. 2014 Jan;16(1):49-59. doi: 10.1111/1462-2920.12312. Epub 2013 Nov 18. Environ Microbiol. 2014. PMID: 24148076 Review.
-
Ancestral state reconstructions for genomes.Curr Opin Genet Dev. 2005 Dec;15(6):595-600. doi: 10.1016/j.gde.2005.09.011. Epub 2005 Oct 7. Curr Opin Genet Dev. 2005. PMID: 16216489 Review.
Cited by
-
Elucidating the functional roles of prokaryotic proteins using big data and artificial intelligence.FEMS Microbiol Rev. 2023 Jan 16;47(1):fuad003. doi: 10.1093/femsre/fuad003. FEMS Microbiol Rev. 2023. PMID: 36725215 Free PMC article. Review.
References
-
- Omland KE. The assumptions and challenges of ancestral state reconstructions. Syst Biol. 1999;48:604–611. doi: 10.1080/106351599260175. - DOI
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Molecular Biology Databases
Research Materials