BMC Syst Biol. 2009 Oct 29;3:103.

doi: 10.1186/1752-0509-3-103.

Inferring branching pathways in genome-scale metabolic networks

Esa Pitkänen¹, Paula Jouhten, Juho Rousu

PMID: 19874610
PMCID: PMC2791103
DOI: 10.1186/1752-0509-3-103

Esa Pitkänen et al. BMC Syst Biol. 2009.

Affiliation

¹ Department of Computer Science, University of Helsinki, Finland. esa.pitkanen@cs.helsinki.fi

Abstract

Background: A central problem in computational metabolic modelling is how to find biochemically plausible pathways between metabolites in a metabolic network. Two general, complementary frameworks have been used to find metabolic pathways: constraint-based modelling and graph-theoretic path finding. In constraint-based modelling, one aims to find pathways in which metabolites are balanced in a pseudo-steady state. Constraint-based methods, such as elementary flux mode analysis, typically have a high computational cost, stemming from the large number of steady-state pathways in a typical metabolic network. Graph-theoretic approaches, on the other hand, avoid this computational complexity by solving the simpler problem of finding shortest paths. However, while they scale well with network size, graph-theoretic methods tend to return more false-positive pathways than constraint-based methods.

Results: In this paper, we introduce a computational method, ReTrace, for finding biochemically relevant, branching metabolic pathways in an atom-level representation of metabolic networks. The method finds compact pathways that transfer a high fraction of atoms from source to target metabolites by considering combinations of linear shortest paths. In contrast to current steady-state pathway analysis methods, our method scales well and is able to operate on genome-scale models. Further, we show that the pathways produced are biochemically meaningful with an example involving the biosynthesis of inosine 5'-monophosphate (IMP). In particular, the method avoids typical problems associated with graph-theoretic approaches, such as the need to define side metabolites, or the appearance in the results of pathways that carry no net carbon flux. Finally, we discuss an application involving reconstruction of the amino acid pathways of a recently sequenced organism, demonstrating how measurement data can be easily incorporated into ReTrace analysis. ReTrace is licensed under the GPL and is freely available for academic use at http://www.cs.helsinki.fi/group/sysfys/software/retrace/.

Conclusion: ReTrace is a useful method for metabolic path finding tasks, combining some of the best aspects of constraint-based and graph-theoretic methods. It finds use in a multitude of tasks ranging from metabolic engineering to metabolic reconstruction of recently sequenced organisms.

Figures

**Figure 1**
**Example atom mappings**. Example reaction r: m₁ → m₂ + m₃, two alternative atom mappings Γ₁(r) = {(a₁, a₃), (a₂, a₄)} (left) and Γ₂(r) = {(a₁, a₄), (a₂, a₃)} (right), and the corresponding atom graphs G({r}) for atom mappings Γ₁ and Γ₂. Atom mappings are indicated by shading of atoms. The reaction consumes atoms {a₁, a₂} and produces atoms {a₃, a₄}.
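
The construction described in this caption can be sketched in Python. This is a minimal illustration under stated assumptions, not the ReTrace implementation: the function name `atom_graph` and the dict-of-sets representation are hypothetical, and only the atom labels and the two mappings Γ₁ and Γ₂ are taken from the caption.

```python
from collections import defaultdict

def atom_graph(mappings):
    """Build a directed atom graph from atom mappings.

    `mappings` maps a reaction name to a set of (substrate atom, product atom)
    pairs; each pair becomes one directed edge. (Hypothetical representation.)
    """
    graph = defaultdict(set)
    for pairs in mappings.values():
        for src, dst in pairs:
            graph[src].add(dst)
    return dict(graph)

# The two alternative atom mappings of reaction r: m1 -> m2 + m3 from the caption.
gamma_1 = {"r": {("a1", "a3"), ("a2", "a4")}}
gamma_2 = {"r": {("a1", "a4"), ("a2", "a3")}}

g1 = atom_graph(gamma_1)  # edges a1 -> a3 and a2 -> a4
g2 = atom_graph(gamma_2)  # edges a1 -> a4 and a2 -> a3
```

The same reaction yields two different atom graphs, which is why the choice of atom mapping matters for any path search run on top of it.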

**Figure 2**
**Example pathway of three reactions**. Example pathway of three reactions r₁, r₂ and r₃ and seven metabolites m₁, ..., m₇ containing 13 atoms in total. Atom coloring indicates how the atoms are mapped in reactions. For instance, the pathway transfers atom a₁ to atoms f_P({a₁}) = {a₅, a₇, a₁₂} and atom a₈ to atom f_P({a₈}) = {a₁₃}.

**Figure 3**
**Pathway examples**. Examples of four pathways and their associated Z_O scores. The assignments of sets S and T are indicated with dashed boxes, and atom mappings with atom colorings. Top left: pathway P₁ = {r₁, r₂} transfers all three atoms in S to T, achieving Z_O(P₁, S, T) = 1. Bottom left: pathway P₂ = {r₃, r₄} transfers the green and black atoms to T. Since the grey atom is not in S, Z_O(P₂, S, T) < 1. Top right: branching pathway P₃ = {r₅, r₆, r₇, r₈} transfers all atoms in S, resulting in Z_O(P₃, S, T) = 1. Bottom right: pathway P₄ = {r₉, r₁₀} transfers the only target atom from S, giving Z_O(P₄, S, T) = 1. However, two atoms of S are not transferred to T.
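
A score of this kind can be sketched in Python. The sketch assumes that Z_O is the fraction of target atoms that receive an atom from S through the reactions of the pathway, a reading that is consistent with the four cases in the caption but is not defined on this page. The function names and the toy reactions, atoms and mappings below are hypothetical and do not reproduce the figure.

```python
from collections import deque

def transferred_atoms(mappings, pathway, sources):
    """Return all atoms reachable from `sources` using only reactions in `pathway`."""
    edges = {}
    for reaction in pathway:
        for src, dst in mappings[reaction]:
            edges.setdefault(src, set()).add(dst)
    seen = set(sources)
    queue = deque(sources)
    while queue:  # breadth-first search over atom-to-atom edges
        atom = queue.popleft()
        for nxt in edges.get(atom, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen

def z_o(mappings, pathway, sources, targets):
    """Assumed score: fraction of target atoms reached from the source atoms."""
    reached = transferred_atoms(mappings, pathway, sources)
    return len(reached & set(targets)) / len(targets)

# Hypothetical toy data (atom and reaction names are made up for illustration).
mappings = {
    "r1": {("s1", "x1"), ("s2", "x2"), ("s3", "x3")},
    "r2": {("x1", "t1"), ("x2", "t2"), ("x3", "t3")},
    "r3": {("s1", "y1"), ("s2", "y2"), ("g", "y3")},  # "g" is not a source atom
    "r4": {("y1", "t1"), ("y2", "t2"), ("y3", "t3")},
}
S = {"s1", "s2", "s3"}
T = {"t1", "t2", "t3"}

complete = z_o(mappings, {"r1", "r2"}, S, T)  # every target atom is reached from S
partial = z_o(mappings, {"r3", "r4"}, S, T)   # t3 is fed by "g", which lies outside S
```

Under this assumed definition the score only looks at the target side, so a pathway can score 1 while leaving some source atoms unused, as in the bottom-right panel.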

**Figure 4**
**Reduction of Minimum-Set-Cover to Find-Minimal-Pathway**. Left: a minimal set cover instance with element set {s₁, ..., s₆} (circles) and subset collection {C₁, C₂, C₃, C₄}. Right: the Find-Minimal-Pathway instance corresponding to the set cover instance, with 12 atoms and four reactions. Arrows denote the mapping of atoms Γ over reactions. In particular, mappings shown with similarly dashed arrow lines belong to the same reaction. Source and target atom sets are indicated with S and T.
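
One plausible reading of this reduction can be sketched in Python. The sketch assumes one source atom and one target atom per element (6 + 6 = 12 atoms) and one reaction per subset, each mapping the source atoms of its members to the matching target atoms. The contents of C₁ to C₄ are not given in the caption, so the subsets below are hypothetical, as are the function names; the brute-force search is for illustration only and is exponential in the number of reactions.

```python
from itertools import combinations

def reduce_set_cover(universe, subsets):
    """Build a pathway instance: one reaction per subset, one atom pair per element."""
    sources = {f"s_{e}" for e in universe}
    targets = {f"t_{e}" for e in universe}
    mappings = {
        name: {(f"s_{e}", f"t_{e}") for e in members}
        for name, members in subsets.items()
    }
    return mappings, sources, targets

def minimal_pathway(mappings, sources, targets):
    """Brute force: smallest reaction set whose mappings reach every target atom."""
    names = sorted(mappings)
    for size in range(1, len(names) + 1):
        for combo in combinations(names, size):
            reached = {dst for r in combo for src, dst in mappings[r] if src in sources}
            if targets <= reached:
                return set(combo)
    return None  # no reaction set covers all target atoms

# Hypothetical subset contents; only the sizes (6 elements, 4 subsets) follow the caption.
universe = range(1, 7)
subsets = {"C1": {1, 2, 3}, "C2": {3, 4}, "C3": {4, 5, 6}, "C4": {1, 6}}

mappings, S, T = reduce_set_cover(universe, subsets)
best = minimal_pathway(mappings, S, T)  # reactions C1 and C3 together reach all of T
```

In this construction a reaction set reaches every target atom exactly when the corresponding subsets cover every element, so a smallest complete pathway corresponds to a minimum set cover.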

**Figure 5**
**Reaction and metabolite junctions**. Top left: atoms in metabolites m₁ and m₂ are transferred to atoms in metabolite m₅ via two reaction paths p₁ = (r₁, r₃) and p₂ = (r₂, r₃) (indicated by dashed arrows). The paths merge in reaction r₃. Pathway P, consisting of reactions r₁, r₂ and r₃, has Z_O(P, S, T) = 1 when the atoms of {m₁, m₂} and {m₅} comprise the source and target sets S and T, respectively. Top right: pathway P' = {r₁, r₂, r₃} achieves Z_O(P', S, T) = 1, assuming the source and target atoms to be all atoms in {m₁, m₂} and {m₄}, respectively. The two reaction paths transferring the atoms, (r₁, r₃) and (r₂, r₃), merge in metabolite m₃. Consequently, atoms from m₁ and m₂ are never transferred to the same instance of metabolite m₃ via these paths. Bottom left and right: atom graph representations of pathways P (left) and P' (right). Hollow circles denote atoms which can originate from atoms in metabolites m₁ or m₂.

**Figure 6**
**ReTrace example run**. Example ReTrace run for the query m₁ → m₁₀ with k = 3 in a database of 9 reactions and 10 metabolites. Atoms are numbered from top to bottom as shown in the figure. Dashed arrows indicate edges connecting v_Δ and v_U to atom nodes. Other atom graph edges are not drawn; instead, arrows indicate substrates and products in reactions, and atoms are mapped in linear fashion. For example, in reaction r₉, atom nodes v_7,1, v_8,1 and v_8,2 are connected to nodes v_9,1, v_9,2 and v_9,3, respectively. At first, the unresolved nodes are U = {v_10,1, v_10,2, v_10,3}. Top: algorithm state after the first call to Procedure FindPath. The three shortest atom paths found are (v_Δ, v_1,1, v_8,1, v_9,2, v_10,2, v_U), (v_Δ, v_1,2, v_8,2, v_9,3, v_10,3, v_U) and (v_Δ, v_1,2, v_4,2, v_7,1, v_9,1, v_10,1, v_U), with path length ties broken arbitrarily. For the path chosen to be processed first, the corresponding reaction set is P' = {r₃, r₈, r₉}. Tracing back from v_U, ReTrace finds that v_10,2 and v_10,3 can be traced back to v_Δ, while v_7,1 is added to U. Procedure FindPath is then called recursively. Bottom: algorithm state after the second call to Procedure FindPath. Edges to v_U are updated to reflect U = {v_7,1}. Shortest paths from v_Δ to v_U are computed. However, only two paths are found: (v_Δ, v_1,2, v_4,2, v_7,1, v_U) and (v_Δ, v_1,2, v_3,1, v_7,1, v_U). Choosing one of them arbitrarily to process next, ReTrace finds that the reaction set P' = {r₂, r₆} resolves the remaining atom in U, and a complete pathway {r₂, r₃, r₆, r₈, r₉} has been discovered.

**Figure 7**
**Excerpt from ReTrace result page**. Excerpt from an HTML result page showing the first pathway found for the query from erythrose 4-phosphate (E4P) and phosphoenolpyruvate (PEP) to phenylalanine (Phe). Green circles in molecule structures indicate atoms in sources that the pathway transfers to target atoms. Additionally, the Z_O score (Z) and the composite map of this pathway are shown.

**Figure 8**
**Result pathway diagram**. Diagram of a result pathway for a query from erythrose 4-phosphate (E4P) and phosphoenolpyruvate (PEP) to phenylalanine (Phe). Source and target metabolites are drawn in green and yellow, respectively. For clarity, the pathway has been split into two parts, with 5-O-(1-Carboxyvinyl)-3-phosphoshikimate repeated in both parts.

**Figure 9**
**Component sizes and numbers in atom graph**. Component size vs. the number of components in the atom graph induced by 7781 KEGG reactions. Components of carbon, nitrogen and phosphorus atoms shown separately. Both X- and Y-axes are shown in log-scale.

**Figure 10**
**Pairwise shortest distances in atom graph**. Pairwise distances in three subgraphs corresponding to the carbon, nitrogen and phosphorus specific mappings in the atom graph. Y-axis shown in log-scale.

**Figure 11**
**Distances in atom graph from Acetyl-CoA**. Left: Structure and atom numbering of acetyl-CoA. Right: distances in the atom graph from acetyl-CoA carbon atoms 3, 7, 13, 41, 49 and 50. Acetyl carbons 49 and 50 display significantly shorter graph distances compared to other carbons.

**Figure 12**
**ReTrace running time and number of pathways found**. Total running time and the number of pathways found in pairwise pathway queries between 13 metabolites. X-axis shows the number of shortest paths searched at each search level. Each point represents averages over 240 pairwise pathway queries.

**Figure 13**
**Metabolite-specific computation time**. Total computation time shown separately for each target metabolite. X-axis shows the number of shortest paths searched at the first and second search level. Each point represents queries from 12 metabolites to the target metabolite.

**Figure 14**
**Number of pathways found**. Average number of pathways found for queries where each metabolite in turn was considered as the source (Y-axis) and target (X-axis). Each point corresponds to averages over 12 results.

**Figure 15**
**Pathway sizes**. The average result pathway sizes for queries where each metabolite in turn was considered as the source (Y-axis) and target (X-axis). Each point corresponds to averages over 12 results.

**Figure 16**
**Z_O score distribution in pathways found**. Left: Distribution of Z_O score in pathways found for query glucose → IMP. Right: Distribution of pathway sizes. Green bars show the distribution of complete (Z_O = 1) pathways.

**Figure 17**
**Representative result pathway for query glucose → IMP**. A representative result pathway for the query glucose → IMP which utilizes reactions commonly used in other result pathways. Glucose and IMP are color-coded green and yellow, respectively.

Cited by

A review of computational tools for design and reconstruction of metabolic pathways.
Wang L, Dash S, Ng CY, Maranas CD. Synth Syst Biotechnol. 2017 Nov 15;2(4):243-252. doi: 10.1016/j.synbio.2017.11.002. eCollection 2017 Dec. PMID: 29552648 Free PMC article. Review.
Metabolic modelling in the development of cell factories by synthetic biology.
Jouhten P. Comput Struct Biotechnol J. 2012 Nov 12;3:e201210009. doi: 10.5936/csbj.201210009. eCollection 2012. PMID: 24688669 Free PMC article. Review.
Seeing the forest for the trees: Retrieving plant secondary biochemical pathways from metabolome networks.
Desmet S, Brouckaert M, Boerjan W, Morreel K. Comput Struct Biotechnol J. 2020 Dec 3;19:72-85. doi: 10.1016/j.csbj.2020.11.050. eCollection 2021. PMID: 33384856 Free PMC article. Review.
Mining the key regulatory genes of chicken inosine 5'-monophosphate metabolism based on time series microarray data.
Ma T, Xu L, Wang H, Chen J, Liu L, Chang G, Chen G. J Anim Sci Biotechnol. 2015 May 23;6(1):21. doi: 10.1186/s40104-015-0022-3. eCollection 2015. PMID: 26075067 Free PMC article.
MetaMapp: mapping and visualizing metabolomic data by integrating information from biochemical pathways and chemical and mass spectral similarity.
Barupal DK, Haldiya PK, Wohlgemuth G, Kind T, Kothari SL, Pinkerton KE, Fiehn O. BMC Bioinformatics. 2012 May 16;13:99. doi: 10.1186/1471-2105-13-99. PMID: 22591066 Free PMC article.
