Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Oct 29:3:103.
doi: 10.1186/1752-0509-3-103.

Inferring branching pathways in genome-scale metabolic networks

Affiliations

Inferring branching pathways in genome-scale metabolic networks

Esa Pitkänen et al. BMC Syst Biol. .

Abstract

Background: A central problem in computational metabolic modelling is how to find biochemically plausible pathways between metabolites in a metabolic network. Two general, complementary frameworks have been utilized to find metabolic pathways: constraint-based modelling and graph-theoretical path finding approaches. In constraint-based modelling, one aims to find pathways where metabolites are balanced in a pseudo steady-state. Constraint-based methods, such as elementary flux mode analysis, have typically a high computational cost stemming from a large number of steady-state pathways in a typical metabolic network. On the other hand, graph-theoretical approaches avoid the computational complexity of constraint-based methods by solving a simpler problem of finding shortest paths. However, while scaling well with network size, graph-theoretic methods generally tend to return more false positive pathways than constraint-based methods.

Results: In this paper, we introduce a computational method, ReTrace, for finding biochemically relevant, branching metabolic pathways in an atom-level representation of metabolic networks. The method finds compact pathways which transfer a high fraction of atoms from source to target metabolites by considering combinations of linear shortest paths. In contrast to current steady-state pathway analysis methods, our method scales up well and is able to operate on genome-scale models. Further, we show that the pathways produced are biochemically meaningful by an example involving the biosynthesis of inosine 5'-monophosphate (IMP). In particular, the method is able to avoid typical problems associated with graph-theoretic approaches such as the need to define side metabolites or pathways not carrying any net carbon flux appearing in results. Finally, we discuss an application involving reconstruction of amino acid pathways of a recently sequenced organism demonstrating how measurement data can be easily incorporated into ReTrace analysis. ReTrace is licensed under GPL and is freely available for academic use at http://www.cs.helsinki.fi/group/sysfys/software/retrace/.

Conclusion: ReTrace is a useful method in metabolic path finding tasks, combining some of the best aspects in constraint-based and graph-theoretic methods. It finds use in a multitude of tasks ranging from metabolic engineering to metabolic reconstruction of recently sequenced organisms.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Example atom mappings. Example reaction r: m1 m2 + m3, two alternative atom mappings Γ1(r) = {(a1, a3), (a2, a4)} (left) and Γ2(r) = {(a1, a4), (a2, a3)} (right) and corresponding atom graphs G({r}) for atom mappings Γ1, Γ2. Atom mappings indicated by shading of atoms. The reaction consumes atoms formula image (r) = {a1, a2} and produces atoms formula image (r) = {a3, a4}.
Figure 2
Figure 2
Example pathway of three reactions. Example pathway of three reactions r1, r2 and r3 and seven metabolites m1,..., m7 containing 13 atoms in total. Atom coloring indicates how the atoms are mapped in reactions. For instance, the pathway transfers atom a1 to atoms fP ({a1}) = {a5, a7, a12} and atom a8 to atom fP ({a8}) = {a13}.
Figure 3
Figure 3
Pathway examples. Examples of four pathways and associated ZO scores. Assignment of sets S and T are indicated with dashed boxes and atom mappings with atom colorings in figure. Top left: pathway P1 = {r1, r2} transfers all three atoms in S to T, achieving ZO (P1, S, T) = formula image = 1. Bottom left: pathway P2 = {r3, r4} transfers green and black atoms to T. Since grey atom is not in S, ZO (P2, S, T) = formula image. Top right: branching pathway P3 = {r5, r6, r7, r8} transfers all atoms in S, resulting in ZO (P3, S, T) = 1. Bottom right: pathway P4 = {r9, r10} transfers the only target atom from S, giving ZO(P4, S, T) = 1. However, two atoms of S are not transferred to T.
Figure 4
Figure 4
Reduction of Minimum-Set-Cover to Find-Minimal-Pathway. Left: a minimal set cover instance with formula image = {s1, ..., s6} (circles) and subset collection formula image = {C1, C2, C3, C4}. Right: Find-Minimal-Pathway instance corresponding to the set cover instance with 12 atoms and four reactions. Arrows denote mapping of atoms Γ over reactions. In particular, mappings shown with similarly dashed arrow lines belong to the same reaction. Source and target atom sets indicated with S and T.
Figure 5
Figure 5
Reaction and metabolite junctions. Top left: atoms in metabolites m1 and m2 are transferred to atoms in metabolite m5 via two reaction paths p1 = (r1, r3) and p2 = (r2, r3) (indicated by dashed arrows). The paths merge in reaction r3. Pathway P consisting of reactions r1, r2, r3 has ZO (P, S, T) = 1, when atoms of {m1, m2} and {m5} comprise the source and target sets S and T, respectively. Top right: pathway P' = {r1, r2, r3} achieves ZO (P', S, T) = 1 assuming source and target atoms to be all atoms in {m1, m2} and {m4}, respectively. The two reaction paths transferring the atoms, formula image = (r1, r3) and formula image = (r2, r3) merge in metabolite m3. Subsequently, atoms from m1 and m2 are never transferred to the same instance of metabolite m3 via these paths. Bottom left and right: atom graph representations of pathways P (left) and P' (right). Hollow circles denote atoms which can originate from atoms in metabolites m1 or m2.
Figure 6
Figure 6
ReTrace example run. Example ReTrace run for query m1 m10 with k = 3 in a database of 9 reactions and 10 metabolites. Atoms numbered from top to bottom as shown in figure. Dashed arrows indicate edges connecting vΔ and vU to atom nodes. Otherwise atom graph edges are not drawn; instead, arrows indicate substrates and products in reactions and atoms are mapped in linear fashion. For example, in reaction r9, atom nodes v7,1, v8,1 and v8,2 are connected to nodes v9,1, v9,2 and v9,3, respectively. At first, U = {m10,1, m10,2, m10,3} are the unresolved nodes. Top: algorithm state after first call to Procedure FindPath. The three shortest atom paths found are formula image = (vΔ, v1,1, v8,1, v9,2, v10,2, vU), formula image = (vΔ, v1,2, v8,2, v9,3, v10,3, vU) and formula image = (vΔ, v1,2, v4,2, v7,1, v9,1, v10,1, vU), with path length ties broken arbitrarily. Choosing to process formula image first, the reaction set corresponding to the atom path is P' = {r3, r8, r9}. Tracing back from vU, ReTrace finds that v10,2 and v10,3 can be traced back to vΔ, while v7,1 is added to U. Procedure FindPath is then called recursively. Bottom: algorithm state after second call to Procedure FindPath. Edges to vU are updated to reflect U = {v7,1}. Shortest paths from vΔ to vU are computed. However, only two paths are found: formula image = (vΔ, v1,2, v4,2, v7,1, vU) and formula image = (vΔ, v1,2, v3,1, v7,1, vU). Choosing arbitrarily formula image to process next, ReTrace finds out that the reaction set P' = {r2, r6} resolves the remaining atom in U and a complete pathway {r2, r3, r6, r8, r9} has been discovered.
Figure 7
Figure 7
Excerpt from ReTrace result page. Excerpt from a html result page showing the first pathway found for the query from erythrose 4-phosphate (E4P) and phosphoenolpyruvate (PEP) to phenylalanine (Phe). Green circles in molecule structures indicate atoms in sources that the pathway transfers to target atoms. Additionally, the ZO score (Z) and the composite map of this pathway are shown.
Figure 8
Figure 8
Result pathway diagram. Diagram of a result pathway for a query from erythrose 4-phosphate (E4P) and phosphoenolpyruvate (PEP) to phenylalanine (Phe). Source and target metabolites are drawn in green and yellow, respectively. For clarity, pathway has been split into two parts, with 5-O-(1-Carboxyvinyl)-3-phosphoshikimate repeated in both parts.
Figure 9
Figure 9
Component sizes and numbers in atom graph. Component size vs. the number of components in the atom graph induced by 7781 KEGG reactions. Components of carbon, nitrogen and phosphorus atoms shown separately. Both X- and Y-axes are shown in log-scale.
Figure 10
Figure 10
Pairwise shortest distances in atom graph. Pairwise distances in three subgraphs corresponding to the carbon, nitrogen and phosphorus specific mappings in the atom graph. Y-axis shown in log-scale.
Figure 11
Figure 11
Distances in atom graph from Acetyl-CoA. Left: Structure and atom numbering of acetyl-CoA. Right: distances in the atom graph from acetyl-CoA carbon atoms 3, 7, 13, 41, 49 and 50. Acetyl carbons 49 and 50 display significantly shorter graph distances compared to other carbons.
Figure 12
Figure 12
ReTrace running time and number of pathways found. Total running time and the number of pathways found in pairwise pathway queries between 13 metabolites. X-axis shows the number of shortest paths searched at each search level. Each point represents averages over 240 pairwise pathway queries.
Figure 13
Figure 13
Metabolite-specific computation time. Total computation time shown separately for each target metabolite. X-axis shows the number of shortest paths searched at the first and second search level. Each point represents queries from 12 metabolites to the target metabolite.
Figure 14
Figure 14
Number of pathways found. Number of pathways found on the average for queries where each metabolite in turn was considered as the source (Y-axis) and target (X-axis). Each point corresponds to averages over 12 results.
Figure 15
Figure 15
Pathway sizes. The average result pathway sizes for queries where each metabolite in turn was considered as the source (Y-axis) and target (X-axis). Each point corresponds to averages over 12 results.
Figure 16
Figure 16
ZO score distribution in pathways found. Left: Distribution of ZO score in pathways found for query glucose → IMP. Right: Distribution of pathway sizes. Green bars show the distribution of complete (ZO = 1) pathways.
Figure 17
Figure 17
Representative result pathway for query glucose → IMP. A representative result pathway for the query glucose → IMP which utilizes reactions commonly used in other result pathways. Glucose and IMP are color-coded green and yellow, respectively.

Similar articles

Cited by

References

    1. Feist AM, Herrgård MJ, Thiele I, Reed JL, Palsson BO. Reconstruction of biochemical networks in microorganisms. Nat Rev Microbiol. 2009;7(2):129–143. - PMC - PubMed
    1. Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, Itoh M, Katayama T, Kawashima S, Okuda S, Tokimatsu T, Yamanishi Y. KEGG for linking genomes to life and the environment. Nucleic Acids Res. 2008;36:D480–D484. - PMC - PubMed
    1. Caspi R, Foerster H, Fulcher C, Kaipa P, Krummenacker M, Latendresse M, Paley S, Rhee S, Shearer A, Tissier C, Walk T, Zhang P, Karp P. The MetaCyc Database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases. Nucleic Acids Res. 2008;36:D623–D631. - PMC - PubMed
    1. Lee DS, Burd H, Liu J, Almaas E, Wiest O, Barabási AL, Oltvai ZN, Kapatral V. Comparative genome-scale metabolic reconstruction and flux balance analysis of multiple Staphylococcus aureus genomes identify novel anti-microbial drug targets. J Bacteriol. 2009;191(12):4015–4024. - PMC - PubMed
    1. Blank LM, Lehmbeck F, Sauer U. Metabolic-flux and network analysis in fourteen hemiascomycetous yeasts. FEMS Yeast Research. 2005;5:545–558. - PubMed

Publication types

LinkOut - more resources