. 2012 Mar 21;13 Suppl 3(Suppl 3):S2.

doi: 10.1186/1471-2105-13-S3-S2.

Metabolic network alignment in large scale by network compression

Ferhat Ay¹, Michael Dang, Tamer Kahveci

Affiliations

PMID: 22536900
PMCID: PMC3402922
DOI: 10.1186/1471-2105-13-S3-S2

Metabolic network alignment in large scale by network compression

Ferhat Ay et al. BMC Bioinformatics. 2012.

. 2012 Mar 21;13 Suppl 3(Suppl 3):S2.

doi: 10.1186/1471-2105-13-S3-S2.

Authors

Ferhat Ay¹, Michael Dang, Tamer Kahveci

Affiliation

¹ Computer and Information Science and Engineering, University of Florida, Gainesville, FL 32611, USA. ferhatay@uw.edu

PMID: 22536900
PMCID: PMC3402922
DOI: 10.1186/1471-2105-13-S3-S2

Abstract

Metabolic network alignment is a system scale comparative analysis that discovers important similarities and differences across different metabolisms and organisms. Although the problem of aligning metabolic networks has been considered in the past, the computational complexity of the existing solutions has so far limited their use to moderately sized networks. In this paper, we address the problem of aligning two metabolic networks, particularly when both of them are too large to be dealt with using existing methods. We develop a generic framework that can significantly improve the scale of the networks that can be aligned in practical time. Our framework has three major phases, namely the compression phase, the alignment phase and the refinement phase. For the first phase, we develop an algorithm which transforms the given networks to a compressed domain where they are summarized using fewer nodes, termed supernodes, and interactions. In the second phase, we carry out the alignment in the compressed domain using an existing network alignment method as our base algorithm. This alignment results in supernode mappings in the compressed domain, each of which are smaller instances of network alignment problem. In the third phase, we solve each of the instances using the base alignment algorithm to refine the alignment results. We provide a user defined parameter to control the number of compression levels which generally determines the tradeoff between the quality of the alignment versus how fast the algorithm runs. Our experiments on the networks from KEGG pathway database demonstrate that the compression method we propose reduces the sizes of metabolic networks by almost half at each compression level which provides an expected speedup of more than an order of magnitude. We also observe that the alignments obtained by only one level of compression capture the original alignment results with high accuracy. Together, these suggest that our framework results in alignments that are comparable to existing algorithms and can do this with practical resource utilization for large scale networks that existing algorithms could not handle. As an example of our method's performance in practice, the alignment of organism-wide metabolic networks of human (1615 reactions) and mouse (1600 reactions) was performed under three minutes by only using a single level of compression.

PubMed Disclaimer

Figures

**Figure 2**
**Shift of out-degree distributions from power law to uniform.** Changes in the out-degree distributions of ten organism-wide metabolic networks with increasing levels of compression. We calculate the frequencies of each out-degree in the range [2,40] for c ∈ {0, 1, 2, 3, 4} and plot them together for each of the ten organisms in our dataset. Out-degree distributions for organism-wide metabolic networks of (a) *Arabidopsis thaliana* (thale cress), (b) *Caenorhabditis elegans* (nematode), (c) *Drosophila melanogaster* (fruit fly), (d) *Escherichia coli K-12 MG1655*, (e) *Homo sapiens* (human), (f) *Mus musculus* (mouse), (g) *Pseudomonas aeruginosa PAO1*, (h) *Rattus norvegicus* (rat), (i) *Staphylococcus aureus COL* (MRSA), (j) *Saccharomyces cerevisiae* (budding yeast).

**Figure 3**
**Resource utilization of our framework.** The average (a) running time and (b) memory utilization of our framework when each query network in our large scale dataset is aligned with all the networks (including itself) in the same dataset. x-axis is the query size which is calculated as the product of the sizes (i.e., number of reactions) of the metabolic networks aligned. c = 0 denote the alignments performed with no compression. c ∈ {1, 2, 3} denote the results of our framework that compresses both of the query networks by c levels before aligning them.

**Figure 4**
**Gain/Loss in running time.** Gain/Loss in running time of alignment by using our framework with respect to the base alignment method (x-axis) versus the ratio of the number of all possible subnetwork mappings in compressed domain to this number in the original domain. The blue vertical line shows when the two methods take exact same amount of time or when both methods take very short amount of time in the case of small query networks. Points on the right (left) handside of this line means gain (loss) in the running time. The dashed line is our decision criteria for predicting whether there will be gain or loss before doing the alignment.

**Figure 5**
**One compression step of the *MDS* method.** Small circles represent reactions and big circles represent supernodes that result from earlier steps of compression. A solid arrow represents an edge between two non-compressed nodes in the current compression level. A dashed arrow denotes an edge between a supernode and another node in the network. While calculating the degrees of the non-compressed nodes, only the solid arrows are taken into account. (a) The state of network P during compression level x before the ith intermediate step (i.e., $P_{i - 1}^{x}$ ). The node with the minimum degree is denoted with *v_a*and its first neighbor is denoted with *v_b*. (b) The state of this network after the ith compression step (i.e., $P_{i}^{x}$ ). We denote the node resulted from the compression at this step with *v_ab*.

See this image and copyright information in PMC

Cited by

Pan-phylum Comparison of Nematode Metabolic Potential.
Tyagi R, Rosa BA, Lewis WG, Mitreva M. Tyagi R, et al. PLoS Negl Trop Dis. 2015 May 22;9(5):e0003788. doi: 10.1371/journal.pntd.0003788. eCollection 2015 May. PLoS Negl Trop Dis. 2015. PMID: 26000881 Free PMC article.
Aligning Metabolic Pathways Exploiting Binary Relation of Reactions.
Huang Y, Zhong C, Lin HX, Huang J. Huang Y, et al. PLoS One. 2016 Dec 9;11(12):e0168044. doi: 10.1371/journal.pone.0168044. eCollection 2016. PLoS One. 2016. PMID: 27936108 Free PMC article.
CAMPways: constrained alignment framework for the comparative analysis of a pair of metabolic pathways.
Abaka G, Bıyıkoğlu T, Erten C. Abaka G, et al. Bioinformatics. 2013 Jul 1;29(13):i145-53. doi: 10.1093/bioinformatics/btt235. Bioinformatics. 2013. PMID: 23812978 Free PMC article.
Community-scale models of microbiomes: Articulating metabolic modelling and metagenome sequencing.
Cerk K, Ugalde-Salas P, Nedjad CG, Lecomte M, Muller C, Sherman DJ, Hildebrand F, Labarthe S, Frioux C. Cerk K, et al. Microb Biotechnol. 2024 Jan;17(1):e14396. doi: 10.1111/1751-7915.14396. Epub 2024 Jan 20. Microb Biotechnol. 2024. PMID: 38243750 Free PMC article. Review.
Helminth.net: expansions to Nematode.net and an introduction to Trematode.net.
Martin J, Rosa BA, Ozersky P, Hallsworth-Pepin K, Zhang X, Bhonagiri-Palsikar V, Tyagi R, Wang Q, Choi YJ, Gao X, McNulty SN, Brindley PJ, Mitreva M. Martin J, et al. Nucleic Acids Res. 2015 Jan;43(Database issue):D698-706. doi: 10.1093/nar/gku1128. Epub 2014 Nov 11. Nucleic Acids Res. 2015. PMID: 25392426 Free PMC article.

See all "Cited by" articles

References

1. Navlakha S, Schatz M, Kingsford C. Revealing biological modules via graph summarization. J Comput Biol. 2009;16(2):253–264. doi: 10.1089/cmb.2008.11TT. - DOI - PubMed
1. Segal E, Pe'er D, Regev A, Koller D, Friedman N. Learning module networks. Journal of Machine Learning Research. 2005;6:557–88.
1. Ay F, Dinh T, Thai M, Kahveci T. Dynamic modular structure of regulatory networks. IEEE International Conference on Bioinformatics and Bioengineering (BIBE) 2010. pp. 136–143.
1. Dutkowski J, Tiuryn J. Identification of functional modules from conserved ancestral protein protein interactions. Bioinformatics. 2007;23(13):i149–i158. doi: 10.1093/bioinformatics/btm194. - DOI - PubMed
1. Dandekar T, Schuster S, Snel B, Huynen M, Bork P. Pathway alignment: application to the comparative analysis of glycolytic enzymes. Biochem J. 1999;343 Pt 1:115–124. doi: 10.1042/0264-6021:3430115. - DOI - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Research Materials
- NCI CPTC Antibody Characterization Program
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Metabolic network alignment in large scale by network compression

Affiliation

Metabolic network alignment in large scale by network compression

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Research Materials

Miscellaneous