MMG: a probabilistic tool to identify submodules of metabolic pathways
- PMID: 18292114
- DOI: 10.1093/bioinformatics/btn066
Abstract
Motivation: A fundamental task in systems biology is the identification of groups of genes that are involved in the cellular response to particular signals. At its simplest level, this often reduces to identifying biological quantities (mRNA abundance, enzyme concentrations, etc.) which are differentially expressed in two different conditions. Popular approaches involve using t-test statistics, based on modelling the data as arising from a mixture distribution. A common assumption of these approaches is that the data are independent and identically distributed; however, biological quantities are usually related through a complex (weighted) network of interactions, and often the more pertinent question is which subnetworks are differentially expressed, rather than which genes. Furthermore, in many interesting cases (such as high-throughput proteomics and metabolomics), only very partial observations are available, resulting in the need for efficient imputation techniques.
Results: We introduce Mixture Model on Graphs (MMG), a novel probabilistic model to identify differentially expressed submodules of biological networks and pathways. The method can easily incorporate information about weights in the network, is robust against missing data and can be easily generalized to directed networks. We propose an efficient sampling strategy to infer posterior probabilities of differential expression, as well as posterior probabilities over the model parameters. We assess our method on artificial data, demonstrating significant improvements over standard mixture model clustering. Applying the model to quantitative high-throughput proteomic data leads to the identification of biologically significant subnetworks, as well as the prediction of the expression level of a number of enzymes, some of which are then verified experimentally.
Availability: MATLAB code is available from http://www.dcs.shef.ac.uk/~guido/software.html
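Illustrative sketch: the abstract does not spell out the MMG likelihood or its sampler, so the Python code below is not the authors' method. It substitutes a simple stand-in, assuming a two-class Gaussian mixture whose class labels are coupled through an Ising-style prior on a weighted undirected graph and are sampled by Gibbs sweeps with fixed parameters. The function name `gibbs_posterior`, the coupling strength `beta`, the class means and the toy network are all hypothetical. The sketch shows how a node with a missing observation can still receive a posterior probability of differential expression from its neighbours.

```python
import math
import random


def gaussian_logpdf(y, mean, sd):
    """Log density of a normal distribution N(mean, sd^2) at y."""
    return -0.5 * ((y - mean) / sd) ** 2 - math.log(sd * math.sqrt(2 * math.pi))


def gibbs_posterior(edges, obs, means=(0.0, 2.0), sd=1.0, beta=1.0,
                    n_sweeps=2000, burn_in=500, seed=0):
    """Estimate P(node is in class 1 | data) for every node by Gibbs sampling.

    edges: dict mapping (i, j) -> non-negative weight (undirected graph)
    obs:   list of observed values; None marks a missing observation
    Class 0 = unchanged, class 1 = differentially expressed (assumed labels).
    """
    rng = random.Random(seed)
    n = len(obs)
    neighbours = [[] for _ in range(n)]
    for (i, j), w in edges.items():
        neighbours[i].append((j, w))
        neighbours[j].append((i, w))

    labels = [rng.randint(0, 1) for _ in range(n)]
    counts = [0] * n
    for sweep in range(n_sweeps):
        for i in range(n):
            logp = [0.0, 0.0]
            for k in (0, 1):
                # Graph prior: weighted agreement with the neighbours' labels.
                logp[k] = beta * sum(w for j, w in neighbours[i] if labels[j] == k)
                # Likelihood term, skipped when the observation is missing.
                if obs[i] is not None:
                    logp[k] += gaussian_logpdf(obs[i], means[k], sd)
            # Clamp the log-odds to avoid overflow in exp().
            diff = max(min(logp[0] - logp[1], 700.0), -700.0)
            p1 = 1.0 / (1.0 + math.exp(diff))
            labels[i] = 1 if rng.random() < p1 else 0
        if sweep >= burn_in:
            for i in range(n):
                counts[i] += labels[i]
    return [c / (n_sweeps - burn_in) for c in counts]


# Toy network: two tightly linked triplets joined by one weak edge.
# Nodes 1 and 4 have no measurement and are inferred from their neighbours.
toy_edges = {(0, 1): 1.0, (1, 2): 1.0, (2, 3): 0.2, (3, 4): 1.0, (4, 5): 1.0}
toy_obs = [2.1, None, 1.8, 0.1, None, -0.3]
posterior = gibbs_posterior(toy_edges, toy_obs)
for node, p in enumerate(posterior):
    print(f"node {node}: P(differentially expressed) ~ {p:.2f}")
```

In this toy setting the unobserved node 1 sits between two nodes with high measurements and so leans towards the differentially expressed class, while node 4 leans the other way. A fuller treatment along the lines the abstract describes would also sample the model parameters rather than fixing them.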