Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jan 4:5:10.
doi: 10.1186/1748-7188-5-10.

A markov classification model for metabolic pathways

Affiliations

A markov classification model for metabolic pathways

Timothy Hancock et al. Algorithms Mol Biol. .

Abstract

Background: This paper considers the problem of identifying pathways through metabolic networks that relate to a specific biological response. Our proposed model, HME3M, first identifies frequently traversed network paths using a Markov mixture model. Then by employing a hierarchical mixture of experts, separate classifiers are built using information specific to each path and combined into an ensemble prediction for the response.

Results: We compared the performance of HME3M with logistic regression and support vector machines (SVM) for both simulated pathways and on two metabolic networks, glycolysis and the pentose phosphate pathway for Arabidopsis thaliana. We use AltGenExpress microarray data and focus on the pathway differences in the developmental stages and stress responses of Arabidopsis. The results clearly show that HME3M outperformed the comparison methods in the presence of increasing network complexity and pathway noise. Furthermore an analysis of the paths identified by HME3M for each metabolic network confirmed known biological responses of Arabidopsis.

Conclusions: This paper clearly shows HME3M to be an accurate and robust method for classifying metabolic pathways. HME3M is shown to outperform all comparison methods and further is capable of identifying known biologically active pathways within microarray data.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Example network.
Figure 2
Figure 2
Simulated network diagrams.
Figure 3
Figure 3
Arabidopsis thaliana glycolysis pathway from Alpha-D-Glucose to Pyruvate. For visual simplicity, we show only a single edge connecting each compound; however in the actual network there is a separate edge for each gene label displayed.
Figure 4
Figure 4
Arabidopsis thaliana Oxidative Pentose Phosphate Cycle. For visual simplicity, we show only a single edge connecting each compound; however in the actual network there is a separate edge for each gene label displayed.
Figure 5
Figure 5
Performance results for the Glycolysis pathway. Inverse cross-validated Correct Classification Rates (CCR) for all models for the Glycolysis pathway for Arabidopsis.
Figure 6
Figure 6
ROC curve of all paths for the optimal model (M = 4) for the glycolysis pathway.
Figure 7
Figure 7
Transition probabilities for the most expressed glycolysis path (m = 3) that separates flowers from rosette for Arabidopsis. Training set CCR = 0.818, AUC = 0.752.
Figure 8
Figure 8
Arabidopsis Performance results for the pentose phosphate pathway for classifying oxidative stress pathways.
Figure 9
Figure 9
ROC curve of all paths for the optimal model (M = 2) for the pentose phosphate pathway for classifying oxidative stress pathways.
Figure 10
Figure 10
Transition probabilities for the most expressed pentose phosphate path (m = 2) for Arabidopsis under oxidative stress. The numbers in brackets represent the probability of each edge θm for m = 2.

Similar articles

Cited by

References

    1. Kanehisa M, Goto S. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 2000;28:27–30. doi: 10.1093/nar/28.1.27. - DOI - PMC - PubMed
    1. Brazma A, Parkinson H, Sarkans U, Shojatalab M, Vilo J, Abeygunawardena N, Holloway E, Kapushesky M, Kemmeren P, Lara GG, Oezcimen A, Rocca-Serra P, Sansone S. ArrayExpress-a public repository for microarray gene expression data at the EBI. Nucl Acids Res. 2003;31:68–71. doi: 10.1093/nar/gkg091. - DOI - PMC - PubMed
    1. Pang H, Lin A, Holford M, Enerson B, Lu B, Lawton MP, Floyd E, Zhao H. Pathway analysis using random forests classification and regression. Bioinformatics. 2006;22(16):2028–36. doi: 10.1093/bioinformatics/btl344. - DOI - PubMed
    1. Pireddu L, Poulin B, Szafron D, Lu P, Wishart DS. Pathway Analyst - Automated Metabolic Pathway Prediction. Proceedings of the 2005 IEEE Symposium on Computational Intelligence. 2005. http://metabolomics.ca/News/publications/2005cibcb-path.pdf
    1. Jordan M. Learning in Graphical Models. Norwell, MD: Kluwer Academic Publishers; 1998.

LinkOut - more resources