Identifying biological pathways via phase decomposition and profile extraction
- PMID: 17369645
Identifying biological pathways via phase decomposition and profile extraction
Abstract
Biological processes are always carried out through large numbers of genes (and their products) and these activities are often organized into different cellular pathways: sets of genes that cooperate to finish specific biological functions. Owing to the development of microarray technology and its ability to simultaneously measure the expression of thousands of genes, effective algorithms to reveal biologically significant pathways are possible. However, some open problems such as large amount of noise in microarrays and the fact that most biological processes are overlapping and active only on partial conditions pose great challenges to researchers. In this paper, we proposed a novel approach to identify overlapping pathways via extracting partial expression profiles from coherent cliques of clusters scattered on different conditions. We firstly decompose gene expression data into highly overlapping segments and partition genes into clusters in each segment; then we organize all the resulting clusters as a cluster graph and search coherent cliques of clusters; finally we extract expression profiles from coherent cliques and shape biological pathways as genes consistent with these profiles. We compare our algorithm with several recent models and the experimental results justify the superiorities of our approach: robustly identifying overlapping pathways in arbitrary set of conditions and consequently discovering more biologically significant pathways in terms of enrichment of gene functions.
Similar articles
-
Quantitative inference of dynamic regulatory pathways via microarray data.BMC Bioinformatics. 2005 Mar 7;6:44. doi: 10.1186/1471-2105-6-44. BMC Bioinformatics. 2005. PMID: 15748298 Free PMC article.
-
A novel approach for discovering overlapping clusters in gene expression data.IEEE Trans Biomed Eng. 2009 Jul;56(7):1803-9. doi: 10.1109/TBME.2009.2015055. Epub 2009 Feb 20. IEEE Trans Biomed Eng. 2009. PMID: 19237334
-
Analysis of a Gibbs sampler method for model-based clustering of gene expression data.Bioinformatics. 2008 Jan 15;24(2):176-83. doi: 10.1093/bioinformatics/btm562. Epub 2007 Nov 22. Bioinformatics. 2008. PMID: 18033794
-
Computational methods for the identification of differential and coordinated gene expression.Hum Mol Genet. 1999;8(10):1821-32. doi: 10.1093/hmg/8.10.1821. Hum Mol Genet. 1999. PMID: 10469833 Review.
-
Tools for interpreting large-scale protein profiling in microbiology.J Dent Res. 2008 Nov;87(11):1004-15. doi: 10.1177/154405910808701113. J Dent Res. 2008. PMID: 18946006 Free PMC article. Review.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources