H-CORE: enabling genome-scale Bayesian analysis of biological systems without prior knowledge
- PMID: 17005318
- DOI: 10.1016/j.biosystems.2006.08.004
H-CORE: enabling genome-scale Bayesian analysis of biological systems without prior knowledge
Abstract
The Bayesian network is a popular tool for describing relationships between data entities by representing probabilistic (in)dependencies with a directed acyclic graph (DAG) structure. Relationships have been inferred between biological entities using the Bayesian network model with high-throughput data from biological systems in diverse fields. However, the scalability of those approaches is seriously restricted because of the huge search space for finding an optimal DAG structure in the process of Bayesian network learning. For this reason, most previous approaches limit the number of target entities or use additional knowledge to restrict the search space. In this paper, we use the hierarchical clustering and order restriction (H-CORE) method for the learning of large Bayesian networks by clustering entities and restricting edge directions between those clusters, with the aim of overcoming the scalability problem and thus making it possible to perform genome-scale Bayesian network analysis without additional biological knowledge. We use simulations to show that H-CORE is much faster than the widely used sparse candidate method, whilst being of comparable quality. We have also applied H-CORE to retrieving gene-to-gene relationships in a biological system (The 'Rosetta compendium'). By evaluating learned information through literature mining, we demonstrate that H-CORE enables the genome-scale Bayesian analysis of biological systems without any prior knowledge.
Similar articles
-
Clustering microarray gene expression data using weighted Chinese restaurant process.Bioinformatics. 2006 Aug 15;22(16):1988-97. doi: 10.1093/bioinformatics/btl284. Epub 2006 Jun 9. Bioinformatics. 2006. PMID: 16766561
-
A hybrid Bayesian network learning method for constructing gene networks.Comput Biol Chem. 2007 Oct;31(5-6):361-72. doi: 10.1016/j.compbiolchem.2007.08.005. Epub 2007 Aug 19. Comput Biol Chem. 2007. PMID: 17889617
-
EXAMINE: a computational approach to reconstructing gene regulatory networks.Biosystems. 2005 Aug;81(2):125-36. doi: 10.1016/j.biosystems.2005.02.007. Biosystems. 2005. PMID: 15951103
-
Bayesian methods in bioinformatics and computational systems biology.Brief Bioinform. 2007 Mar;8(2):109-16. doi: 10.1093/bib/bbm007. Epub 2007 Apr 12. Brief Bioinform. 2007. PMID: 17430978 Review.
-
Graph-based methods for analysing networks in cell biology.Brief Bioinform. 2006 Sep;7(3):243-55. doi: 10.1093/bib/bbl022. Epub 2006 Jul 30. Brief Bioinform. 2006. PMID: 16880171 Review.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources