Computational discovery of transcriptional regulatory rules
- PMID: 16204087
- DOI: 10.1093/bioinformatics/bti1117
Computational discovery of transcriptional regulatory rules
Abstract
Motivation: Even in a simple organism like yeast Saccharomyces cerevisiae, transcription is an extremely complex process. The expression of sets of genes can be turned on or off by the binding of specific transcription factors to the promoter regions of genes. Experimental and computational approaches have been proposed to establish mappings of DNA-binding locations of transcription factors. However, although location data obtained from experimental methods are noisy owing to imperfections in the measuring methods, computational approaches suffer from over-prediction problems owing to the short length of the sequence motifs bound by the transcription factors. Also, these interactions are usually environment-dependent: many regulators only bind to the promoter region of genes under specific environmental conditions. Even more, the presence of regulators at a promoter region indicates binding but not necessarily function: the regulator may act positively, negatively or not act at all. Therefore, identifying true and functional interactions between transcription factors and genes in specific environment conditions and describing the relationship between them are still open problems.
Results: We developed a method that combines expression data with genomic location information to discover (1) relevant transcription factors from the set of potential transcription factors of a target gene; and (2) the relationship between the expression behavior of a target gene and that of its relevant transcription factors. Our method is based on rule induction, a machine learning technique that can efficiently deal with noisy domains. When applied to genomic location data with a confidence criterion relaxed to P-value = 0.005, and three different expression datasets of yeast S.cerevisiae, we obtained a set of regulatory rules describing the relationship between the expression behavior of a specific target gene and that of its relevant transcription factors. The resulting rules provide strong evidence of true positive gene-regulator interactions, as well as of protein-protein interactions that could serve to identify transcription complexes.
Availability: Supplementary files are available from http://www.jaist.ac.jp/~h-pham/regulatory-rules
Similar articles
-
Predicting genetic regulatory response using classification.Bioinformatics. 2004 Aug 4;20 Suppl 1:i232-40. doi: 10.1093/bioinformatics/bth923. Bioinformatics. 2004. PMID: 15262804
-
Computational identification of combinatorial regulation and transcription factor binding sites.Biotechnol Bioeng. 2007 Aug 15;97(6):1594-602. doi: 10.1002/bit.21354. Biotechnol Bioeng. 2007. PMID: 17252601
-
A graph-based approach to systematically reconstruct human transcriptional regulatory modules.Bioinformatics. 2007 Jul 1;23(13):i577-86. doi: 10.1093/bioinformatics/btm227. Bioinformatics. 2007. PMID: 17646346
-
Tag-based approaches for transcriptome research and genome annotation.Nat Methods. 2005 Jul;2(7):495-502. doi: 10.1038/nmeth768. Nat Methods. 2005. PMID: 15973418 Review.
-
[Specificiety of DNA-protein interactions within transcription complexes of Escherichia coli].Mol Biol (Mosk). 2004 Sep-Oct;38(5):786-97. Mol Biol (Mosk). 2004. PMID: 15554182 Review. Russian.
Cited by
-
Characterizing nucleosome dynamics from genomic and epigenetic information using rule induction learning.BMC Genomics. 2009 Dec 3;10 Suppl 3(Suppl 3):S27. doi: 10.1186/1471-2164-10-S3-S27. BMC Genomics. 2009. PMID: 19958491 Free PMC article.
-
DNA motif elucidation using belief propagation.Nucleic Acids Res. 2013 Sep;41(16):e153. doi: 10.1093/nar/gkt574. Epub 2013 Jun 29. Nucleic Acids Res. 2013. PMID: 23814189 Free PMC article.
-
Discovering protein-DNA binding sequence patterns using association rule mining.Nucleic Acids Res. 2010 Oct;38(19):6324-37. doi: 10.1093/nar/gkq500. Epub 2010 Jun 6. Nucleic Acids Res. 2010. PMID: 20529874 Free PMC article.
-
Inferring Transcriptional Interactions by the Optimal Integration of ChIP-chip and Knock-out Data.Bioinform Biol Insights. 2009 Oct 21;3:129-40. doi: 10.4137/bbi.s3445. Bioinform Biol Insights. 2009. PMID: 20140075 Free PMC article.
-
High-resolution analysis of condition-specific regulatory modules in Saccharomyces cerevisiae.Genome Biol. 2008 Jan 3;9(1):R2. doi: 10.1186/gb-2008-9-1-r2. Genome Biol. 2008. PMID: 18171483 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Molecular Biology Databases