A mutation degree model for the identification of transcriptional regulatory elements
- PMID: 21708002
- PMCID: PMC3228546
- DOI: 10.1186/1471-2105-12-262
A mutation degree model for the identification of transcriptional regulatory elements
Abstract
Background: Current approaches for identifying transcriptional regulatory elements are mainly via the combination of two properties, the evolutionary conservation and the overrepresentation of functional elements in the promoters of co-regulated genes. Despite the development of many motif detection algorithms, the discovery of conserved motifs in a wide range of phylogenetically related promoters is still a challenge, especially for the short motifs embedded in distantly related gene promoters or very closely related promoters, or in the situation that there are not enough orthologous genes available.
Results: A mutation degree model is proposed and a new word counting method is developed for the identification of transcriptional regulatory elements from a set of co-expressed genes. The new method comprises two parts: 1) identifying overrepresented oligo-nucleotides in promoters of co-expressed genes, 2) estimating the conservation of the oligo-nucleotides in promoters of phylogenetically related genes by the mutation degree model. Compared with the performance of other algorithms, our method shows the advantages of low false positive rate and higher specificity, especially the robustness to noisy data. Applying the method to co-expressed gene sets from Arabidopsis, most of known cis-elements were successfully detected. The tool and example are available at http://mcube.nju.edu.cn/jwang/lab/soft/ocw/OCW.html.
Conclusions: The mutation degree model proposed in this paper is adapted to phylogenetic data of different qualities, and to a wide range of evolutionary distances. The new word-counting method based on this model has the advantage of better performance in detecting short sequence of cis-elements from co-expressed genes of eukaryotes and is robust to less complete phylogenetic data.
Figures



Similar articles
-
[Computational identification of transcriptional regulatory elements in Arabidopsis TCH4 promoter].Yi Chuan. 2008 May;30(5):620-6. doi: 10.3724/sp.j.1005.2008.00620. Yi Chuan. 2008. PMID: 18487153 Chinese.
-
Quantitative statistical analysis of cis-regulatory sequences in ABA/VP1- and CBF/DREB1-regulated genes of Arabidopsis.Plant Physiol. 2005 Sep;139(1):437-47. doi: 10.1104/pp.104.058412. Epub 2005 Aug 19. Plant Physiol. 2005. PMID: 16113229 Free PMC article.
-
Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae.BMC Plant Biol. 2009 Oct 20;9:126. doi: 10.1186/1471-2229-9-126. BMC Plant Biol. 2009. PMID: 19843335 Free PMC article.
-
Computational approaches to identify promoters and cis-regulatory elements in plant genomes.Plant Physiol. 2003 Jul;132(3):1162-76. doi: 10.1104/pp.102.017715. Plant Physiol. 2003. PMID: 12857799 Free PMC article. Review.
-
Computational identification of transcriptional regulatory elements in DNA sequence.Nucleic Acids Res. 2006 Jul 19;34(12):3585-98. doi: 10.1093/nar/gkl372. Print 2006. Nucleic Acids Res. 2006. PMID: 16855295 Free PMC article. Review.
Cited by
-
Cascading cis-cleavage on transcript from trans-acting siRNA-producing locus 3.Int J Mol Sci. 2013 Jul 12;14(7):14689-99. doi: 10.3390/ijms140714689. Int J Mol Sci. 2013. PMID: 23857062 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Miscellaneous