Detection of prokaryotic promoters from the genomic distribution of hexanucleotide pairs
- PMID: 17014715
- PMCID: PMC1615881
- DOI: 10.1186/1471-2105-7-423
Abstract
Background: In bacteria, sigma factors and other transcriptional regulatory proteins recognize DNA patterns upstream of their target genes and interact with RNA polymerase to control transcription. As a consequence of evolution, DNA sequences recognized by transcription factors are thought to be enriched in intergenic regions (IRs) and depleted from coding regions of prokaryotic genomes.
Results: In this work, we report that the genomic distribution of transcription factor binding sites is biased towards IRs, and that this bias is conserved among bacterial species. We take advantage of this observation to develop an algorithm that efficiently identifies promoter boxes through a distribution-dependent approach rather than direct sequence comparison. This strategy, which can easily be combined with other methodologies, allowed the identification of promoter sequences in ten species and can be used with any annotated bacterial genome, with results that rival those of current methodologies. Experimental validation of predicted promoters also supports our approach.
Conclusion: Considering that complete genomic sequences of over 1000 bacteria will soon be available and that little transcriptional information is available for most of them, our algorithm constitutes a promising tool for the prediction of promoter sequences. Importantly, our methodology could also be adapted to identify DNA sequences recognized by other regulatory proteins.
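To make the distribution-dependent idea in the Results concrete, the following minimal sketch scores each hexanucleotide by how much more frequent it is in intergenic regions than in coding regions. It is an illustration only, not the authors' algorithm: the function names (`count_kmers`, `intergenic_bias`), the log2 frequency ratio, the pseudocount smoothing, and the treatment of single hexamers rather than hexanucleotide pairs are all assumptions made for this example.

```python
from collections import Counter
from math import log2


def count_kmers(seqs, k=6):
    """Count overlapping k-mers across sequences, skipping any k-mer
    that contains a character other than A, C, G, or T."""
    counts = Counter()
    for seq in seqs:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            if set(kmer) <= set("ACGT"):
                counts[kmer] += 1
    return counts


def intergenic_bias(intergenic, coding, k=6, pseudocount=1.0):
    """Return a log2 ratio of smoothed k-mer frequencies
    (intergenic over coding) for every k-mer seen in either set.

    Hypothetical scoring scheme: positive values mean the k-mer is
    relatively enriched in intergenic regions.
    """
    ir = count_kmers(intergenic, k)
    cds = count_kmers(coding, k)
    ir_total = sum(ir.values())
    cds_total = sum(cds.values())
    n_kmers = 4 ** k  # number of possible k-mers, used for smoothing
    scores = {}
    for kmer in set(ir) | set(cds):
        f_ir = (ir[kmer] + pseudocount) / (ir_total + pseudocount * n_kmers)
        f_cds = (cds[kmer] + pseudocount) / (cds_total + pseudocount * n_kmers)
        scores[kmer] = log2(f_ir / f_cds)
    return scores


if __name__ == "__main__":
    # Toy sequences, invented for illustration; real input would be the
    # intergenic and coding regions of an annotated genome.
    toy_intergenic = ["TTGACAAAATATAAT"]
    toy_coding = ["ATGGCCGGCGGCTAA"]
    scores = intergenic_bias(toy_intergenic, toy_coding)
    for kmer, score in sorted(scores.items(), key=lambda kv: -kv[1])[:5]:
        print(kmer, round(score, 2))
```

In practice, hexamers with the highest scores would only be candidates; a method along the lines the abstract describes would additionally consider pairs of hexanucleotides and their genomic distribution, which this sketch does not attempt.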