Identification of core promoter modules in Drosophila and their application in accurate transcription start site prediction
- PMID: 17068082
- PMCID: PMC1635271
- DOI: 10.1093/nar/gkl608
Identification of core promoter modules in Drosophila and their application in accurate transcription start site prediction
Abstract
The reliable recognition of eukaryotic RNA polymerase II core promoters, and the associated transcription start sites (TSSs) of genes, has been an ongoing challenge for computational biology. High throughput experimental methods such as tiling arrays or 5' SAGE/EST sequencing have recently lead to much larger datasets of core promoters, and to the assessment that the well-known core promoter sequence elements such as the TATA box appear to be much less frequent than thought. Here, we address the co-occurrence of several previously identified core promoter sequence motifs in Drosophila melanogaster to determine frequently occurring core promoter modules. We then use this in a new strategy to model core promoters as a set of alternative submodels for different core promoter architectures reflecting these different motif modules. We show that this system improves greatly on computational promoter recognition and leads to highly accurate in silico TSS prediction. Our results indicate that at least for the case of the fruit fly, we are getting closer to an understanding of how the beginning of a gene is defined in a eukaryotic genome.
Figures



Similar articles
-
Heterogeneity of Arabidopsis core promoters revealed by high-density TSS analysis.Plant J. 2009 Oct;60(2):350-62. doi: 10.1111/j.1365-313X.2009.03958.x. Epub 2009 Jun 29. Plant J. 2009. PMID: 19563441
-
Computational detection and location of transcription start sites in mammalian genomic DNA.Genome Res. 2002 Mar;12(3):458-61. doi: 10.1101/gr.216102. Genome Res. 2002. PMID: 11875034 Free PMC article.
-
Synergy of human Pol II core promoter elements revealed by statistical sequence analysis.Bioinformatics. 2005 Apr 15;21(8):1295-300. doi: 10.1093/bioinformatics/bti172. Epub 2004 Nov 30. Bioinformatics. 2005. PMID: 15572469
-
Deep cap analysis gene expression (CAGE): genome-wide identification of promoters, quantification of their expression, and network inference.Biotechniques. 2008 Apr;44(5):627-8, 630, 632. doi: 10.2144/000112802. Biotechniques. 2008. PMID: 18474037 Review.
-
Descartes' fly: the geometry of genomic annotation.Funct Integr Genomics. 2001 Mar;1(4):241-9. doi: 10.1007/s101420000025. Funct Integr Genomics. 2001. PMID: 11793243 Review.
Cited by
-
Predictive features of gene expression variation reveal mechanistic link with differential expression.Mol Syst Biol. 2020 Aug;16(8):e9539. doi: 10.15252/msb.20209539. Mol Syst Biol. 2020. PMID: 32767663 Free PMC article.
-
Analysis of transcriptional regulation of the human miR-17-92 cluster; evidence for involvement of Pim-1.Int J Mol Sci. 2013 Jun 7;14(6):12273-96. doi: 10.3390/ijms140612273. Int J Mol Sci. 2013. PMID: 23749113 Free PMC article.
-
Characterization of genomic regulatory domains conserved across the genus Drosophila.Genome Biol Evol. 2012;4(10):1054-60. doi: 10.1093/gbe/evs089. Genome Biol Evol. 2012. PMID: 23042552 Free PMC article.
-
Generic eukaryotic core promoter prediction using structural features of DNA.Genome Res. 2008 Feb;18(2):310-23. doi: 10.1101/gr.6991408. Epub 2007 Dec 20. Genome Res. 2008. PMID: 18096745 Free PMC article.
-
Large-scale analysis of Drosophila core promoter function using synthetic promoters.Mol Syst Biol. 2022 Feb;18(2):e9816. doi: 10.15252/msb.20209816. Mol Syst Biol. 2022. PMID: 35156763 Free PMC article.
References
-
- Li H., Wang W. Dissecting the transcription networks of a cell using computational genomics. Curr. Opin. Genet. Dev. 2003;13:611–616. - PubMed
-
- Levine M., Tjian R. Transcription regulation and animal diversity. Nature. 2003;424:147–151. - PubMed
-
- Arnosti D.N. Analysis and function of transcriptional regulatory elements: insights from Drosophila. Annu. Rev. Entomol. 2003;48:579–602. - PubMed
-
- Wray G.A., Hahn M.W., Abouheif E., Balhoff J.P., Pizer M., Rockman M.V., Romano L.A. The evolution of transcriptional regulation in eukaryotes. Mol. Biol. Evol. 2003;20:1377–1419. - PubMed
-
- Smale S.T., Kadonaga J.T. The RNA polymerase II core promoter. Annu. Rev. Biochem. 2003;72:449–479. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Molecular Biology Databases
Research Materials