Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2002;3(12):RESEARCH0087.
doi: 10.1186/gb-2002-3-12-research0087. Epub 2002 Dec 20.

Computational analysis of core promoters in the Drosophila genome

Affiliations
Comparative Study

Computational analysis of core promoters in the Drosophila genome

Uwe Ohler et al. Genome Biol. 2002.

Abstract

Background: The core promoter, a region of about 100 base-pairs flanking the transcription start site (TSS), serves as the recognition site for the basal transcription apparatus. Drosophila TSSs have generally been mapped by individual experiments; the low number of accurately mapped TSSs has limited analysis of promoter sequence motifs and the training of computational prediction tools.

Results: We identified TSS candidates for about 2,000 Drosophila genes by aligning 5' expressed sequence tags (ESTs) from cap-trapped cDNA libraries to the genome, while applying stringent criteria concerning coverage and 5'-end distribution. Examination of the sequences flanking these TSSs revealed the presence of well-known core promoter motifs such as the TATA box, the initiator and the downstream promoter element (DPE). We also define, and assess the distribution of, several new motifs prevalent in core promoters, including what appears to be a variant DPE motif. Among the prevalent motifs is the DNA-replication-related element DRE, recently shown to be part of the recognition site for the TBP-related factor TRF2. Our TSS set was then used to retrain the computational promoter predictor McPromoter, allowing us to improve the recognition performance to over 50% sensitivity and 40% specificity. We compare these computational results to promoter prediction in vertebrates.

Conclusions: There are relatively few recognizable binding sites for previously known general transcription factors in Drosophila core promoters. However, we identified several new motifs enriched in promoter regions. We were also able to significantly improve the performance of computational TSS prediction in Drosophila.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Positional distributions of the occurrence of the 10 most significant motifs relative to the putative transcription start site, as determined by MEME. The positions of base 1 of the motifs as given in the pictograms of Table 2 were binned in 5-bp intervals. (The numerical values plotted here are given in Supplementary Table 4 of the additional data files.)
Figure 2
Figure 2
Genomic distance between the predicted TSS and the beginning of the ORF for protein-coding genes with annotated 5' UTRs on chromosome arm 2R of D. melanogaster.

References

    1. Latchman DS. Gene Regulation - A Eukaryotic Perspective. 4. Cheltenham, UK: Nelson Thornes; 2002.
    1. Lemon B, Tjian R. Orchestrated response: a symphony of transcription factors for gene control. Genes Dev. 2000;14:2551–2569. - PubMed
    1. Smale ST. Core promoters: active contributors to combinatorial gene expression. Genes Dev. 2001;15:2503–2508. - PubMed
    1. Butler JEF, Kadonaga JT. The RNA polymerase II core promoter: a key component in the regulation of gene expression. Genes Dev. 2002;16:2583–2592. - PubMed
    1. Kutach AK, Kadonaga JT. The downstream promoter element DPE appears to be as widely used as the TATA box in Drosophila core promoters. Mol Cell Biol. 2000;20:4754–4764. - PMC - PubMed

Publication types

LinkOut - more resources