Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 1995 Jun 23;249(5):923-32.
doi: 10.1006/jmbi.1995.0349.

Predicting Pol II promoter sequences using transcription factor binding sites

Affiliations

Predicting Pol II promoter sequences using transcription factor binding sites

D S Prestridge. J Mol Biol. .

Abstract

A computer program, PROMOTER SCAN, has been developed to recognize a high percentage of Pol II promoter sequences while allowing only a small rate of false positives. A total of 167 primate Pol II promoter sequences, obtained from the Eukaryotic Promoter Database, and 999 primate non-promoter sequences, obtained from the GenBank sequence databank, were used in the analysis. Both promoter and non-promoter sequences were analyzed for the comparative density of each unique mammalian transcription factor binding site listed in the Ghosh Transcription Factor Database. The density of each of these binding sites was then used to derive a ratio of density of each transcriptional element in promoter compared to non-promoter sequences. The combined individual density ratios of all binding sites were then collectively used to build a scoring profile called the Promoter Recognition Profile. This profile, used in combination with a weighted matrix for scoring a TATA box, was then used by the PROMOTER SCAN program to test the prediction of promoter sequences and the ability of the computer program to discriminate them from non-promoter sequences. When the promoter cutoff score was set so that 70% of promoters were recognized correctly by the program, a false positive rate of about 1/5600 bases was observed in the non-promoter sequence set. PROMOTER SCAN is now being developed for public distribution.

PubMed Disclaimer

LinkOut - more resources