Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 1989 Feb;86(4):1183-7.
doi: 10.1073/pnas.86.4.1183.

Identifying protein-binding sites from unaligned DNA fragments

Affiliations

Identifying protein-binding sites from unaligned DNA fragments

G D Stormo et al. Proc Natl Acad Sci U S A. 1989 Feb.

Abstract

The ability to determine important features within DNA sequences from the sequences alone is becoming essential as large-scale sequencing projects are being undertaken. We present a method that can be applied to the problem of identifying the recognition pattern for a DNA-binding protein given only a collection of sequenced DNA fragments, each known to contain somewhere within it a binding site for that protein. Information about the position or orientation of the binding sites within those fragments is not needed. The method compares the "information content" of a large number of possible binding site alignments to arrive at a matrix representation of the binding site pattern. The specificity of the protein is represented as a matrix, rather than a consensus sequence, allowing patterns that are typical of regulatory protein-binding sites to be identified. The reliability of the method improves as the number of sequences increases, but the time required increases only linearly with the number of sequences. An example, using known cAMP receptor protein-binding sites, illustrates the method.

PubMed Disclaimer

References

    1. J Bacteriol. 1982 Apr;150(1):312-8 - PubMed
    1. Annu Rev Biophys Biophys Chem. 1988;17:241-63 - PubMed
    1. Nucleic Acids Res. 1983 Apr 25;11(8):2237-55 - PubMed
    1. Proc Natl Acad Sci U S A. 1983 Nov;80(22):6785-9 - PubMed
    1. Nucleic Acids Res. 1984 Jan 11;12(1 Pt 2):505-19 - PubMed

Publication types

LinkOut - more resources