Comparative Study
Nucleic Acids Res. 1993 Apr 11;21(7):1655-64.
doi: 10.1093/nar/21.7.1655.

Computer-assisted prediction, classification, and delimitation of protein binding sites in nucleic acids

Free PMC article
K Frech et al. Nucleic Acids Res.
Abstract

We present a method to determine the location and extent of protein binding regions in nucleic acids by computer-assisted analysis of sequence data. The program ConsIndex establishes a library of consensus descriptions based on sequence sets containing known regulatory elements. These defined consensus descriptions are used by the program ConsInspector to predict binding sites in new sequences. We show the programs to correctly determine the significant regions involved in transcriptional control of seven sequence elements. The internal profile of relative variability of individual nucleotide positions within these regions paralleled experimental profiles of biological significance. Consensus descriptions are determined by employing an anchored alignment scheme, the results of which are then evaluated by a novel method which is superior to cluster algorithms. The alignment procedure is able to include several closely related sequences without biasing the consensus description. Moreover, the algorithm detects additional elements on the basis of a moderate distance correlation and is capable of discriminating between real binding sites and false positive matches. The software is well suited to cope with the frequent phenomenon of optional elements present in a subset of functionally similar sequences, while taking maximal advantage of the existing sequence data base. Since it requires only a minimum of seven sequences for a single element, it is applicable to a wide range of binding sites.
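The general idea of deriving a consensus description from known sites and then scanning new sequences can be illustrated with a minimal sketch. This is not the ConsIndex or ConsInspector algorithm: the abstract does not give the anchored alignment scheme or the evaluation method, so the sketch swaps in a generic position-frequency profile with an entropy-based conservation weight. The function names (position_profile, scan), the threshold value, and the seven toy sequences are all hypothetical and chosen only for illustration; the sites are assumed to be pre-aligned and of equal length.

```python
from collections import Counter
from math import log2


def position_profile(aligned_sites):
    """Return, per alignment column, (base frequencies, conservation weight).

    Assumption: sites are already aligned and equal in length. The weight is
    1 - entropy/2 (2 bits is the maximum entropy for four bases), clamped to
    [0, 1]; it is a generic stand-in, not the paper's variability measure.
    """
    length = len(aligned_sites[0])
    if any(len(site) != length for site in aligned_sites):
        raise ValueError("sites must be pre-aligned to equal length")
    profile = []
    for column in zip(*aligned_sites):
        counts = Counter(column)
        total = len(column)
        freqs = {base: n / total for base, n in counts.items()}
        entropy = -sum(p * log2(p) for p in freqs.values())
        conservation = min(1.0, max(0.0, 1.0 - entropy / 2.0))
        profile.append((freqs, conservation))
    return profile


def scan(sequence, profile, threshold):
    """Return (start, score) for every window scoring at least `threshold`.

    Scores are normalised by the best achievable score, so the most common
    base at every position yields 1.0.
    """
    width = len(profile)
    max_score = sum(max(freqs.values()) * w for freqs, w in profile)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        raw = sum(freqs.get(base, 0.0) * w
                  for base, (freqs, w) in zip(window, profile))
        score = raw / max_score if max_score else 0.0
        if score >= threshold:
            hits.append((start, round(score, 3)))
    return hits


# Toy data: seven invented, pre-aligned sites (not taken from the paper).
sites = ["TATAAA", "TATAAT", "TATATA", "TATAAA",
         "TTTAAA", "TATAAG", "TATAAA"]
profile = position_profile(sites)
hits = scan("GGCTATAAAGG", profile, threshold=0.9)
```

Weighting each position by its conservation means that variable positions contribute little to the score, which loosely mirrors the abstract's point that the relative variability of individual positions carries information. A realistic implementation would also need the alignment step and a way to separate real sites from false positive matches, neither of which this sketch attempts.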

Publication types