Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 1994 Dec 6;91(25):12091-5.
doi: 10.1073/pnas.91.25.12091.

Detection of conserved segments in proteins: iterative scanning of sequence databases with alignment blocks

Affiliations
Comparative Study

Detection of conserved segments in proteins: iterative scanning of sequence databases with alignment blocks

R L Tatusov et al. Proc Natl Acad Sci U S A. .

Abstract

We describe an approach to analyzing protein sequence databases that, starting from a single uncharacterized sequence or group of related sequences, generates blocks of conserved segments. The procedure involves iterative database scans with an evolving position-dependent weight matrix constructed from a coevolving set of aligned conserved segments. For each iteration, the expected distribution of matrix scores under a random model is used to set a cutoff score for the inclusion of a segment in the next iteration. This cutoff may be calculated to allow the chance inclusion of either a fixed number or a fixed proportion of false positive segments. With sufficiently high cutoff scores, the procedure converged for all alignment blocks studied, with varying numbers of iterations required. Different methods for calculating weight matrices from alignment blocks were compared. The most effective of those tested was a logarithm-of-odds, Bayesian-based approach that used prior residue probabilities calculated from a mixture of Dirichlet distributions. The procedure described was used to detect novel conserved motifs of potential biological importance.

PubMed Disclaimer

References

    1. J Mol Biol. 1983 Sep 5;169(1):15-30 - PubMed
    1. Proc Natl Acad Sci U S A. 1993 May 15;90(10):4753-7 - PubMed
    1. Proc Natl Acad Sci U S A. 1987 Jul;84(13):4355-8 - PubMed
    1. J Mol Biol. 1987 Feb 20;193(4):723-50 - PubMed
    1. Proc Natl Acad Sci U S A. 1988 Apr;85(8):2444-8 - PubMed

Publication types

LinkOut - more resources