Comparative Study
Protein Sci. 1996 Oct;5(10):1991-9.
doi: 10.1002/pro.5560051005.

Construction and analysis of a profile library characterizing groups of structurally known proteins

A Ogiwara et al. Protein Sci. 1996 Oct.

Abstract

A new sequence motif library StrProf was constructed characterizing the groups of related proteins in the PDB three-dimensional structure database. For a representative member of each protein family, which was identified by cross-referencing the PDB with the PIR superfamily classification, a group of related sequences was collected by the BLAST search against the nonredundant protein sequence database. For every group, the motifs were identified automatically according to the criteria of conservation and uniqueness of pentapeptide patterns and with a dual dynamic programming algorithm. In the StrProf library, motifs are represented by profile matrices rather than consensus patterns to allow more flexible search capabilities. Another dynamic programming algorithm was then developed to search this motif library. When the computationally derived StrProf was compared with PROSITE, which is a manually derived motif library in the best consensus pattern representation, the numbers of identified patterns were comparable. StrProf missed about one third of the PROSITE motifs, but there were also new motifs lacking in PROSITE. The new library was incorporated in SMART (Sequence Motif Analysis and Retrieval Tool), a computer tool designed to help search and annotate biologically important sites in an unknown protein sequence. The client program is available free of charge through the Internet.
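
To make the profile-matrix idea concrete, here is a minimal sketch of how a motif stored as a profile could be scored against a query sequence with dynamic programming. It is not the StrProf or SMART algorithm, whose details the abstract does not give. Everything in it is an assumption made for illustration: the function names `build_profile` and `best_match`, the uniform background frequency, the pseudocount, the linear gap penalty, and the choice to align the whole profile against any stretch of the sequence.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_profile(aligned, pseudocount=1.0):
    """Build a log-odds profile from equal-length, ungapped aligned segments.

    Assumption: uniform background frequencies (1/20 per residue).
    Returns one dict per motif column, mapping residue -> log-odds score.
    """
    length = len(aligned[0])
    background = 1.0 / len(AMINO_ACIDS)
    profile = []
    for col in range(length):
        counts = {aa: pseudocount for aa in AMINO_ACIDS}
        for seq in aligned:
            counts[seq[col]] += 1.0
        total = sum(counts.values())
        profile.append(
            {aa: math.log((counts[aa] / total) / background) for aa in AMINO_ACIDS}
        )
    return profile


def best_match(profile, sequence, gap=-4.0):
    """Align the whole profile to any stretch of the sequence.

    Dynamic programming with a linear gap penalty (an assumed value).
    Leading and trailing sequence residues are free.
    Returns (best score, end position in the sequence, exclusive).
    """
    m, n = len(profile), len(sequence)
    prev = [0.0] * (n + 1)  # row 0: the match may start anywhere for free
    for i in range(1, m + 1):
        cur = [prev[0] + gap] + [0.0] * n
        for j in range(1, n + 1):
            # Residues not in the 20-letter alphabet (e.g. X) score 0.
            match = prev[j - 1] + profile[i - 1].get(sequence[j - 1], 0.0)
            skip_column = prev[j] + gap      # profile column left unmatched
            extra_residue = cur[j - 1] + gap  # sequence residue inserted
            cur[j] = max(match, skip_column, extra_residue)
        prev = cur
    end = max(range(n + 1), key=lambda j: prev[j])
    return prev[end], end


if __name__ == "__main__":
    # Hypothetical aligned motif instances, invented for this example.
    profile = build_profile(["GKSTL", "GKSSL", "GKTTL"])
    print(best_match(profile, "MAAAGKSTLPPP"))
```

Because each column carries a score for every residue, a profile can still give a positive score to a segment that differs from the most common residues at one or two positions, which a strict consensus pattern would reject outright. That is the flexibility the abstract attributes to the profile representation.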
