Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 Jun;15(6):1530-6.
doi: 10.1110/ps.062152706. Epub 2006 May 2.

Recurrent use of evolutionary importance for functional annotation of proteins based on local structural similarity

Affiliations

Recurrent use of evolutionary importance for functional annotation of proteins based on local structural similarity

David M Kristensen et al. Protein Sci. 2006 Jun.

Abstract

The annotation of protein function has not kept pace with the exponential growth of raw sequence and structure data. An emerging solution to this problem is to identify 3D motifs or templates in protein structures that are necessary and sufficient determinants of function. Here, we demonstrate the recurrent use of evolutionary trace information to construct such 3D templates for enzymes, search for them in other structures, and distinguish true from spurious matches. Serine protease templates built from evolutionarily important residues distinguish between proteases and other proteins nearly as well as the classic Ser-His-Asp catalytic triad. In 53 enzymes spanning 33 distinct functions, an automated pipeline identifies functionally related proteins with an average positive predictive power of 62%, including correct matches to proteins with the same function but with low sequence identity (the average identity for some templates is only 17%). Although these template building, searching, and match classification strategies are not yet optimized, their sequential implementation demonstrates a functional annotation pipeline which does not require experimental information, but only local molecular mimicry among a small number of evolutionarily important residues.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Distribution of matches for serine protease motifs: (A) catalytic triad, (B) noncatalytic quartet, (C) negative control, (D) surface trace cluster. The vertical lines represent the points at which the p-value = 1% (solid green) and 5% (dashed purple). Structural template representations created using PyMOL (DeLano 2002).
Figure 2.
Figure 2.
Distribution of matches for 53 enzymes in a single dimension, LRMSD (A), and two dimensions, LRMSD and evolutionary importance (B).

References

    1. Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J. 1990. Basic local alignment search tool. J. Mol. Biol. 215 403–410. - PubMed
    1. Altschul S.F., Madden T.L., Schaffer A.A., Zhang J., Zhang Z., Miller W., Lipman D.J. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25 3389–3402. - PMC - PubMed
    1. Ausiello G., Zanzoni A., Peluso D., Via A., Helmer-Citterich M. 2005. Mass selection and fast comparison of annotated PDB residues. Nucleic Acids Res. 33 W133–W137. - PMC - PubMed
    1. Barker J.A. and Thornton J.M. 2003. An algorithm for constraint-based structural template matching: Application to 3D templates with statistical analysis. Bioinformatics 19 1644–1649. - PubMed
    1. Bartlett G.J., Porter C.T., Borkakoti N., Thornton J.M. 2002. Analysis of catalytic residues in enzyme active sites. J. Mol. Biol. 324 105–121. - PubMed

Publication types

LinkOut - more resources