Protein Sci. 1998 Feb;7(2):445-56.
doi: 10.1002/pro.5560070226.

Comprehensive assessment of automatic structural alignment against a manual standard, the scop classification of proteins

M Gerstein et al. Protein Sci. 1998 Feb.

Abstract

We apply a simple method for aligning protein sequences on the basis of 3D structure, on a large scale, to the proteins in the scop classification of fold families. This allows us to assess, understand, and improve our automatic method against an objective, manually derived standard, a type of comprehensive evaluation that has not yet been possible for other structural alignment algorithms. Our basic approach directly matches the backbones of two structures, using repeated cycles of dynamic programming and least-squares fitting to determine an alignment that minimizes coordinate difference. Because of its simplicity, our method can be readily modified to take into account additional features of protein structure, such as the orientation of side chains or the location-dependent cost of opening a gap. Our basic method, augmented by such modifications, can find reasonable alignments for all but 1.5% of the known structural similarities in scop, i.e., all but 32 of the 2,107 superfamily pairs. We discuss the specific protein structural features that make these 32 pairs so difficult to align and show how our procedure effectively partitions the relationships in scop into different categories, depending on which aspects of protein structure are involved (e.g., whether or not consideration of side-chain orientation is necessary for proper alignment). We also show how our pairwise alignment procedure can be extended to generate a multiple alignment for a group of related structures. We have compared these alignments in detail with corresponding manual ones culled from the literature. We find good agreement (to within 95% for the core regions), and the detailed comparison highlights how particular protein structural features (such as certain strands) are problematic to align, giving somewhat ambiguous results. With these improvements and systematic tests, our procedure should be useful for the development of scop and the future classification of protein folds.
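
The core loop described in the abstract (repeated cycles of dynamic programming and least-squares fitting) can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the starting equivalence, the similarity score `M / (1 + (d/d0)^2)`, the constants `M`, `d0`, and `gap`, and the linear gap penalty are all placeholders that do not come from the abstract. The least-squares fit uses Horn's quaternion method with a shifted power iteration, chosen only to keep the sketch free of external dependencies.

```python
import math

def centroid(pts):
    n = len(pts)
    return tuple(sum(p[k] for p in pts) / n for k in range(3))

def fit_transform(P, Q):
    """Least-squares rigid-body fit of paired points P onto Q (Horn's quaternion
    method). Returns a function that maps a point from P's frame into Q's frame."""
    cp, cq = centroid(P), centroid(Q)
    # 3x3 correlation matrix of the centred coordinates.
    S = [[sum((p[a] - cp[a]) * (q[b] - cq[b]) for p, q in zip(P, Q))
          for b in range(3)] for a in range(3)]
    (xx, xy, xz), (yx, yy, yz), (zx, zy, zz) = S
    N = [[xx + yy + zz, yz - zy, zx - xz, xy - yx],
         [yz - zy, xx - yy - zz, xy + yx, zx + xz],
         [zx - xz, xy + yx, -xx + yy - zz, yz + zy],
         [xy - yx, zx + xz, yz + zy, -xx - yy + zz]]
    # Shifted power iteration: the eigenvector of N with the largest eigenvalue
    # is the unit quaternion of the optimal rotation.
    shift = math.sqrt(sum(v * v for row in N for v in row)) + 1e-12
    q = [1.0, 0.5, 0.25, 0.125]
    for _ in range(500):
        q = [sum(N[i][j] * q[j] for j in range(4)) + shift * q[i] for i in range(4)]
        norm = math.sqrt(sum(v * v for v in q))
        q = [v / norm for v in q]
    w, x, y, z = q
    R = [[w*w + x*x - y*y - z*z, 2*(x*y - w*z), 2*(x*z + w*y)],
         [2*(y*x + w*z), w*w - x*x + y*y - z*z, 2*(y*z - w*x)],
         [2*(z*x - w*y), 2*(z*y + w*x), w*w - x*x - y*y + z*z]]

    def apply(p):
        d = [p[k] - cp[k] for k in range(3)]
        return tuple(sum(R[a][k] * d[k] for k in range(3)) + cq[a] for a in range(3))
    return apply

def dp_align(A, B, M=20.0, d0=2.24, gap=10.0):
    """Global dynamic programming over a distance-derived similarity matrix.
    M, d0 and gap are illustrative values (assumptions). Returns matched (i, j)."""
    n, m = len(A), len(B)
    sim = [[M / (1.0 + (math.dist(a, b) / d0) ** 2) for b in B] for a in A]
    H = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        H[i][0] = -gap * i
    for j in range(1, m + 1):
        H[0][j] = -gap * j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            H[i][j] = max(H[i-1][j-1] + sim[i-1][j-1],
                          H[i-1][j] - gap,
                          H[i][j-1] - gap)
    # Trace back from the corner to recover the matched residue pairs.
    pairs, i, j = [], n, m
    while i > 0 and j > 0:
        if H[i][j] == H[i-1][j-1] + sim[i-1][j-1]:
            pairs.append((i - 1, j - 1))
            i, j = i - 1, j - 1
        elif H[i][j] == H[i-1][j] - gap:
            i -= 1
        else:
            j -= 1
    return pairs[::-1]

def structural_align(A, B, max_cycles=20):
    """Alternate fitting and dynamic programming until the alignment stops
    changing. A and B are lists of (x, y, z) backbone coordinates."""
    k = min(len(A), len(B))
    pairs = [(i, i) for i in range(k)]  # naive starting equivalence (assumption)
    for _ in range(max_cycles):
        move = fit_transform([A[i] for i, _ in pairs], [B[j] for _, j in pairs])
        new_pairs = dp_align([move(a) for a in A], B)
        if new_pairs == pairs:
            break
        pairs = new_pairs
    # Refit on the final equivalences and report the RMSD over matched pairs.
    move = fit_transform([A[i] for i, _ in pairs], [B[j] for _, j in pairs])
    rmsd = math.sqrt(sum(math.dist(move(A[i]), B[j]) ** 2 for i, j in pairs) / len(pairs))
    return pairs, rmsd
```

Calling `structural_align(A, B)` on two lists of backbone coordinates (for example, C-alpha positions) returns the matched index pairs and the RMSD over those pairs. Like any alternating scheme of this kind, the result can depend on the starting equivalence, so a practical tool would likely try several starting points. The modifications the abstract mentions, such as side-chain orientation or a location-dependent gap-opening cost, would enter through the similarity matrix and the gap term in `dp_align`.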
