Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 1996 Jun 11;93(12):5814-8.
doi: 10.1073/pnas.93.12.5814.

Global properties of the mapping between local amino acid sequence and local structure in proteins

Affiliations

Global properties of the mapping between local amino acid sequence and local structure in proteins

K F Han et al. Proc Natl Acad Sci U S A. .

Abstract

Local protein structure prediction efforts have consistently failed to exceed approximately 70% accuracy. We characterize the degeneracy of the mapping from local sequence to local structure responsible for this failure by investigating the extent to which similar sequence segments found in different proteins adopt similar three-dimensional structures. Sequence segments 3-15 residues in length from 154 different protein families are partitioned into neighborhoods containing segments with similar sequences using cluster analysis. The consistency of the sequence-to-structure mapping is assessed by comparing the local structures adopted by sequence segments in the same neighborhood in proteins of known structure. In the 154 families, 45% and 28% of the positions occur in neighborhoods in which one and two local structures predominate, respectively. The sequence patterns that characterize the neighborhoods in the first class probably include virtually all of the short sequence motifs in proteins that consistently occur in a particular local structure. These patterns, many of which occur in transitions between secondary structural elements, are an interesting combination of previously studied and novel motifs. The identification of sequence patterns that consistently occur in one or a small number of local structures in proteins should contribute to the prediction of protein structure from sequence.

PubMed Disclaimer

References

    1. J Mol Biol. 1993 Jul 20;232(2):584-99 - PubMed
    1. Biochemistry. 1993 Aug 3;32(30):7605-9 - PubMed
    1. J Comput Aided Mol Des. 1993 Aug;7(4):457-72 - PubMed
    1. Proteins. 1994 Jan;18(1):1-7 - PubMed
    1. Science. 1994 May 20;264(5162):1126-30 - PubMed

Publication types

MeSH terms

LinkOut - more resources