Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 1992 Mar;1(3):401-8.
doi: 10.1002/pro.5560010312.

An optimization approach to predicting protein structural class from amino acid composition

Affiliations
Free PMC article
Comparative Study

An optimization approach to predicting protein structural class from amino acid composition

C T Zhang et al. Protein Sci. 1992 Mar.
Free PMC article

Abstract

Proteins are generally classified into four structural classes: all-alpha proteins, all-beta proteins, alpha + beta proteins, and alpha/beta proteins. In this article, a protein is expressed as a vector of 20-dimensional space, in which its 20 components are defined by the composition of its 20 amino acids. Based on this, a new method, the so-called maximum component coefficient method, is proposed for predicting the structural class of a protein according to its amino acid composition. In comparison with the existing methods, the new method yields a higher general accuracy of prediction. Especially for the all-alpha proteins, the rate of correct prediction obtained by the new method is much higher than that by any of the existing methods. For instance, for the 19 all-alpha proteins investigated previously by P.Y. Chou, the rate of correct prediction by means of his method was 84.2%, but the correct rate when predicted with the new method would be 100%! Furthermore, the new method is characterized by an explicable physical picture. This is reflected by the process in which the vector representing a protein to be predicted is decomposed into four component vectors, each of which corresponds to one of the norms of the four protein structural classes.

PubMed Disclaimer

References

    1. Gene. 1990 Dec 15;96(2):161-9 - PubMed
    1. J Bacteriol. 1990 Dec;172(12):7227-40 - PubMed
    1. Nucleic Acids Res. 1990 Apr 25;18 Suppl:2367-411 - PubMed
    1. Protein Eng. 1989 Nov;3(2):85-94 - PubMed
    1. Eur J Biochem. 1989 Mar 15;180(2):479-84 - PubMed

Publication types

LinkOut - more resources