Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 1982;18(4):240-50.
doi: 10.1007/BF01734102.

Assessment of similarities of pairs and groups of proteins using transformed amino-acid-residue data

Assessment of similarities of pairs and groups of proteins using transformed amino-acid-residue data

A H Reisner et al. J Mol Evol. 1982.

Abstract

Using as a primary standard a representative set of 208 proteins whose amino-acid-residue mole frequencies have been accurately established, a set of standard distributions of mole frequencies is defined for each amino acids, in terms of which percentile values for the observed mole frequencies of the amino-acid residues in any other protein can be determined. Data so transformed have a distribution much closer to Gaussian than untransformed values, and allow meaningful determinations of correlations between the amino-acid-residue compositions of two proteins as well as between pairs of amino-acid-residues within groups of proteins. Of the 153 possible pairs of amino acids (Asx and Glx are used) 39 are significantly correlated at p less than or equal to 0.01 and 22 at p less than or equal to 0.001. A percentile table is included for those wishing to use the method with programmable calculators. The transformed data for amino-acid compositions have been used to perform principal components analyses on groups of proteins in order to determine if meaningful sub-groupings (observable clusters in scatter diagrams) were detectable. Such analyses are shown for the representative set of proteins and for a group of 184 globins. With regard to the globin chains, a correlation is observed for alpha chains in the first principal component projection (PCP), (accounting for 22% of the variance) with respect to the evolutionary time-scale while beta chains show such a correlation in the first and second PCPs (22% and 18% of the variance respectively). Thus, alpha and beta chains appear to diverge from a common progenitor, similar in position to globin chains from "primitive" forms. Furthermore, globins from "primitive" forms are nearer to one another than they are to globins from the vertebrates, a finding without a priori reason, suggesting perhaps that once a chain has reached a stable relationship with its environment, strong constrains are placed on the co-existing globin chains so that they maintain appropriate interaction with one another. In addition, positions of the epsilon, gamma and delta chains are in the order: epsilon (embryonal) more primitive than gamma (foetal) more primitive than delta equal to beta (adult).

PubMed Disclaimer

References

    1. Anal Biochem. 1971 Nov;44(1):159-73 - PubMed
    1. Anal Biochem. 1973 Mar;52(1):234-54 - PubMed
    1. Anal Biochem. 1973 Nov;56(1):208-36 - PubMed
    1. Anal Biochem. 1974 Oct;61(2):567-609 - PubMed
    1. Biochemistry. 1969 Jun;8(6):2442-54 - PubMed