Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Mar;27(3):609-21.
doi: 10.1093/molbev/msp277. Epub 2009 Nov 18.

Comparing models of evolution for ordered and disordered proteins

Affiliations

Comparing models of evolution for ordered and disordered proteins

Celeste J Brown et al. Mol Biol Evol. 2010 Mar.

Erratum in

  • Mol Biol Evol. 2012 Jan;29(1):443

Abstract

Most models of protein evolution are based upon proteins that form relatively rigid 3D structures. A significant fraction of proteins, the so-called disordered proteins, do not form rigid 3D structures and sample a broad conformational ensemble. Disordered proteins do not typically maintain long-range interactions, so the constraints on their evolution should be different than ordered proteins. To test this hypothesis, we developed and compared models of evolution for disordered and ordered proteins. Substitution matrices were constructed using the sequences of putative homologs for sets of experimentally characterized disordered and ordered proteins. Separate matrices, at three levels of sequence similarity (>85%, 85-60%, and 60-40%), were inferred for each type of protein structure. The substitution matrices for disordered and ordered proteins differed significantly at each level of sequence similarity. The disordered matrices reflected a greater likelihood of evolutionary changes, relative to the ordered matrices, and these changes involved nonconservative substitutions. Glutamic acid and asparagine were interesting exceptions to this result. Important differences between the substitutions that are accepted in disordered proteins relative to ordered proteins were also identified. In general, disordered proteins have fewer evolutionary constraints than ordered proteins. However, some residues like tryptophan and tyrosine are highly conserved in disordered proteins. This is due to their important role in forming protein-protein interfaces. Finally, the amino acid frequencies for disordered proteins, computed during the development of the matrices, were compared with amino acid frequencies for different categories of secondary structure in ordered proteins. The highest correlations were observed between the amino acid frequencies in disordered proteins and the solvent-exposed loops and turns of ordered proteins, supporting an emerging structural model for disordered proteins.

PubMed Disclaimer

Figures

F<sc>IG</sc>. 1.
FIG. 1.
Iterative procedure used for constructing substitution matrices.
F<sc>IG</sc>. 2.
FIG. 2.
Amino acid frequencies (pi) of residues in each matrix. The axis is sorted by average frequency of the residues in the D matrices. Shades of red are D matrices; shades of blue are O matrices. Bars indicate ±0.2%, which is within rounding error and slightly greater than the maximum standard deviation. D, disorder; O, order.
F<sc>IG</sc>. 3.
FIG. 3.
(A) Substitution matrices for ordered (O40, upper) and disordered (D40, lower) proteins at 40–60% sequence identity. Color shading indicates probability of substitutions being greater than expected by chance (blue) or less than expected (red). (B) Matrix showing the difference between D40 and O40. Color shading indicates greater frequency of substitutions in disorder (blue) or greater frequency in order (red). In all matrices, residues are ordered from most order promoting (green) to most disorder promoting (white) as shown in (Radivojac et al. 2007). D, disorder; O, order.
F<sc>IG</sc>. 4.
FIG. 4.
Residues that are most conserved are different between disordered and ordered proteins, and conservation is not determined by frequency. (A) Differences between the probability that an amino acid is conserved at a site (pii) for disorder and order at 40–60% identity for each of the 20 amino acids (i). Error bars indicate one standard deviation. (B) Scatter plot of the frequencies of each amino acid versus pii for disorder (filled diamonds) and order (open squares) at 40–60% identity.
F<sc>IG</sc>. 5.
FIG. 5.
Frequencies of amino acids in disordered proteins are most similar to the frequencies of amino acids in the exposed coils and turns of ordered proteins. The line indicates a one-to-one correspondence between frequency in order and disorder. (Information for disordered proteins is based upon the >85% similarity matrix and for ordered proteins is from Goldman et al. 1998.)
F<sc>IG</sc>. 6.
FIG. 6.
Scatter plot of the residue-normalized diagonal values, pii, for the eight secondary structure matrices and D85. The plot is ordered based upon increasing pii of the D85 matrix. Lines between data points from the same matrix are added for clarity. (Information for disordered proteins is based upon the >85% similarity matrix and for ordered proteins is from Goldman et al. 1998.). D, disorder; O, order.
F<sc>IG</sc>. 7.
FIG. 7.
Protein sequence alignment of the disordered p53 transactivation domain for seven closely related family members. The orange bar shows the position of the amphipathic helix that forms when p53 binds to the ubiquitin ligase, MDM2. The green bar shows the position of the amphipathic helix that forms when p53 binds to RPA70.

Similar articles

Cited by

References

    1. Abramova NA, Russell J, Botchan M, Li R. Interaction between replication protein A and p53 is disrupted after UV damage in a DNA repair-dependent manner. Proc Natl Acad Sci USA. 1997;94:7186–7191. - PMC - PubMed
    1. Altschul S, Madden T, Schaffer A, Zhang J, Zhang Z, Miller W, Lipman D. Gapped BLAST and PSI-BLAST: a new generation of protein database search tools. Nucleic Acids Res. 1997;25:3389–3402. - PMC - PubMed
    1. Altschul SF. Amino acid substitution matrices from an information theoretic perspective. J Mol Biol. 1991;219:555–565. - PMC - PubMed
    1. Anisimova M, Kosiol C. Investigating protein-coding sequence evolution with probabilistic codon substitution models. Mol Biol Evol. 2009;26:255–271. - PubMed
    1. Appella E, Anderson CW. Post-translational modifications and activation of p53 by genotoxic stresses. Eur J Biochem. 2001;268:2764–2772. - PubMed

Publication types