Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2004 Nov;13(11):2871-7.
doi: 10.1110/ps.04881304.

To be folded or to be unfolded?

Affiliations

To be folded or to be unfolded?

Sergiy O Garbuzynskiy et al. Protein Sci. 2004 Nov.

Abstract

The lack of ordered structure in "natively unfolded" proteins raises a general question: Are there intrinsic properties of amino acid residues that are responsible for the absence of fixed structure at physiological conditions? In this article, we demonstrate that the competence of a protein to be folded or to be unfolded may be determined by the property of amino acid residues to form a sufficient number of contacts in a globular state. The expected average number of contacts per residue calculated from the amino acid sequence alone (using the average number of contacts for 20 amino acid residues in globular proteins) can be used as one of the simple indicators of natively unfolded proteins. The prediction accuracy for the sets of 80 folded and 90 natively unfolded proteins reaches 89% if the expected average number of contacts is used as a parameter and 83% in the case of hydrophobicity. An optimal set of artificial parameters for 20 amino acid residues obtained by Monte Carlo algorithm to maximally separate the sets of 90 natively unfolded and 80 folded proteins demonstrates the upper limit for prediction accuracy, which is 95%.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Comparison of the mean values of different parameters computed from sequence alone for the set of 90 “natively unfolded” proteins (black circles) and for the set of 80 “ideally” folded proteins (gray circles). All parameters presented here are averaged per residue; one circle corresponds to one protein. Dependence of (A) expected number of contacts, (B) hydrophobicity in the Kyte and Doolittle (1982) scale, and (C) absolute magnitude of net charge on the number of residues in protein. (D) Dependence of net charge on hydrophobicity. (E) Dependence of the optimal set of artificial parameters on the number of residues in protein. (F) Dependence of expected number of contacts on hydrophobicity. Inset demonstrates the standard deviations for considered parameters.
Figure 2.
Figure 2.
Histograms representing the distribution of 90 “natively unfolded” proteins (black bars) and 80 “ideally” folded proteins (gray bars) as a function of (A) expected number of contacts, (B) hydrophobicity on the Kyte and Doolittle (1982) scale, (C) absolute magnitude of net charge, and (D) optimal set of artificial parameters. The dashed line representing the optimal border between two groups of proteins is placed in the intersection of their distributions. The dependence of the fraction of proteins predicted incorrectly (i.e., native proteins predicted as natively unfolded and vice versa) on the border position of considered parameters between two sets of proteins: (E) for expected number of contacts, (F) for hydrophobicity, (G) for absolute magnitude of net charge, and (H) for optimal set of artificial parameters.

Similar articles

Cited by

References

    1. Bairoch, A. and Apweiler, R. 2000. The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 28 45–48. - PMC - PubMed
    1. Bernstein, F.C., Koetzle, T.F., Williams, G.J.B., Meyer, E.F., Brice, M.D., Rogers, J.R., Kennard, O., Shimanouchi, T., and Tasumi, M. 1977. The Protein Data Bank. A computer-based archival file for macromolecular structures. Eur. J. Biochem. 80 319–324. - PubMed
    1. Dunker, A.K., Garner, E., Guilliot, S., Romero, P., Albrecht, K., Hart, J., Obradovic, Z., Kissinger, C., and Villafranca, J.E. 1998. Protein disorder and the evolution of molecular recognition: Theory, predictions and observations. Pac. Symp. Biocomput. 473–484. - PubMed
    1. Dyson, H.J. and Wright, P.E. 2002. Insights into the structure and dynamics of unfolded proteins from nuclear magnetic resonance. Adv. Protein Chem. 62 311–340. - PubMed
    1. Fauchere, I.I. and Pliska, V. 1983. Hydrophobic parameters amino-acid side chains from partitioning of N-acetyl-amino-acid amides. Eur. J. Med. Chem.-Chim. Ther. 18 369–375.

Publication types

LinkOut - more resources