Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2009 Apr 8;10(4):1567-1589.
doi: 10.3390/ijms10041567.

Folding by numbers: primary sequence statistics and their use in studying protein folding

Affiliations
Review

Folding by numbers: primary sequence statistics and their use in studying protein folding

Brent Wathen et al. Int J Mol Sci. .

Abstract

The exponential growth over the past several decades in the quantity of both primary sequence data available and the number of protein structures determined has provided a wealth of information describing the relationship between protein primary sequence and tertiary structure. This growing repository of data has served as a prime source for statistical analysis, where underlying relationships between patterns of amino acids and protein structure can be uncovered. Here, we survey the main statistical approaches that have been used for identifying patterns within protein sequences, and discuss sequence pattern research as it relates to both secondary and tertiary protein structure. Limitations to statistical analyses are discussed, and a context for their role within the field of protein folding is given. We conclude by describing a novel statistical study of residue patterning in beta-strands, which finds that hydrophobic (i,i+2) pairing in beta-strands occurs more often than expected at locations near strand termini. Interpretations involving beta-sheet nucleation and growth are discussed.

Keywords: Primary Sequence; Protein Folding; Sequence-Structure Relationship.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Hydrophobic and hydrophilic coupling within β-strands.
Figure 2.
Figure 2.
Position-specific hydrophobic coupling within β-strands.

Similar articles

Cited by

References

    1. Anfinsen CB, Haber E, Sela M, White FH., Jr The kinetics of formation of native ribonuclease during oxidation of the reduced polypeptide chain. Proc. Natl. Acad. Sci. USA. 1961;47:1309–1314. - PMC - PubMed
    1. Rossmann MG, Argos P. Protein folding. Ann. Rev. Biochem. 1981;50:497–532. - PubMed
    1. Levinthal C. Are there pathways for protein folding? J. Chem. Phys. 1968;65:44–45.
    1. Fetrow JS, Giamonna A, Kolinski A, Sholnick J. The protein folding problem: a biophysical enigma. Curr. Pharm. Biotechnol. 2002;3:329–347. - PubMed
    1. Dill KA, Ozkan SB, Shell MS, Weikl TR. The protein folding problem. Annu. Rev. Biophys. 2008;37:289–316. - PMC - PubMed

LinkOut - more resources