Folding by numbers: primary sequence statistics and their use in studying protein folding
- PMID: 19468326
- PMCID: PMC2680634
- DOI: 10.3390/ijms10041567
Folding by numbers: primary sequence statistics and their use in studying protein folding
Abstract
The exponential growth over the past several decades in the quantity of both primary sequence data available and the number of protein structures determined has provided a wealth of information describing the relationship between protein primary sequence and tertiary structure. This growing repository of data has served as a prime source for statistical analysis, where underlying relationships between patterns of amino acids and protein structure can be uncovered. Here, we survey the main statistical approaches that have been used for identifying patterns within protein sequences, and discuss sequence pattern research as it relates to both secondary and tertiary protein structure. Limitations to statistical analyses are discussed, and a context for their role within the field of protein folding is given. We conclude by describing a novel statistical study of residue patterning in beta-strands, which finds that hydrophobic (i,i+2) pairing in beta-strands occurs more often than expected at locations near strand termini. Interpretations involving beta-sheet nucleation and growth are discussed.
Keywords: Primary Sequence; Protein Folding; Sequence-Structure Relationship.
Figures
Similar articles
-
Prediction of folding mechanisms for Ig-like beta sandwich proteins based on inter-residue average distance statistics methods.Proteins. 2019 Feb;87(2):120-135. doi: 10.1002/prot.25637. Epub 2018 Dec 21. Proteins. 2019. PMID: 30520530
-
Role of hydrophobic clusters and long-range contact networks in the folding of (alpha/beta)8 barrel proteins.Biophys J. 2003 Mar;84(3):1919-25. doi: 10.1016/s0006-3495(03)75000-0. Biophys J. 2003. PMID: 12609894 Free PMC article.
-
An amino acid code for β-sheet packing structure.Proteins. 2014 Sep;82(9):2128-40. doi: 10.1002/prot.24569. Epub 2014 Apr 16. Proteins. 2014. PMID: 24668690 Free PMC article.
-
Understanding the mechanism of beta-sheet folding from a chemical and biological perspective.Biopolymers. 2008;90(6):751-8. doi: 10.1002/bip.21101. Biopolymers. 2008. PMID: 18844292 Review.
-
Coupled folding and specific binding: fishing for amphiphilicity.Int J Mol Sci. 2011;12(3):1431-50. doi: 10.3390/ijms12031431. Epub 2011 Feb 24. Int J Mol Sci. 2011. PMID: 21673899 Free PMC article. Review.
Cited by
-
A decade and a half of protein intrinsic disorder: biology still waits for physics.Protein Sci. 2013 Jun;22(6):693-724. doi: 10.1002/pro.2261. Epub 2013 Apr 29. Protein Sci. 2013. PMID: 23553817 Free PMC article. Review.
-
On the Roles of Protein Intrinsic Disorder in the Origin of Life and Evolution.Life (Basel). 2024 Oct 15;14(10):1307. doi: 10.3390/life14101307. Life (Basel). 2024. PMID: 39459607 Free PMC article. Review.
-
Protein beta-sheet nucleation is driven by local modular formation.J Biol Chem. 2010 Jun 11;285(24):18376-84. doi: 10.1074/jbc.M110.120824. Epub 2010 Apr 10. J Biol Chem. 2010. PMID: 20382979 Free PMC article.
-
BetaSearch: a new method for querying β-residue motifs.BMC Res Notes. 2012 Jul 30;5:391. doi: 10.1186/1756-0500-5-391. BMC Res Notes. 2012. PMID: 22839199 Free PMC article.
References
-
- Rossmann MG, Argos P. Protein folding. Ann. Rev. Biochem. 1981;50:497–532. - PubMed
-
- Levinthal C. Are there pathways for protein folding? J. Chem. Phys. 1968;65:44–45.
-
- Fetrow JS, Giamonna A, Kolinski A, Sholnick J. The protein folding problem: a biophysical enigma. Curr. Pharm. Biotechnol. 2002;3:329–347. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Miscellaneous