Finding the needle in the haystack: towards solving the protein-folding problem computationally
- PMID: 28976219
- PMCID: PMC6790072
- DOI: 10.1080/10409238.2017.1380596
Finding the needle in the haystack: towards solving the protein-folding problem computationally
Abstract
Prediction of protein tertiary structures from amino acid sequence and understanding the mechanisms of how proteins fold, collectively known as "the protein folding problem," has been a grand challenge in molecular biology for over half a century. Theories have been developed that provide us with an unprecedented understanding of protein folding mechanisms. However, computational simulation of protein folding is still difficult, and prediction of protein tertiary structure from amino acid sequence is an unsolved problem. Progress toward a satisfying solution has been slow due to challenges in sampling the vast conformational space and deriving sufficiently accurate energy functions. Nevertheless, several techniques and algorithms have been adopted to overcome these challenges, and the last two decades have seen exciting advances in enhanced sampling algorithms, computational power and tertiary structure prediction methodologies. This review aims at summarizing these computational techniques, specifically conformational sampling algorithms and energy approximations that have been frequently used to study protein-folding mechanisms or to de novo predict protein tertiary structures. We hope that this review can serve as an overview on how the protein-folding problem can be studied computationally and, in cases where experimental approaches are prohibitive, help the researcher choose the most relevant computational approach for the problem at hand. We conclude with a summary of current challenges faced and an outlook on potential future directions.
Keywords: Protein-folding problem; conformational sampling algorithms; protein energy approximations; protein structure prediction; protein-folding simulation; sparse experimental data.
Conflict of interest statement
Disclosure statement
The authors report no conflicts of interest. The authors alone are responsible for the content and writing of this article.
Figures







References
-
- Abeysinghe S, Ju T, Baker ML, Chiu W. 2008. Shape modeling and matching in identifying 3D protein structures. Computer-Aided Design. 40:708–720.
-
- Amir ED, Kalisman N, Keasar C. 2008. Differentiable, multidimensional, knowledge-based energy terms for torsion angle probabilities and propensities. Proteins. 72:62–73. - PubMed
-
- Anfinsen CB. 1973. Principles that govern the folding of protein chains. Science. 181:223–230. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources