State-of-the-Art Estimation of Protein Model Accuracy Using AlphaFold
- PMID: 36563190
- PMCID: PMC12178128
- DOI: 10.1103/PhysRevLett.129.238101
State-of-the-Art Estimation of Protein Model Accuracy Using AlphaFold
Abstract
The problem of predicting a protein's 3D structure from its primary amino acid sequence is a longstanding challenge in structural biology. Recently, approaches like alphafold have achieved remarkable performance on this task by combining deep learning techniques with coevolutionary data from multiple sequence alignments of related protein sequences. The use of coevolutionary information is critical to these models' accuracy, and without it their predictive performance drops considerably. In living cells, however, the 3D structure of a protein is fully determined by its primary sequence and the biophysical laws that cause it to fold into a low-energy configuration. Thus, it should be possible to predict a protein's structure from only its primary sequence by learning an approximate biophysical energy function. We provide evidence that alphafold has learned such an energy function, and uses coevolution data to solve the global search problem of finding a low-energy conformation. We demonstrate that alphafold'slearned energy function can be used to rank the quality of candidate protein structures with state-of-the-art accuracy, without using any coevolution data. Finally, we explore several applications of this energy function, including the prediction of protein structures without multiple sequence alignments.
Figures




References
-
- Balakrishnan S, Kamisetty H, Carbonell JG, Lee S-I, and Langmead CJ, Learning generative models for protein fold families, Proteins 79, 1061 (2011). - PubMed
-
- Jones DT, Buchan DWA, Cozzetto D, and Pontil M, PSICOV: Precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments, Bioinformatics 28, 184 (2012). - PubMed
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources