Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Dec:1062-1067.
doi: 10.1109/ICDM.2011.88.

Learning Protein Folding Energy Functions

Affiliations

Learning Protein Folding Energy Functions

Wei Guan et al. Proc IEEE Int Conf Data Min. 2011 Dec.

Abstract

A critical open problem in ab initio protein folding is protein energy function design, which pertains to defining the energy of protein conformations in a way that makes folding most efficient and reliable. In this paper, we address this issue as a weight optimization problem and utilize a machine learning approach, learning-to-rank, to solve this problem. We investigate the ranking-via-classification approach, especially the RankingSVM method and compare it with the state-of-the-art approach to the problem using the MINUIT optimization package. To maintain the physicality of the results, we impose non-negativity constraints on the weights. For this we develop two efficient non-negative support vector machine (NNSVM) methods, derived from L2-norm SVM and L1-norm SVMs, respectively. We demonstrate an energy function which maintains the correct ordering with respect to structure dissimilarity to the native state more often, is more efficient and reliable for learning on large protein sets, and is qualitatively superior to the current state-of-the-art energy function.

Keywords: ab initio protein folding; energy function; learning-to-rank; non-negativity constrained SVM optimization; support vector machine.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Energy versus Structural Dissimilarity Plot: each dot represents a non-native conformation, and the red square represents the native structure
Figure 2
Figure 2
Error Plot of the Performance of the Learned Energy Functions
Figure 3
Figure 3
NNSVM Optimization Method Comparison

References

    1. Baker D, Sali A. Protein structure prediction and structural genomics. Science. 2001;294(5540):93. - PubMed
    1. Skolnick J, Kihara D, Zhang Y. Development and testing of the PROSPECTOR 3.0 threading algorithm. Proteins. 2004;3:502–518. - PubMed
    1. Anfinsen C, et al. Principles that govern the folding of protein chains. Science. 1973;181(96):223–230. - PubMed
    1. Zhang Y, Kolinski A, Skolnick J. Touchstone II: A new approach to ab initio protein structure prediction. Biophysical Journal. 2003;85:1145–1164. - PMC - PubMed
    1. Kabsch W. A solution for the best rotation to relate two sets of vectors. Acta Crystallographica Section A. 1976;32(5):922–923.

LinkOut - more resources