Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Mar 12:11:128.
doi: 10.1186/1471-2105-11-128.

New statistical potential for quality assessment of protein models and a survey of energy functions

Affiliations

New statistical potential for quality assessment of protein models and a survey of energy functions

Dmitry Rykunov et al. BMC Bioinformatics. .

Abstract

Background: Scoring functions, such as molecular mechanic forcefields and statistical potentials are fundamentally important tools in protein structure modeling and quality assessment.

Results: The performances of a number of publicly available scoring functions are compared with a statistical rigor, with an emphasis on knowledge-based potentials. We explored the effect on accuracy of alternative choices for representing interaction center types and other features of scoring functions, such as using information on solvent accessibility, on torsion angles, accounting for secondary structure preferences and side chain orientation. Partially based on the observations made, we present a novel residue based statistical potential, which employs a shuffled reference state definition and takes into account the mutual orientation of residue side chains. Atom- and residue-level statistical potentials and Linux executables to calculate the energy of a given protein proposed in this work can be downloaded from http://www.fiserlab.org/potentials.

Conclusions: Among the most influential terms we observed a critical role of a proper reference state definition and the benefits of including information about the microenvironment of interaction centers. Molecular mechanical potentials were also tested and found to be over-sensitive to small local imperfections in a structure, requiring unfeasible long energy relaxation before energy scores started to correlate with model quality.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Statistical significance of rank distributions obtained with different methods. Only p-values obtained with one-tailed Wilcoxon test above 0.05 are shown. Alternative hypothesis in the test was that location of the distribution obtained with scoring function in a row is lower than the one in the column. Interaction centers are color coded in row titles as follows: black is used for residue-based potentials, blue is used for atom-based potentials, and red designates composite scoring functions. Certain interaction types are color coded in column titles, specifically, scoring functions making use of solvent accessibility (green), torsion angle dependence (blue), secondary structure dependence (red), and orientation-dependence (in italic).
Figure 2
Figure 2
Performance of different potentials as a function of the quality of the best available model. Average rank calculated by (A) composite functions, (B) heavy-atom based functions, and (C) residue-based potential functions for targets having best model with GDT_TS better than 95.0 (11 targets), between 87.5 and 95.0 (25 targets), between 80.0 and 87.5 (32 targets), between 72.5 and 80.0 (47 targets) and between 65.0 and 72.5 (28 targets).
Figure 3
Figure 3
Definition of residue orientation used to derive potentials. (A) interaction i j is considered "parallel" if the scalar product of Cα-Cβ vectors for residues i (vector a) and j (vector b) is positive; (B) interaction i j is considered "antiparallel i facing j" if scalar product of a and b is non-positive and scalar product of vector a and vector from Cα atom of residue i to Cα atom of residue j (vector c) is positive; (C) interaction i j is considered as "antiparallel i pointing away from j" if both a·b and a·c scalar products are non-positive.
Figure 4
Figure 4
Superimposition of models of different quality with the experimental solution structure. Experimental structure of the CASP8 target T0502 (violet) and its models, (A) METATASSER_TS5, GDT_TS = 80.357, (B) 3Dpro_TS4, GDT_TS = 60.204, and (C) panther_server_TS2, GDT_TS = 44.643 are shown as Cα traces. Those parts of the models, where the experimental positions are not known are colored white. Parts of models deviating from experimentally determined positions less than 4 Å are colored green, and the rest is colored bronze. Cα-Cα pseudobonds longer than 3.9 Å are shown thin. This plot has been generated using MOLSCRIPT software [63].

References

    1. Levitt M, Warshel A. Computer simulation of protein folding. Nature. 1975;253(5494):694. doi: 10.1038/253694a0. - DOI - PubMed
    1. Miyazawa S, Jernigan RL. Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation. Macromolecules. 1985;18:534. doi: 10.1021/ma00145a039. - DOI
    1. Pohl FM. Empirical protein energy maps. NatNew Biol. 1971;234(52):277. - PubMed
    1. Samudrala R, Moult J. An all-atom distance-dependent conditional probability discriminatory function for protein structure prediction. JMolBiol. 1998;275(5):895. - PubMed
    1. Rajgaria R, McAllister SR, Floudas CA. A novel high resolution Calpha--Calpha distance dependent force field based on a high quality decoy set. Proteins. 2006;65(3):726–741. doi: 10.1002/prot.21149. - DOI - PubMed

Publication types