Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Oct;16(10):2123-39.
doi: 10.1110/ps.072939707.

Reduced C(beta) statistical potentials can outperform all-atom potentials in decoy identification

Affiliations

Reduced C(beta) statistical potentials can outperform all-atom potentials in decoy identification

James E Fitzgerald et al. Protein Sci. 2007 Oct.

Abstract

We developed a series of statistical potentials to recognize the native protein from decoys, particularly when using only a reduced representation in which each side chain is treated as a single C(beta) atom. Beginning with a highly successful all-atom statistical potential, the Discrete Optimized Protein Energy function (DOPE), we considered the implications of including additional information in the all-atom statistical potential and subsequently reducing to the C(beta) representation. One of the potentials includes interaction energies conditional on backbone geometries. A second potential separates sequence local from sequence nonlocal interactions and introduces a novel reference state for the sequence local interactions. The resultant potentials perform better than the original DOPE statistical potential in decoy identification. Moreover, even upon passing to a reduced C(beta) representation, these statistical potentials outscore the original (all-atom) DOPE potential in identifying native states for sets of decoys. Interestingly, the backbone-dependent statistical potential is shown to retain nearly all of the information content of the all-atom representation in the C(beta) representation. In addition, these new statistical potentials are combined with existing potentials to model hydrogen bonding, torsion energies, and solvation energies to produce even better performing potentials. The ability of the C(beta) statistical potentials to accurately represent protein interactions bodes well for computational efficiency in protein folding calculations using reduced backbone representations, while the extensions to DOPE illustrate general principles for improving knowledge-based potentials.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Reference state used by DOPE for the case where the reference sphere has radius a = 24 Å and the cutoff distance for computing nonbonded interactions is assumed to be at least 48 Å. In actual applications, it is likely that a significantly shorter cutoff distance is used. Then, this function must be normalized differently as the longer distance information is irrelevant (see Appendix).
Figure 2.
Figure 2.
(A) The four-basin system used to define DOPE-Back. The basins called β and PPII are very common in β-sheet structures. The αR basin contains helical geometries. The basin termed ɛ comprises the rest of the Ramachandran map. (B) The more conventional five-basin Ramachandran map. The ɛ-basin from A is the union of γ and αL basins as well as the portion of αR basin that corresponds to turn geometries.
Figure 3.
Figure 3.
Reference states used in (A) DOPE-N1, (B) DOPE-N2, (C) DOPE-N3, and (D) DOPE-N4. Superimposed on these reference states is the DOPE-like spherical reference state obtained by matching the maxima of the distributions. The inequivalence between these distributions emphasizes the utility of the new local reference states introduced by DOPENN.
Figure 4.
Figure 4.
The DOPE, DOPENN, and DOPE-Back statistical potentials all yield strikingly different interaction energies. (A) Local interaction energies for DOPENN, (B) nonlocal interaction energies for DOPENN compared with DOPE, (C) interaction energies of DOPE-Back compared with DOPE. All interactions are between two arginine α-carbons.
Figure 5.
Figure 5.
Probability distributions of the correlation coefficient between DOPE and DOPE-Cβ vs. DOPE-Back and DOPE-Cβ-Back over all 306 decoy sets. To illustrate the geometric meaning of particular correlation coefficients, correlations between all-atom and reduced β-carbon DOPE-Back statistical potentials are taken from the Baker decoy set for the proteins 1ubx, 1ail, and 1bq9.

Similar articles

Cited by

References

    1. Anfinsen C.B. 1972. Formation and stabilization of protein structure. Biochem. J. 128: 737–749. - PMC - PubMed
    1. Anfinsen C.B. 1973. Principles that govern folding of protein chains. Science 181: 223–230. - PubMed
    1. Bauer A. and Beyer, A. 1994. An improved pair potential to recognize native protein folds. Proteins 18: 254–261. - PubMed
    1. Bennaim A. 1997. Statistical potentials extracted from protein structures: Are these meaningful potentials? J. Chem. Phys. 107: 3698–3706.
    1. Betancourt M.R. 2003. A reduced protein model with accurate native-structure identification ability. Proteins 53: 889–907. - PubMed

Publication types

LinkOut - more resources