Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Dec 27;14(12):e1006342.
doi: 10.1371/journal.pcbi.1006342. eCollection 2018 Dec.

Accurate calculation of side chain packing and free energy with applications to protein molecular dynamics

Affiliations

Accurate calculation of side chain packing and free energy with applications to protein molecular dynamics

John M Jumper et al. PLoS Comput Biol. .

Abstract

To address the large gap between time scales that can be easily reached by molecular simulations and those required to understand protein dynamics, we present a rapid self-consistent approximation of the side chain free energy at every integration step. In analogy with the adiabatic Born-Oppenheimer approximation for electronic structure, the protein backbone dynamics are simulated as preceding according to the dictates of the free energy of an instantaneously-equilibrated side chain potential. The side chain free energy is computed on the fly, allowing the protein backbone dynamics to traverse a greatly smoothed energetic landscape. This computation results in extremely rapid equilibration and sampling of the Boltzmann distribution. Our method, termed Upside, employs a reduced model involving the three backbone atoms, along with the carbonyl oxygen and amide proton, and a single (oriented) side chain bead having multiple locations reflecting the conformational diversity of the side chain's rotameric states. We also introduce a novel, maximum-likelihood method to parameterize the side chain interactions using protein structures. We demonstrate state-of-the-art accuracy for predicting χ1 rotamer states while consuming only milliseconds of CPU time. Our method enables rapidly equilibrating coarse-grained simulations that can nonetheless contain significant molecular detail. We also show that the resulting free energies of the side chains are sufficiently accurate for de novo folding of some proteins.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Six step inner loop of Upside calculation.
The side chain potential enters into the integration step simply as a complicated, many-body energy function that may be treated with standard techniques of molecular simulations.
Fig 2
Fig 2. Error in the position as a function of the number of side chain states, resulting from a decomposition of rotamer states into coarse-grained states.
The table summarizes the number of states chosen for each amino acid type. The relative uncertainty is the positional uncertainty for each number of states divided by the accuracy at three states. One, three, or six rotamer states are used, depending on the residue type. For residues without a rotatable χ2, such as valine, only three states are needed. The time to compute the pairwise interactions and solve for the free energy scales roughly as the number of coarse rotamer states squared, so the use of fewer coarse states is preferred. Ile, Leu and Lys are the three residues with rotatable χ2 where only 3 states are assigned.
Fig 3
Fig 3. Fragment of protein G with associated interaction graph (Rcutoff = 7Å).
A pair of residues is assigned a connection whenever their side chain beads are within Rcutoff for any side chain states.
Fig 4
Fig 4. Example of optimized coarse states for arginine overlaid on the PDB distribution of the rotamer angles χ1 and χ2.
Each of the six coarse states contains only a single fine state that has high probability, so that the variance of dihedral angles within each coarse state is small.
Fig 5
Fig 5. Coordinates and potentials used for side chain interactions.
Top. For each residue, a reference backbone structure is aligned to the N, Cα, and C atomic coordinates. This alignment creates a reference frame to establish the position and direction of the side chain bead. The two side chain beads x1 and x2 for a pair of residues establishes three coordinates, the distance r and angles θ1 and θ2. Bottom. Example of distance-dependent potential, unif(r12), after training, between the side chain and backbone residues. These interactions cutoff at 5 Å while the side chain-side chain interactions cutoff at 7 Å. The thin lines describe the Vradial of oxygen (red) and hydrogen (blue), and the thick lines describe Vradial + Vangular for the same interactions.
Fig 6
Fig 6. Comparison of χ1 prediction accuracy for Upside and SCWRL4, ordered by Upside accuracy.
The “PDB χ1 frequency” line represents the accuracy of the NDRD rotamer library without any interactions; this library is used in both Upside and SCWRL4.
Fig 7
Fig 7. Comparison of the accuracy of predicting side chains as well as cpu running time.
For all programs, time spent reading the protein structure and writing the results is excluded from the running time to focus on the cost of solving for the side chain positions. For Upside (10 Å cutoff), all side chain interactions with backbone or other side chains are cutoff at 10 Å.
Fig 8
Fig 8. Accuracy of MD simulations for three proteins at variable backbone hydrogen bond strength.
Results with and without side chain-backbone interactions are presented.
Fig 9
Fig 9. Closest structure to native protein (lowest Cα-RMSD) at optimal hydrogen bonding strength, with and without backbone-side chain hydrogen bonds.
For alpha3D, BBA and homeodomain with backbone-side chain hydrogen bonds, the optimal hydrogen bond strength and lowest Cα-RMSD are -2.0, -2.0 and -1.9 RT, and 3.6, 1.7 and 3 Å, respectively. Without these hydrogen bonds, the corresponding values are -2.0, -1.8 and -1.6 RT; and 2.7, 1.8, and 2.5 Å. Blue is the native structure and red is the simulation.

References

    1. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, et al. The protein data bank. Nucleic acids research. 2000;28(1):235–242. 10.1093/nar/28.1.235 - DOI - PMC - PubMed
    1. Hamelryck T, Borg M, Paluszewski M, Paulsen J, Frellsen J, Andreetta C, et al. Potentials of mean force for protein structure prediction vindicated, formalized and generalized. PloS one. 2010;5(11):e13714 10.1371/journal.pone.0013714 - DOI - PMC - PubMed
    1. Krivov GG, Shapovalov MV, Dunbrack RL. Improved prediction of protein side-chain conformations with SCWRL4. Proteins: Structure, Function, and Bioinformatics. 2009;77(4):778–795. 10.1002/prot.22488 - DOI - PMC - PubMed
    1. Liang S, Zheng D, Zhang C, Standley DM. Fast and accurate prediction of protein side-chain conformations. Bioinformatics. 2011;27(20):2913–2914. 10.1093/bioinformatics/btr482 - DOI - PMC - PubMed
    1. Liang S, Zhou Y, Grishin N, Standley DM. Protein side chain modeling with orientation-dependent atomic force fields derived by series expansions. Journal of computational chemistry. 2011;32(8):1680–1686. 10.1002/jcc.21747 - DOI - PMC - PubMed

Publication types