Machine learned coarse-grained protein force-fields: Are we there yet?

Affiliations

¹ Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany.
² Department of Physics and Astronomy, Rice University, 6100 Main Street, Houston, 77005, Texas, USA; Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany; Center for Theoretical Biological Physics, Rice University, 6100 Main Street, Houston, 77005, Texas, USA.
³ Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany. Electronic address: https://twitter.com/pbrun03.
⁴ Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany. Electronic address: https://twitter.com/FelixMusil.
⁵ Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany.
⁶ Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany. Electronic address: https://twitter.com/sayeg84.
⁷ Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany. Electronic address: https://twitter.com/hello_yaoyi.
⁸ Microsoft Research AI4Science, Karl-Liebknecht Str. 32, Berlin, 10178, Berlin, Germany; Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany; Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany; Department of Chemistry, Rice University, 6100 Main Street, Houston, 77005, Texas, USA. Electronic address: https://twitter.com/FrankNoeBerlin.
⁹ Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany; Center for Theoretical Biological Physics, Rice University, 6100 Main Street, Houston, 77005, Texas, USA; Department of Chemistry, Rice University, 6100 Main Street, Houston, 77005, Texas, USA; Department of Physics and Astronomy, Rice University, 6100 Main Street, Houston, 77005, Texas, USA. Electronic address: c.clementi@fu-berlin.de.

PMID: 36731338
PMCID: PMC10023382
DOI: 10.1016/j.sbi.2023.102533

Review

Machine learned coarse-grained protein force-fields: Are we there yet?

Aleksander E P Durumeric et al. Curr Opin Struct Biol. 2023 Apr.

. 2023 Apr:79:102533.

doi: 10.1016/j.sbi.2023.102533. Epub 2023 Jan 31.

Authors

Affiliations

¹ Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany.
² Department of Physics and Astronomy, Rice University, 6100 Main Street, Houston, 77005, Texas, USA; Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany; Center for Theoretical Biological Physics, Rice University, 6100 Main Street, Houston, 77005, Texas, USA.
³ Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany. Electronic address: https://twitter.com/pbrun03.
⁴ Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany. Electronic address: https://twitter.com/FelixMusil.
⁵ Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany.
⁶ Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany. Electronic address: https://twitter.com/sayeg84.
⁷ Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany. Electronic address: https://twitter.com/hello_yaoyi.
⁸ Microsoft Research AI4Science, Karl-Liebknecht Str. 32, Berlin, 10178, Berlin, Germany; Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany; Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany; Department of Chemistry, Rice University, 6100 Main Street, Houston, 77005, Texas, USA. Electronic address: https://twitter.com/FrankNoeBerlin.
⁹ Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany; Center for Theoretical Biological Physics, Rice University, 6100 Main Street, Houston, 77005, Texas, USA; Department of Chemistry, Rice University, 6100 Main Street, Houston, 77005, Texas, USA; Department of Physics and Astronomy, Rice University, 6100 Main Street, Houston, 77005, Texas, USA. Electronic address: c.clementi@fu-berlin.de.

PMID: 36731338
PMCID: PMC10023382
DOI: 10.1016/j.sbi.2023.102533

Abstract

The successful recent application of machine learning methods to scientific problems includes the learning of flexible and accurate atomic-level force-fields for materials and biomolecules from quantum chemical data. In parallel, the machine learning of force-fields at coarser resolutions is rapidly gaining relevance as an efficient way to represent the higher-body interactions needed in coarse-grained force-fields to compensate for the omitted degrees of freedom. Coarse-grained models are important for the study of systems at time and length scales exceeding those of atomistic simulations. However, the development of transferable coarse-grained models via machine learning still presents significant challenges. Here, we discuss recent developments in this field and current efforts to address the remaining challenges.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

**Figure 1:**
Sequential reduction in resolution of a variant of the miniprotein Chignolin (CLN025) from a solvated all-atom representation containing many thousands of atoms, to an implicit solvent representation, to a heavy-backbone representation with C_β beads, and finally to a C_α CG representation containing 10 beads.

**Figure 2:**
A pipeline for creating and using ML CG models from atomistic simulation data and experimental measurements. A chosen CG mapping can reduce reference information into a CG dataset that can be used to train ML CG models. This training can rely on both simulation and experimental observables in order to reduce the complexity of the learning task and respect physical constraints. A trained ML CG model can then be validated through CG MD and used for general property predictions.

**Figure 3:**
State-of-the-art performance for a C_α CG ML model on the benchmark protein CLN025. A) Comparison of the CG free energy landscape of CLN025 (produced using MD) for a learned CG ML model with the corresponding free energy for the reference all-atom dataset projected onto slow degrees of freedom (TICA) [74]. B) Ensembles of structures sampled from the CG ML model MD simulation (in red) are superimposed onto all-atom reference structure counterparts (in blue). Basin 1 represents the unfolded state, basin 2 the misfolded state, and basin 3 the folded state.

See this image and copyright information in PMC

References

1. Levitt M, Warshel A, Computer simulation of protein folding, Nature 253 (5494) (1975) 694–698. - PubMed
1. Clementi C, Coarse-grained models of protein folding: toy models or predictive tools?, Curr. Opin. Struct. Biol 18 (1) (2008) 10–15. - PubMed
1. Bryngelson JD, Wolynes PG, Spin glasses and the statistical mechanics of protein folding., Proc. Natl. Acad. Sci. USA 84 (21) (1987) 7524–7528. - PMC - PubMed
1. Onuchic JN, Luthey-Schulten Z, Wolynes PG, Theory of Protein Folding: The energy landscape perspective, Annu. Rev. Phys. Chem 48 (1) (1997) 545–600. - PubMed
1. Dill KA, Bromberg S, Yue K, Chan HS, Ftebig KM, Yee DP, Thomas PD, Principles of protein folding — a perspective from simple exact models, Protein Science 4 (4) (1995) 561–602. doi:10.1002/pro.5560040401. - DOI - PMC - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions

Grants and funding

T15 LM007093/LM/NLM NIH HHS/United States

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Machine learned coarse-grained protein force-fields: Are we there yet?

Affiliations

Machine learned coarse-grained protein force-fields: Are we there yet?

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources