Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2025 Feb:90:102972.
doi: 10.1016/j.sbi.2024.102972. Epub 2025 Jan 2.

On the emergence of machine-learning methods in bottom-up coarse-graining

Affiliations
Review

On the emergence of machine-learning methods in bottom-up coarse-graining

Patrick G Sahrmann et al. Curr Opin Struct Biol. 2025 Feb.

Abstract

Machine-learning methods have gained significant attention in the computational chemistry community as a viable approach to molecular modeling and analysis. Recent successes in utilizing neural networks to learn atomistic force-fields which 'coarse-grain' electronic structure have inspired similar applications to the thermodynamic coarse-graining of chemical and biological systems. In this review, we discuss the current viability and challenges in using machine-learning methods to represent coarse-grained force-fields, as well as the utility of machine-learning in various aspects of coarse-grained modeling.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest None.

Figures

Figure 1.
Figure 1.
Current and future directions for ML in CG modeling. a) ML for optimal CG mappings, adapted from Ref. An autoencoding formalism is adopted in which the latent space of the autoencoder constitutes the CG configurational space (above); discrete optimization is performed to obtain a CG mapping (below). b) Modern ML CG models utilize equivariance and graph NNs to construct potential energies. c) Error analysis between AA and CG model (above) of the actin protein using ML-derived CVs (below), adapted from Ref. d) ML is utilized to enable backmapping from a CG configuration to an AA representation, adapted from Ref.
Figure 2.
Figure 2.
Future challenges for ML in CG modeling. (Left) Addressing transferability of CG FFs across thermodynamic state points, in which explicit dependence on thermodynamic variables must be learned. (Middle) Investigation of mode additivity for CG FFs. The combining rule protocol for interactions between Lennard-Jones particles (above) is a paradigmatic example of modularity in FFs, similar protocols for ML FFs have yet to be investigated. (Right) Application of ML FFs to biological systems. Ideally, CG models can be synthesized directly from protein sequence and resulting interactions with bioactive molecules can be accurately assessed. The HIV capsid protein assembly complex and inositol hexakisphosphate (IP6) polyanion are shown as an example.

References

    1. Jin J; Pak AJ; Durumeric AEP; Loose TD; Voth GA Bottom-up Coarse-Graining: Principles and Perspectives. J. Chem. Theory Comput. 2022, 18 (10), 5759–5791. - PMC - PubMed
    1. Giulini M; Menichetti R; Shell MS; Potestio R An Information-Theory-Based Approach for Optimal Model Reduction of Biomolecules. J. Chem. Theory Comput. 2020, 16 (11), 6795–6813. - PMC - PubMed
    1. Boninsegna L; Banisch R; Clementi C A Data-Driven Perspective on the Hierarchical Assembly of Molecular Structures. J. Chem. Theory Comput. 2018, 14 (1), 453–460. - PubMed
    1. Wu J; Xue W; Voth GA K-Means Clustering Coarse-Graining (KMC-CG): A Next Generation Methodology for Determining Optimal Coarse-Grained Mappings of Large Biomolecules. J. Chem. Theory Comput. 2023, 19 (23), 8987–8997. - PMC - PubMed
    1. Yang W; Templeton C; Rosenberger D; Bittracher A; Nuske F, et al. Slicing and Dicing: Optimal Coarse-Grained Representation to Preserve Molecular Kinetics. ACS Cent. Sci. 2023, 9 (2), 186–196. - PMC - PubMed

LinkOut - more resources