Molecular Latent Space Simulators for Distributed and Multimolecular Trajectories
- PMID: 37314375
- DOI: 10.1021/acs.jpca.3c01362
Molecular Latent Space Simulators for Distributed and Multimolecular Trajectories
Abstract
All atom molecular dynamics (MD) simulations offer a powerful tool for molecular modeling, but the short time steps required for numerical stability of the integrator place many interesting molecular events out of reach of unbiased simulations. The popular and powerful Markov state modeling (MSM) approach can extend these time scales by stitching together multiple short discontinuous trajectories into a single long-time kinetic model but necessitates a configurational coarse-graining of the phase space that entails a loss of spatial and temporal resolution and an exponential increase in complexity for multimolecular systems. Latent space simulators (LSS) present an alternative formalism that employs a dynamical, as opposed to configurational, coarse graining comprising three back-to-back learning problems to (i) identify the molecular system's slowest dynamical processes, (ii) propagate the microscopic system dynamics within this slow subspace, and (iii) generatively reconstruct the trajectory of the system within the molecular phase space. A trained LSS model can generate temporally and spatially continuous synthetic molecular trajectories at orders of magnitude lower cost than MD to improve sampling of rare transition events and metastable states to reduce statistical uncertainties in thermodynamic and kinetic observables. In this work, we extend the LSS formalism to short discontinuous training trajectories generated by distributed computing and to multimolecular systems without incurring exponential scaling in computational cost. First, we develop a distributed LSS model over thousands of short simulations of a 264-residue proteolysis-targeting chimera (PROTAC) complex to generate ultralong continuous trajectories that identify metastable states and collective variables to inform PROTAC therapeutic design and optimization. Second, we develop a multimolecular LSS architecture to generate physically realistic ultralong trajectories of DNA oligomers that can undergo both duplex hybridization and hairpin folding. These trajectories retain thermodynamic and kinetic characteristics of the training data while providing increased precision of folding populations and time scales across simulation temperature and ion concentration.
Similar articles
-
Tutorial on Molecular Latent Space Simulators (LSSs): Spatially and Temporally Continuous Data-Driven Surrogate Dynamical Models of Molecular Systems.J Phys Chem A. 2024 Nov 28;128(47):10299-10317. doi: 10.1021/acs.jpca.4c05389. Epub 2024 Nov 14. J Phys Chem A. 2024. PMID: 39540914
-
Molecular latent space simulators.Chem Sci. 2020 Aug 26;11(35):9459-9467. doi: 10.1039/d0sc03635h. Chem Sci. 2020. PMID: 34094212 Free PMC article.
-
RPnet: a reverse-projection-based neural network for coarse-graining metastable conformational states for protein dynamics.Phys Chem Chem Phys. 2022 Jan 19;24(3):1462-1474. doi: 10.1039/d1cp03622j. Phys Chem Chem Phys. 2022. PMID: 34985469
-
How to Run FAST Simulations.Methods Enzymol. 2016;578:213-25. doi: 10.1016/bs.mie.2016.05.032. Epub 2016 Jun 16. Methods Enzymol. 2016. PMID: 27497168 Review.
-
Detecting Functional Dynamics in Proteins with Comparative Perturbed-Ensembles Analysis.Acc Chem Res. 2019 Dec 17;52(12):3455-3464. doi: 10.1021/acs.accounts.9b00485. Epub 2019 Dec 3. Acc Chem Res. 2019. PMID: 31793290 Review.
Cited by
-
AlphaFold and Protein Folding: Not Dead Yet! The Frontier Is Conformational Ensembles.Annu Rev Biomed Data Sci. 2024 Aug;7(1):51-57. doi: 10.1146/annurev-biodatasci-102423-011435. Epub 2024 Jul 24. Annu Rev Biomed Data Sci. 2024. PMID: 38603560 Free PMC article. Review.
LinkOut - more resources
Full Text Sources