Nat Commun. 2022 Nov 19;13(1):7101.
doi: 10.1038/s41467-022-34603-z.

Deep learning to decompose macromolecules into independent Markovian domains


Andreas Mardt et al. Nat Commun. 2022.

Abstract

The increasing interest in modeling the dynamics of ever larger proteins has revealed a fundamental problem with models that describe the molecular system as being in a global configuration state. This notion limits our ability to gather sufficient statistics of state probabilities or state-to-state transitions because for large molecular systems the number of metastable states grows exponentially with size. In this manuscript, we approach this challenge by introducing a method that combines our recent progress on independent Markov decomposition (IMD) with VAMPnets, a deep learning approach to Markov modeling. We establish a training objective that quantifies how well a given decomposition of the molecular system into independent subdomains with Markovian dynamics approximates the overall dynamics. By constructing an end-to-end learning framework, the decomposition into such subdomains and their individual Markov state models are simultaneously learned, providing a data-efficient and easily interpretable summary of the complex system dynamics. While learning the dynamical coupling between Markovian subdomains is still an open issue, the present results are a significant step towards learning Ising models of large molecular complexes from simulation data.
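As background to the training objective mentioned above: the VAMPnets referenced here are trained by maximizing a variational (VAMP) score of learned fuzzy state assignments. The sketch below computes a standard VAMP-2 score from assignment time series. It illustrates the general scoring idea only, not the paper's exact iVAMPnet objective (which combines per-subsystem scores with an independence constraint); the function name and the regularizer `eps` are assumptions.

```python
import numpy as np

def vamp2_score(chi_t, chi_tau, eps=1e-6):
    """VAMP-2 score of fuzzy state assignments chi(x_t), chi(x_{t+tau}).

    Score = squared Frobenius norm of the whitened Koopman matrix
    C00^{-1/2} C01 C11^{-1/2}, computed from mean-free features.
    """
    chi_t = chi_t - chi_t.mean(axis=0)
    chi_tau = chi_tau - chi_tau.mean(axis=0)
    n = len(chi_t)
    c00 = chi_t.T @ chi_t / n      # instantaneous covariance at time t
    c11 = chi_tau.T @ chi_tau / n  # instantaneous covariance at t + tau
    c01 = chi_t.T @ chi_tau / n    # time-lagged covariance

    def inv_sqrt(c):
        # Regularized inverse matrix square root via eigendecomposition.
        w, v = np.linalg.eigh(c)
        w = np.maximum(w, eps)
        return v @ np.diag(w ** -0.5) @ v.T

    k = inv_sqrt(c00) @ c01 @ inv_sqrt(c11)
    return (k ** 2).sum()

# Example: a perfectly self-correlated 2-state assignment has one
# nontrivial process, so the score approaches 1.
rng = np.random.default_rng(0)
x = (rng.uniform(size=500) < 0.5).astype(float)
chi = np.stack([x, 1 - x], axis=1)
score = vamp2_score(chi, chi)
```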


Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. The iVAMP concept as visualized by modeling dynamics of a protein that has two independent, flexible regions separated by a rigid barrel.
iVAMPnets learn an assignment of the C- (blue/top) and N-termini (green/bottom) into independent subsystems from molecular dynamics trajectories (left column). Moreover, the dynamics of both termini are modeled with statistically independent VAMPnets (right column).
Fig. 2
Fig. 2. Architecture of an iVAMPnet for N subsystems, where trainable parts are shaded green.
Two network lobes process the configuration pairs xt (light) and xt+τ (dark) with shared weights. First, the input features are weighted element-wise, Ȳt = G ⊙ xt, with a mask G ∈ ℝ^(D×N), where each subsystem learns its individual weighting. The mask values can be interpreted as the probabilities that an input feature belongs to a given subsystem. To prevent the subsequent neural networks from reversing the effect of the mask, we draw for each input feature i and subsystem j an independent, normally distributed random variable ϵij ~ N(0, σ(1 − Gij)) and add this noise to the weighted features, Yt = Ȳt + ϵ. The attention weight thereby interpolates linearly between input feature and Gaussian noise: if Gij = 1, Yij carries exclusively the input feature xi; if Gij = 0, Yij is pure Gaussian noise. Afterwards, the transformed feature vector is split per subsystem, Yt = [Yt^1, ..., Yt^N], and passed through the subsystem-specific neural network ηi. We call the whole transformation for subsystem i the fuzzy state assignment χi(xt) = ηi(Yt^i).
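The masking step described in this caption can be sketched in NumPy. The dimensions `D` and `N`, the helper `mask_with_noise`, and the noise scale `sigma` are illustrative assumptions; whether σ(1 − Gij) denotes a variance or a standard deviation is an implementation choice (the sketch treats it as a variance).

```python
import numpy as np

# Hypothetical sizes: D input features, N subsystems.
D, N = 6, 2
rng = np.random.default_rng(0)

# Trainable mask G in R^{D x N}; rows normalized so entries act as
# per-feature subsystem-membership probabilities.
G = rng.uniform(size=(D, N))
G /= G.sum(axis=1, keepdims=True)

def mask_with_noise(x, G, sigma=1.0, rng=rng):
    """Weight features by the mask and blend with Gaussian noise.

    For feature i and subsystem j: Y_ij = G_ij * x_i + eps_ij with
    eps_ij ~ N(0, sigma * (1 - G_ij)). G_ij = 1 keeps the feature
    intact; G_ij = 0 yields pure noise, so downstream networks
    cannot undo the masking.
    """
    weighted = G * x[:, None]                          # element-wise weighting
    eps = rng.normal(0.0, np.sqrt(sigma * (1.0 - G)))  # noise grows as G_ij -> 0
    return weighted + eps                              # shape (D, N)

x_t = rng.normal(size=D)
Y_t = mask_with_noise(x_t, G)
# Split per subsystem: column Y_t[:, j] feeds the subsystem network eta_j.
subsystem_inputs = [Y_t[:, j] for j in range(N)]
```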
Fig. 3
Fig. 3. Hidden Markov state model as a benchmark example for independent subsystems.
a Two subsystems with 2 and 3 states emit independently onto the x and y axes, respectively. The corresponding 2D space embeds all 6 global states. b The learned mask, depicted in gray-scale from 0 (white) to 1 (black), shows that each subsystem focuses on one input dimension. c The estimated subsystem transition matrices are compared with the ground truth (in percent). d Subsystem eigenfunctions (color-coded) and corresponding eigenvalues (printed numbers) as found by the iVAMPnet. The independent processes are recovered from the 2D data.
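This benchmark setup can be illustrated with a short NumPy sketch. The transition matrices `T1` and `T2` and the helper `simulate` are invented for illustration, not the paper's actual parameters; the key point is that, for independent subsystems, the 2 × 3 = 6-state global transition matrix factorizes as a Kronecker product.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical transition matrices: a 2-state and a 3-state subsystem.
T1 = np.array([[0.9, 0.1],
               [0.2, 0.8]])
T2 = np.array([[0.8, 0.1, 0.1],
               [0.1, 0.8, 0.1],
               [0.05, 0.15, 0.8]])

def simulate(T, n_steps, rng):
    """Sample a discrete Markov chain with transition matrix T."""
    states = np.empty(n_steps, dtype=int)
    states[0] = 0
    for t in range(1, n_steps):
        states[t] = rng.choice(len(T), p=T[states[t - 1]])
    return states

n = 2000
s1, s2 = simulate(T1, n, rng), simulate(T2, n, rng)

# Global state index over the 2 x 3 = 6 product states.
global_states = s1 * 3 + s2

# Independence means the global transition matrix is T1 (x) T2.
T_global = np.kron(T1, T2)   # shape (6, 6)
```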
Fig. 4
Fig. 4. Hidden Markov state model with 1024 global states forming a 10D hypercube embedded in a 20D space.
a The hypercube is composed of ten independent 2-state subsystems. Each pair of two subsystems lives in a common rotated 2D manifold; therefore, two subsystems need the same input features to be well approximated. b 2D depiction of the hypercube in an orthographic projection, where the global system can jump freely between all 1024 vertices, and the ten 2-state models retrieved from it by the iVAMPnet (colors denote subsystem identity). c Learned mask, depicted in gray-scale from 0 (white) to 1 (black), assigning inputs to subsystems (color-coded). For each subsystem, the network assigns two highly important input features which are shared with exactly one other subsystem, mirroring the rotated input space. Noise dimensions (x10-x19) are assigned low importance values. d Implied timescales as a function of the model lag time (both in arbitrary units, a.u.) of all ten subsystems learned by our method (dots) approximate the underlying true timescales (lines). Timescales are color-coded by index.
Fig. 5
Fig. 5. iVAMPnet of synaptotagmin-C2A with two subsystems and twelve and six states, respectively.
a Importance values of the trainable mask depicted on the color-coded protein secondary structure, indicating assignment to subsystem I (green) or subsystem II (blue). b Implied timescales of the two subsystems with 90% percentile intervals over 20 runs (dot markers denote means), color-coded by index. c Superposed representative structures of both extrema of the slowest resolved eigenfunctions of each subsystem (residues not assigned a high importance value or not showing significant movement are omitted for clarity). The slowest process of subsystem I exchanges between the green and gray structures, showing an orchestrated movement of the full Calcium Binding Region (CBR1, CBR2, and CBR3). The slowest process of subsystem II occurs between the blue and gray structures and describes a combined movement of the loops C78 and C34.
Fig. 6
Fig. 6. Attention scheme for amino acid chain.
Windows of size B are placed along the chain with a step size of s, resulting in W windows. A trainable weight g ∈ ℝ^(W×N) is assigned to each window in each subsystem; the weights are made positive and normalized along the window axis through a softmax, ḡ = softmax(g, dim=0). Here a window size of B = 4 and a step size of s = 2 are chosen. As a consequence, the weight of the amino acid glutamine (Q) is given as the product of the two windows it is part of, g(Q) = ḡi ⊙ ḡi+1, where the multiplication is executed element-wise for each subsystem. The choice of the step size determines how many neighboring amino acids have exactly the same weight within a subsystem, which applies here to the tyrosine (Y). Together with the window size, this regulates how many residues share parts of their weights. Hence, the serine (S) shares the weight ḡi+1 with the previous two amino acids, g(S) = ḡi+1 ⊙ ḡi+2, which has a smoothing effect on the attention mechanism along the chain.
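A minimal NumPy sketch of this windowed attention scheme follows. The function `window_attention` and the specific chain length are illustrative assumptions; it shows how residues covered by the same set of windows end up with identical weights.

```python
import numpy as np

def window_attention(g, L, B=4, s=2):
    """Per-residue attention weights from overlapping window weights.

    g: raw trainable weights, shape (W, N) for W windows and N subsystems.
    Each residue's weight is the product of the softmax-normalized
    weights of all windows that cover it.
    """
    # Softmax along the window axis, independently per subsystem.
    g_bar = np.exp(g) / np.exp(g).sum(axis=0, keepdims=True)
    W, N = g_bar.shape
    weights = np.ones((L, N))
    for w in range(W):
        start = w * s
        # Residues start .. start+B-1 lie in window w; multiply in its weight.
        weights[start:start + B] *= g_bar[w]
    return weights

# Hypothetical chain of L = 10 residues, B = 4, s = 2 -> W = 4 windows.
L, B, s = 10, 4, 2
W = (L - B) // s + 1
rng = np.random.default_rng(2)
g = rng.normal(size=(W, 2))
res_weights = window_attention(g, L, B, s)
# Residues 0 and 1 are covered only by window 0, so their weights match;
# residues 2 and 3 share windows 0 and 1, giving the smoothing effect.
```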

