Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 27;2(4):316-330.
doi: 10.1021/acsphyschemau.2c00005. Epub 2022 May 18.

Mechanistic Insights into Enzyme Catalysis from Explaining Machine-Learned Quantum Mechanical and Molecular Mechanical Minimum Energy Pathways

Affiliations

Mechanistic Insights into Enzyme Catalysis from Explaining Machine-Learned Quantum Mechanical and Molecular Mechanical Minimum Energy Pathways

Zilin Song et al. ACS Phys Chem Au. .

Abstract

With the increasing popularity of machine learning (ML) applications, the demand for explainable artificial intelligence techniques to explain ML models developed for computational chemistry has also emerged. In this study, we present the development of the Boltzmann-weighted cumulative integrated gradients (BCIG) approach for effective explanation of mechanistic insights into ML models trained on high-level quantum mechanical and molecular mechanical (QM/MM) minimum energy pathways. Using the acylation reactions of the Toho-1 β-lactamase and two antibiotics (ampicillin and cefalexin) as the model systems, we show that the BCIG approach could quantitatively attribute the energetic contribution in one system and the relative reactivity of individual steps across different systems to specific chemical processes such as the bond making/breaking and proton transfers. The proposed BCIG contribution attribution method quantifies chemistry-interpretable insights in terms of contributions from each elementary chemical process, which is in agreement with the validating QM/MM calculations and our intuitive mechanistic understandings of the model reactions.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
Toho-1 β-lactamase, the β-lactam antibiotics, and the acylation pathways of Toho-1. (a) Toho-1/β-lactam complex and the selection of QM atoms. The carbon atoms of the amino acid residues are colored in dark gray. The Cα–Cβ bonds (used as the QM/MM boundary for the amino acid side chains) are colored in cyan. The carbon atoms of the β-lactam ligand are colored in magenta, except the carbonyl carbon in β-lactams, which is colored in dark purple. The catalytic water molecule is uniformly colored in green. All nitrogen, oxygen, sulfur, and hydrogen atoms are colored in blue, red, yellow, and white, respectively. All hydrogen atoms in the ligand are omitted for clarity. (b) Chemical structures of AMP and CEX. The penam and cephem scaffolds are colored in red; (c) Acylation pathways using Glu166 as the general base (R1-AE) and using Glu166/Lys73 concertedly as the general base (R2-AE).
Figure 2
Figure 2
Selected features, the 2D representation of the pathway conformations, and the architecture of the ML-MEP models. The selected features and chemical processes of (a) Toho/AMP: R1-AE and Toho/CEX: R1-AE datasets and (b) Toho/AMP: R2-AE and Toho/CEX: R2-AE datasets. The atomic distances that are included in feature vectors are noted in orange lines and the chemical processes are noted in blue; see also Table S1. (c) 2D principal component dimensionality reduction of the pairwise inter-heavy-atom distances in the QM region, and a schematic demonstration for the loss of pathway context of the replicas; (d) architecture of the QM/MM MEP learning deep-and-wide neural network.
Figure 3
Figure 3
The distribution of the acylation barriers (ΔE) at B3LYP-D3/6-31+G**/C36 level of theory. (a) Toho/AMP: R1-AE acylation pathways; (b) Toho/CEX: R1-AE acylation pathways; (c) Toho/AMP: R2-AE acylation pathways; and (d) Toho/CEX: R2-AE acylation pathways. The scatters show the locations of the energy barriers. The width of the histograms is 4 kcal mol–1. The red curves note the density estimation from the GMMs. The labels “min. ΔE”, “expo. formula image”, “mean formula image”, “med. ΔE”, and “std” refer to the minimum, exponential average, arithmetic average, median, and standard deviation of the energy barriers from the QM/MM MEP profiles, respectively.
Figure 4
Figure 4
Predictive performance and the BCIG contributions of the ML-MEP models. The predictive performance of (a) replica energies and (b) pathway barriers of (left to right) the Toho/AMP: R1-AE, Toho/CEX: R1-AE, Toho/AMP: R2-AE, and Toho/CEX: R2-AE models. The BCIG contributions of the models: (c) Toho/AMP: R1-AE; (d) Toho/CEX: R1-AE; (e) Toho/AMP: R2-AE; and (f) Toho/CEX: R2-AE. The rankings (highest to lowest) of the BCIG contributions are noted below.
Figure 5
Figure 5
Predictive performance and the BCIG contributions of the unified ML-MEP models. The predictive performance of (left to right) the replica energies and the pathway barriers of (a) Toho/AMP&CEX: R1-AE and (b) Toho/AMP&CEX: R2-AE models. The BCIG contributions of (c) Toho/AMP&CEX: R1-AE and (d) Toho/AMP&CEX: R2-AE models.
Figure 6
Figure 6
Atomic distances between the ChElPG charges on critical heavy atoms in all reactant states at the B3LYP-D3/6-31+G**/C36 level. The (a) atomic distances and (b) ChElPG charge profiles of the Toho/AMP: R1-AE and Toho/CEX: R1-AE pathways. The (c) atomic distances and (d) ChElPG charge profiles of the Toho/AMP: R2-AE and Toho/CEX: R2-AE pathways. The meanings of the labels on x-axes are defined in the symbolic legend (right panel).

Similar articles

Cited by

References

    1. Butler K. T.; Davies D. W.; Cartwright H.; Isayev O.; Walsh A. Machine Learning for Molecular and Materials Science. Nature 2018, 559, 547–555. 10.1038/s41586-018-0337-2. - DOI - PubMed
    1. Vamathevan J.; Clark D.; Czodrowski P.; Dunham I.; Ferran E.; Lee G.; Li B.; Madabhushi A.; Shah P.; Spitzer M.; Zhao S. Applications of Machine Learning in Drug Discovery and Development. Nat. Rev. Drug Discovery 2019, 18, 463–477. 10.1038/s41573-019-0024-5. - DOI - PMC - PubMed
    1. Behler J. Four Generations of High-Dimensional Neural Network Potentials. Chem. Rev. 2021, 121, 10037–10072. 10.1021/acs.chemrev.0c00868. - DOI - PubMed
    1. Keith J. A.; Vassilev-Galindo V.; Cheng B.; Chmiela S.; Gastegger M.; Müller K.-R.; Tkatchenko A. Combining Machine Learning and Computational Chemistry for Predictive Insights into Chemical Systems. Chem. Rev. 2021, 121, 9816–9872. 10.1021/acs.chemrev.1c00107. - DOI - PMC - PubMed
    1. Li Z.; Kermode J. R.; De Vita A. Molecular Dynamics with On-the-Fly Machine Learning of Quantum-Mechanical Forces. Phys. Rev. Lett. 2015, 114, 096405.10.1103/PhysRevLett.114.096405. - DOI - PubMed

LinkOut - more resources