Machine-Learning-Assisted Free Energy Simulation of Solution-Phase and Enzyme Reactions

Xiaoliang Pan¹, Junjie Yang¹, Richard Van¹, Evgeny Epifanovsky², Junming Ho³, Jing Huang⁴, Jingzhi Pu⁵, Ye Mei^{6

7

8}, Kwangho Nam⁹, Yihan Shao¹

Affiliations

¹ Department of Chemistry and Biochemistry, University of Oklahoma, 101 Stephenson Parkway, Norman, Oklahoma 73019, United States.
² Q-Chem, Inc., 6601 Owens Drive, Suite 105, Pleasanton, California 94588, United States.
³ School of Chemistry, University of New South Wales, Sydney, NSW 2052, Australia.
⁴ Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, 18 Shilongshan Road, Hangzhou, Zhejiang 310024, China.
⁵ Department of Chemistry and Chemical Biology, Indiana University-Purdue University Indianapolis, 402 North Blackford Street, LD326, Indianapolis, Indiana 46202, United States.
⁶ State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai 200062, China.
⁷ NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China.
⁸ Collaborative Innovation Center of Extreme Optics, Shanxi University, Taiyuan, Shanxi 030006, China.
⁹ Department of Chemistry and Biochemistry, University of Texas at Arlington, Arlington, Texas 76019, United States.

PMID: 34468138
PMCID: PMC9070000
DOI: 10.1021/acs.jctc.1c00565

Machine-Learning-Assisted Free Energy Simulation of Solution-Phase and Enzyme Reactions

Xiaoliang Pan et al. J Chem Theory Comput. 2021.

. 2021 Sep 14;17(9):5745-5758.

doi: 10.1021/acs.jctc.1c00565. Epub 2021 Sep 1.

Authors

Xiaoliang Pan¹, Junjie Yang¹, Richard Van¹, Evgeny Epifanovsky², Junming Ho³, Jing Huang⁴, Jingzhi Pu⁵, Ye Mei^{6

7

8}, Kwangho Nam⁹, Yihan Shao¹

Affiliations

¹ Department of Chemistry and Biochemistry, University of Oklahoma, 101 Stephenson Parkway, Norman, Oklahoma 73019, United States.
² Q-Chem, Inc., 6601 Owens Drive, Suite 105, Pleasanton, California 94588, United States.
³ School of Chemistry, University of New South Wales, Sydney, NSW 2052, Australia.
⁴ Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, 18 Shilongshan Road, Hangzhou, Zhejiang 310024, China.
⁵ Department of Chemistry and Chemical Biology, Indiana University-Purdue University Indianapolis, 402 North Blackford Street, LD326, Indianapolis, Indiana 46202, United States.
⁶ State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai 200062, China.
⁷ NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China.
⁸ Collaborative Innovation Center of Extreme Optics, Shanxi University, Taiyuan, Shanxi 030006, China.
⁹ Department of Chemistry and Biochemistry, University of Texas at Arlington, Arlington, Texas 76019, United States.

PMID: 34468138
PMCID: PMC9070000
DOI: 10.1021/acs.jctc.1c00565

Abstract

Despite recent advances in the development of machine learning potentials (MLPs) for biomolecular simulations, there has been limited effort on developing stable and accurate MLPs for enzymatic reactions. Here we report a protocol for performing machine-learning-assisted free energy simulation of solution-phase and enzyme reactions at the ab initio quantum-mechanical/molecular-mechanical (ai-QM/MM) level of accuracy. Within our protocol, the MLP is built to reproduce the ai-QM/MM energy and forces on both QM (reactive) and MM (solvent/enzyme) atoms. As an alternative strategy, a delta machine learning potential (ΔMLP) is trained to reproduce the differences between the ai-QM/MM and semiempirical (se) QM/MM energies and forces. To account for the effect of the condensed-phase environment in both MLP and ΔMLP, the DeePMD representation of a molecular system is extended to incorporate the external electrostatic potential and field on each QM atom. Using the Menshutkin and chorismate mutase reactions as examples, we show that the developed MLP and ΔMLP reproduce the ai-QM/MM energy and forces with errors that on average are less than 1.0 kcal/mol and 1.0 kcal mol^-1 Å^-1, respectively, for representative configurations along the reaction pathway. For both reactions, MLP/ΔMLP-based simulations yielded free energy profiles that differed by less than 1.0 kcal/mol from the reference ai-QM/MM results at only a fraction of the computational cost.

PubMed Disclaimer

Figures

**Figure 1:**
Workflow for the training of MLP and its use to generate energy and forces for MD simulations. Q_i is the ESP charge on QM atom i, which is fitted using the electrostatic potential ϕ_B on inner MM atom positions.

**Figure 2:**
Schemes for (a) Menshutkin and (b) chorismate mutase reactions.

**Figure 3:**
Conservation of the total energy in 100ps NVE simulations of the chorismate mutase reaction using A) PM3*, B) MLP, and C) PM3*+ΔMLP models. In each figure, the line shown in orange indicates the drift of energy (see the values mentioned in the main text for each model).

**Figure 4:**
Accuracy of MLP (top), PM3*+ΔMLP (middle), PM3 and PM3* (bottom) energy, forces, electrostatic potential (ϕ) and electric field ( $E$ ) for the 2,000 testing configurations for aqueous Menshutkin reaction. In each figure, the reference values are obtained from the B3LYP/6–31G*/MM calculations. The root-mean-square error (RMSE) value is also shown for each method.

**Figure 5:**
Distribution of high-level (B3LYP/6–31G*) and low-level (PM3*, MLP, and PM3*+ΔMLP energy differences for configuration collected from B3LYP/MM MD trajectories (blue) or low-level MD trajectories (orange) of the aqueous Menshutkin reaction.

**Figure 6:**
(A) Sampled pathway and (B) potential of mean force for the aqueous Menshutkin reaction based on umbrella sampling using PM3*+ΔMLP and MLP potentials in comparison to PM3* and B3LYP/6–31G* results. The pathways are represented by the average bond lengths for each of the 80 windows in the umbrella sampling simulations (shown in Figure S2). The stars show the locations of the transition states on the pathways.

**Figure 7:**
Accuracy of MLP (top), PM3*+ΔMLP (middle), PM3 and PM3* (bottom) energy and forces for the 2,000 testing configurations for the chorismate mutase reaction.

**Figure 8:**
Distribution of high-level (B3LYP/6–31G*) and low-level (PM3*, MLP, and PM3*+ΔMLP energy differences for configuration collected from B3LYP/MM MD trajectories (blue) or low-level MD trajectories (orange) of chorismate mutase reaction.

**Figure 9:**
(A) Sampled pathway and (B) potential of mean force for the chorismate mutase reaction based on umbrella sampling using PM3*, PM3*+ΔMLP and MLP (the 2nd iteration) potentials in comparison to B3LYP/6–31G* results. The pathways are represented by the average bond lengths for each of the 80 windows in the umbrella sampling simulations (in Figure S3). The stars show the locations of the transition states on the pathways.

See this image and copyright information in PMC

References

1. Gao J; Ma S; Major DT; Nam K; Pu J; Truhlar DG Mechanisms and Free Energies of Enzymatic Reactions. Chem. Rev 2006, 106, 3188–3209. - PMC - PubMed
1. Warshel A Computer Simulations of Enzyme Catalysis: Methods, Progress, and Insights. Ann. Rev. Biophys. Biomol. Struct 2003, 32, 425–443. - PubMed
1. Klähn M; Braun-Sand S; Rosta E; Warshel A On Possible Pitfalls in ab Initio Quantum Mechanics/Molecular Mechanics Minimization Approaches for Studies of Enzymatic Reactions. J. Phys. Chem. B 2005, 109, 15645–15650. - PMC - PubMed
1. Lin H; Truhlar DG QM/MM: What Have We Learned, Where Are We, and Where Do We Go From Here? Theor. Chem. Acc 2007, 117, 185–199.
1. Hu H; Yang W Free Energies of Chemical Reactions in Solution and in Enzymes with Ab Initio Quantum Mechanics/Molecular Mechanics Methods. Ann. Rev. Phys. Chem 2008, 59, 573–601. - PMC - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Machine-Learning-Assisted Free Energy Simulation of Solution-Phase and Enzyme Reactions

Affiliations

Machine-Learning-Assisted Free Energy Simulation of Solution-Phase and Enzyme Reactions

Authors

Affiliations

Abstract

Figures

References

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Research Materials