Lifelong Machine Learning Potentials

Marco Eckhoff¹, Markus Reiher¹

Affiliations

PMID: 37288932
PMCID: PMC10308836
DOI: 10.1021/acs.jctc.3c00279

Lifelong Machine Learning Potentials

Marco Eckhoff et al. J Chem Theory Comput. 2023.

. 2023 Jun 27;19(12):3509-3525.

doi: 10.1021/acs.jctc.3c00279. Epub 2023 Jun 8.

Authors

Marco Eckhoff¹, Markus Reiher¹

Affiliation

¹ ETH Zürich, Departement Chemie und Angewandte Biowissenschaften, 8093 Zürich, Switzerland.

PMID: 37288932
PMCID: PMC10308836
DOI: 10.1021/acs.jctc.3c00279

Abstract

Machine learning potentials (MLPs) trained on accurate quantum chemical data can retain the high accuracy, while inflicting little computational demands. On the downside, they need to be trained for each individual system. In recent years, a vast number of MLPs have been trained from scratch because learning additional data typically requires retraining on all data to not forget previously acquired knowledge. Additionally, most common structural descriptors of MLPs cannot represent efficiently a large number of different chemical elements. In this work, we tackle these problems by introducing element-embracing atom-centered symmetry functions (eeACSFs), which combine structural properties and element information from the periodic table. These eeACSFs are key for our development of a lifelong machine learning potential (lMLP). Uncertainty quantification can be exploited to transgress a fixed, pretrained MLP to arrive at a continuously adapting lMLP, because a predefined level of accuracy can be ensured. To extend the applicability of an lMLP to new systems, we apply continual learning strategies to enable autonomous and on-the-fly training on a continuous stream of new data. For the training of deep neural networks, we propose the continual resilient (CoRe) optimizer and incremental learning strategies relying on rehearsal of data, regularization of parameters, and the architecture of the model.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

**Figure 1**
S_N2 reaction at a methyl carbon atom with leaving group X and nucleophile Y.

**Figure 2**
Convergence of the optimizers Adam, RPROP, and CoRe for training reference data set B. The test set RMSE values of (a) energies E^test and (b) atomic force components F_α,n^test are shown as a function of the training epoch n_epoch. RMSE values of individual HDNNPs are represented by dots, while their mean, which is unequal to the ensemble RMSE value, is shown by a solid line. Lifelong adaptive data selection was applied in the optimizations.

**Figure 3**
Training data set reduction of the optimizers Adam, RPROP, and CoRe for training reference data set B. The number of considered training conformations N^train is shown as a function of the training epoch n_epoch. The values of N^train of individual HDNNPs are represented by dots, while their mean is shown by a solid line. The black dashed line represents the number of training conformations which was used for fitting in each epoch.

**Figure 4**
Final accuracy of lMLPs for which a fraction of p_late training data of reference data set B was first available at a late training epoch. These data were either chosen randomly or a certain block was used. More detailed information about the procedure is provided in the main text. The test RMSE values of (a) energies E^test and (b) atomic force components F_α,n^test are shown as a function of the late data fraction p_late. RMSE values of individual HDNNPs are represented by dots, their mean by a solid line, and their range by a lighter colored band. The CoRe optimizer (β₁^b = 0.7 for “Block” with p_late > 0 and β₁^b = 0.725 otherwise) and lifelong adaptive data selection were applied for 1500 epochs.

**Figure 5**
Potential energy surface of the S_N2 reaction Br^– + CH₃Cl ⇌ BrCH₃ + Cl^–. The lMLP ensemble prediction energy E is referenced to the minimum DFT reference energy E_min^ref of the given conformation space spanned by DFT optimized structures with constrained distances r_Cl–C and r_Br–C. The color represents the error of E with respect to the DFT reference energy E^ref. Black dots show the explicit evaluations of the lMLP. The colors between black dots are interpolated.

**Figure 6**
Absolute values of the errors with respect to the DFT reference and the uncertainty quantification for the ensemble prediction of (a) energies |ΔE| and (b) atomic force components |ΔF_α,n| of the validation data. The order x sorts the validation conformations according to their uncertainty quantification.

See this image and copyright information in PMC

References

1. Cramer C. J.Essentials of Computational Chemistry: Theories and Models; Wiley, 2013.
1. Jensen F.Introduction to Computational Chemistry; Wiley, 2017.
1. Burke K. Perspective on density functional theory. J. Chem. Phys. 2012, 136, 150901. 10.1063/1.4704546. - DOI - PubMed
1. Riplinger C.; Pinski P.; Becker U.; Valeev E. F.; Neese F. Sparse maps–A systematic infrastructure for reduced-scaling electronic structure methods. II. Linear scaling domain based pair natural orbital coupled cluster theory. J. Chem. Phys. 2016, 144, 024109. 10.1063/1.4939030. - DOI - PubMed
1. Das S.; Motamarri P.; Gavini V.; Turcksin B.; Li Y. W.; Leback B.. Fast, scalable and accurate finite-element based ab initio calculations using mixed precision computing: 46 PFLOPS simulation of a metallic dislocation system. International Conference for High Performance Computing, Networking, Storage and Analysis (SC19). Denver, CO, USA, 2019.

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Lifelong Machine Learning Potentials

Affiliation

Lifelong Machine Learning Potentials

Authors

Affiliation

Abstract

Conflict of interest statement

Figures

References

LinkOut - more resources

Full Text Sources