Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jun 27;19(12):3509-3525.
doi: 10.1021/acs.jctc.3c00279. Epub 2023 Jun 8.

Lifelong Machine Learning Potentials

Affiliations

Lifelong Machine Learning Potentials

Marco Eckhoff et al. J Chem Theory Comput. .

Abstract

Machine learning potentials (MLPs) trained on accurate quantum chemical data can retain the high accuracy, while inflicting little computational demands. On the downside, they need to be trained for each individual system. In recent years, a vast number of MLPs have been trained from scratch because learning additional data typically requires retraining on all data to not forget previously acquired knowledge. Additionally, most common structural descriptors of MLPs cannot represent efficiently a large number of different chemical elements. In this work, we tackle these problems by introducing element-embracing atom-centered symmetry functions (eeACSFs), which combine structural properties and element information from the periodic table. These eeACSFs are key for our development of a lifelong machine learning potential (lMLP). Uncertainty quantification can be exploited to transgress a fixed, pretrained MLP to arrive at a continuously adapting lMLP, because a predefined level of accuracy can be ensured. To extend the applicability of an lMLP to new systems, we apply continual learning strategies to enable autonomous and on-the-fly training on a continuous stream of new data. For the training of deep neural networks, we propose the continual resilient (CoRe) optimizer and incremental learning strategies relying on rehearsal of data, regularization of parameters, and the architecture of the model.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
SN2 reaction at a methyl carbon atom with leaving group X and nucleophile Y.
Figure 2
Figure 2
Convergence of the optimizers Adam, RPROP, and CoRe for training reference data set B. The test set RMSE values of (a) energies Etest and (b) atomic force components Fα,ntest are shown as a function of the training epoch nepoch. RMSE values of individual HDNNPs are represented by dots, while their mean, which is unequal to the ensemble RMSE value, is shown by a solid line. Lifelong adaptive data selection was applied in the optimizations.
Figure 3
Figure 3
Training data set reduction of the optimizers Adam, RPROP, and CoRe for training reference data set B. The number of considered training conformations Ntrain is shown as a function of the training epoch nepoch. The values of Ntrain of individual HDNNPs are represented by dots, while their mean is shown by a solid line. The black dashed line represents the number of training conformations which was used for fitting in each epoch.
Figure 4
Figure 4
Final accuracy of lMLPs for which a fraction of plate training data of reference data set B was first available at a late training epoch. These data were either chosen randomly or a certain block was used. More detailed information about the procedure is provided in the main text. The test RMSE values of (a) energies Etest and (b) atomic force components Fα,ntest are shown as a function of the late data fraction plate. RMSE values of individual HDNNPs are represented by dots, their mean by a solid line, and their range by a lighter colored band. The CoRe optimizer (β1b = 0.7 for “Block” with plate > 0 and β1b = 0.725 otherwise) and lifelong adaptive data selection were applied for 1500 epochs.
Figure 5
Figure 5
Potential energy surface of the SN2 reaction Br + CH3Cl ⇌ BrCH3 + Cl. The lMLP ensemble prediction energy E is referenced to the minimum DFT reference energy Eminref of the given conformation space spanned by DFT optimized structures with constrained distances rCl–C and rBr–C. The color represents the error of E with respect to the DFT reference energy Eref. Black dots show the explicit evaluations of the lMLP. The colors between black dots are interpolated.
Figure 6
Figure 6
Absolute values of the errors with respect to the DFT reference and the uncertainty quantification for the ensemble prediction of (a) energies |ΔE| and (b) atomic force components |ΔFα,n| of the validation data. The order x sorts the validation conformations according to their uncertainty quantification.

References

    1. Cramer C. J.Essentials of Computational Chemistry: Theories and Models; Wiley, 2013.
    1. Jensen F.Introduction to Computational Chemistry; Wiley, 2017.
    1. Burke K. Perspective on density functional theory. J. Chem. Phys. 2012, 136, 150901.10.1063/1.4704546. - DOI - PubMed
    1. Riplinger C.; Pinski P.; Becker U.; Valeev E. F.; Neese F. Sparse maps–A systematic infrastructure for reduced-scaling electronic structure methods. II. Linear scaling domain based pair natural orbital coupled cluster theory. J. Chem. Phys. 2016, 144, 024109.10.1063/1.4939030. - DOI - PubMed
    1. Das S.; Motamarri P.; Gavini V.; Turcksin B.; Li Y. W.; Leback B.. Fast, scalable and accurate finite-element based ab initio calculations using mixed precision computing: 46 PFLOPS simulation of a metallic dislocation system. International Conference for High Performance Computing, Networking, Storage and Analysis (SC19). Denver, CO, USA, 2019.