Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jul 1;10(1):2903.
doi: 10.1038/s41467-019-10827-4.

Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning

Affiliations

Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning

Justin S Smith et al. Nat Commun. .

Abstract

Computational modeling of chemical and biological systems at atomic resolution is a crucial tool in the chemist's toolset. The use of computer simulations requires a balance between cost and accuracy: quantum-mechanical methods provide high accuracy but are computationally expensive and scale poorly to large systems, while classical force fields are cheap and scalable, but lack transferability to new systems. Machine learning can be used to achieve the best of both approaches. Here we train a general-purpose neural network potential (ANI-1ccx) that approaches CCSD(T)/CBS accuracy on benchmarks for reaction thermochemistry, isomerization, and drug-like molecular torsions. This is achieved by training a network to DFT data then using transfer learning techniques to retrain on a dataset of gold standard QM calculations (CCSD(T)/CBS) that optimally spans chemical space. The resulting potential is broadly applicable to materials science, biology, and chemistry, and billions of times faster than CCSD(T)/CBS calculations.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Accuracy in predicting atomization energies. Error of the ANI-1ccx predicted atomization energy Ea on the GDB-10to13 benchmark relative to CCSD(T)*/CBS and compared against ωB97X
Fig. 2
Fig. 2
Accuracy in predicting reaction and isomerization energy. ANI-1ccx reaction and isomerization energy difference prediction on the a HC7/11 and b ISOL6 benchmarks, relative to CCSD(T)/CBS. Methods compared are the ANI-1ccx transfer learning potential, ANI-1x trained only on DFT data, the DFT reference (ωB97X), and our coupled-cluster extrapolation scheme CCSD(T)*/CBS. The top panel provides the HC7 reactions numbered 1, 2, 6, and 7 and bottom panel shows the ISOL6 reactions numbered 1–5
Fig. 3
Fig. 3
Accuracy in predicting torsional energies relevant to drug discovery. Methods compared are QM (red and green), molecular mechanics (blue), and ANI (orange) performance on 45 torsion profiles containing C, H, N, and O atomic elements. The gray dots represent the MAD of a given torsion scan vs. gold standard CCSD(T)/CBS. The box extends from the upper to lower quartile and the black horizontal line in the box is the median. The upper “whisker” extends to the last datum less than the third quartile plus 1.5 times the interquartile range while the lower “whisker” extends to the first datum greater than the first quartile minus 1.5 times the interquartile range
Fig. 4
Fig. 4
Diagram of the transfer learning technique evaluated in this work. Transfer learning starts from a pretrained ANI-1x DFT model, then retrains to higher accuracy CCSD(T)*/CBS data with some parameters fixed during training

References

    1. Ramsay OB. Serendipity: accidental discoveries in science (Roberts, R. M.) J. Chem. Educ. 2009;67:A311. doi: 10.1021/ed067pA311.1. - DOI
    1. Berson JA. Discoveries missed, discoveries made: creativity, influence, and fame in chemistry. Tetrahedron. 1992;48:3–17. doi: 10.1016/S0040-4020(01)80574-3. - DOI
    1. Pople John A. Quantum Chemical Models (Nobel Lecture) Angewandte Chemie International Edition. 1999;38(13-14):1894–1902. doi: 10.1002/(SICI)1521-3773(19990712)38:13/14<1894::AID-ANIE1894>3.0.CO;2-H. - DOI - PubMed
    1. Kohn W. Nobel Lecture: electronic structure of matter—wave functions and density functionals. Rev. Mod. Phys. 1999;71:1253–1266. doi: 10.1103/RevModPhys.71.1253. - DOI
    1. Purvis GD, Bartlett RJ. A full coupled-cluster singles and doubles model: the inclusion of disconnected triples. J. Chem. Phys. 1982;76:1910–1918. doi: 10.1063/1.443164. - DOI

Publication types