Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 May 21;15(1):4345.
doi: 10.1038/s41467-024-48567-9.

Explainable chemical artificial intelligence from accurate machine learning of real-space chemical descriptors

Affiliations

Explainable chemical artificial intelligence from accurate machine learning of real-space chemical descriptors

Miguel Gallegos et al. Nat Commun. .

Abstract

Machine-learned computational chemistry has led to a paradoxical situation in which molecular properties can be accurately predicted, but they are difficult to interpret. Explainable AI (XAI) tools can be used to analyze complex models, but they are highly dependent on the AI technique and the origin of the reference data. Alternatively, interpretable real-space tools can be employed directly, but they are often expensive to compute. To address this dilemma between explainability and accuracy, we developed SchNet4AIM, a SchNet-based architecture capable of dealing with local one-body (atomic) and two-body (interatomic) descriptors. The performance of SchNet4AIM is tested by predicting a wide collection of real-space quantities ranging from atomic charges and delocalization indices to pairwise interaction energies. The accuracy and speed of SchNet4AIM breaks the bottleneck that has prevented the use of real-space chemical descriptors in complex systems. We show that the group delocalization indices, arising from our physically rigorous atomistic predictions, provide reliable indicators of supramolecular binding events, thus contributing to the development of Explainable Chemical Artificial Intelligence (XCAI) models.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Architecture of SchNet4AIM (modified SchNetPack toolbox).
Schematics of the SchNetPack toolbox targeted for the prediction of one-body (1P) and two-body (2P) terms, showing the main representation (left) and prediction (right) blocks. The atomic environment (AE) tensor, created by the representation block, describes an atom or atomic pair in a molecule. This is parsed to the prediction block which outputs a molecular property (P) decomposed into a collection of locally defined terms (K). The labels Z and R denote atomic numbers and positions of an M atom system, while rij signifies the distance between atoms i and j. AW, EAW and EPAW refer to AIMwise, ElementalAIMwise, and ElementalPairAIMwise output modules, respectively, used to construct universal and particle-specific models.
Fig. 2
Fig. 2. Dispersion plots for TO and EinterOH.
Dispersion plots for the kinetic energy of the O atoms, TO, and O–H interaction energies, EinterOH, of the water cluster database. The training and testing data points are shown in blue and red, respectively. All values are reported in atomic units (a.u.), whereas the testing error metrics are given in kcal mol−1. The mean absolute error (MAE) and root mean squared error (RMSE) were employed as measures of accuracy. The labels U and PS are used to refer to universal (AIMwise) and particle-specific (ElementalAIMwise or ElementalPairAIMwise) models, respectively. Source data are provided as a Source Data file.
Fig. 3
Fig. 3. 1,3-dipolar cycloaddition.
Schematic representation of the addition of acetonitrile oxide to ethylene to yield a 5-membered heterocycle, showing the transition state (TS) structure involved in the transformation. The numbers and labels of the main atoms involved in the reaction are shown. The remaining, and with the exception of the C atom in the CH3 moiety of acetonitrile oxide, are H atoms.
Fig. 4
Fig. 4. SchNet4AIM predictions in extrapolation regimes: electronics of a 1,3-dipolar cycloaddition.
Evolution of the atomic charges of the main atoms involved in the reaction (a), the total molecular charge of each of the reactants (b), and the electron delocalization between the terminals atoms (1,3 and 4,7) of both species (c). Atomic or molecular charges are denoted as Q, whereas the term δ is used to refer to the delocalized electron counts. Solid and dashed lines are used to represent the predicted (SchNet4AIM) and observed data, respectively. The SchNet4AIM models are tested in never-seen (extrapolation) regions of the chemical space. All values are reported in electrons (e) relative to the progress of the reaction coordinate (χ). The atomic labels correspond to the numbering shown in Fig. 3. d The transition state of the reaction, showing the emergent C–C and C–O bonds that will be formed, is also shown. Source data are provided as a Source Data file.
Fig. 5
Fig. 5. 13P-CO2 complexation and guest release.
a 13P-CO2 system at 500 (i), 950 (ii), 1318 (iii), and 1600 (iv) fs throughout the 300  K simulation, corresponding to some of the local maxima in the electron delocalization (δ) between CO2 and NH2, δ(CO2,NH2). b Correlation maps between the electronic and geometrical descriptors (i.e., the distance between the centers of mass, ∣R∣) for the CO2-Ph4 (j), CO2-NH21 (jj), and NH23-NH24 (jjj) contacts throughout the 300 K simulation. Blue dots indicate those binding events exclusively predicted by SchNet4AIM, whereas magenta ones show contacts simultaneously (±10 fs) estimated by the geometrical and electronic metrics. Each tick corresponds to 1000 fs. c 13P-CO2 system at 0, 100, 150, and 250 fs throughout the 900 K simulation, along with the SchNet4AIM computed electron delocalization between neighboring NH2 moieties, δ(NH2–NH2). For the electronic metrics, reported in electrons (e), the raw and bin-averaged data are shown. Time is reported in femtoseconds (fs). Source data are provided as a Source Data file.
Fig. 6
Fig. 6. Chemical interactions between CO2 and 13P after the cage cleavage.
Evolution of the SchNet4AIM predicted electron delocalization (δ) between CO2 and the phenyl (Ph), oxo (O), and amino (NH2) moieties of 13P throughout the 13P-CO2 900 K simulation and after the cage rupture event. Time is reported in femtoseconds (fs). The labels R1, R2, R3, and R4 denote the position of the NH2, O, and Ph moieties in 13P using the same numbering and color code as that shown in Fig. 5A. Source data are provided as a Source Data file.

Similar articles

Cited by

References

    1. Jung, J. et al. New parallel computing algorithm of molecular dynamics for extremely huge scale biological systems. J. Comput. Chem.42, 231–241 (2021). - PMC - PubMed
    1. Singh, V., Patra, S., Murugan, N. A., Toncu, D.-C. & Tiwari, A. Recent trends in computational tools and data-driven modeling for advanced materials. Mater. Adv.3, 4069–4087 (2022).
    1. Vogiatzis, K. D. et al. Computational approach to molecular catalysis by 3d transition metals: challenges and opportunities. Chem. Rev.119, 2453–2523 (2019). - PMC - PubMed
    1. Cembran, A., Bernardi, F., Olivucci, M. & Garavelli, M. Counterion controlled photoisomerization of retinal chromophore models: a computational investigation. J. Am. Chem. Soc.126, 16018–16037 (2004). - PubMed
    1. Vermeeren, P. et al. Pericyclic reaction benchmarks: hierarchical computations targeting ccsdt(q)/cbs and analysis of dft performance. Phys. Chem. Chem. Phys.24, 18028–18042 (2022). - PMC - PubMed

LinkOut - more resources