Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul 24;15(33):13313-13324.
doi: 10.1039/d4sc01029a. eCollection 2024 Aug 22.

A physics-aware neural network for protein-ligand interactions with quantum chemical accuracy

Affiliations

A physics-aware neural network for protein-ligand interactions with quantum chemical accuracy

Zachary L Glick et al. Chem Sci. .

Abstract

Quantifying intermolecular interactions with quantum chemistry (QC) is useful for many chemical problems, including understanding the nature of protein-ligand interactions. Unfortunately, QC computations on protein-ligand systems are too computationally expensive for most use cases. The flourishing field of machine-learned (ML) potentials is a promising solution, but it is limited by an inability to easily capture long range, non-local interactions. In this work we develop an atomic-pairwise neural network (AP-Net) specialized for modeling intermolecular interactions. This model benefits from a number of physical constraints, including a two-component equivariant message passing neural network architecture that predicts interaction energies via an intermediate prediction of monomer electron densities. The AP-Net model is trained on a comprehensive dataset composed of paired ligand and protein fragments. This model accurately predicts QC-quality interaction energies of protein-ligand systems at a computational cost reduced by orders of magnitude. Applications of the AP-Net model to molecular crystal structure prediction are explored, as well as limitations in modeling highly polarizable systems.

PubMed Disclaimer

Conflict of interest statement

There are no conflicts to declare.

Figures

Fig. 1
Fig. 1. An overview of the AP-Net architecture where (A) is the atomic property module and (B) is the interaction energy module. AP-Net predicts the four physically meaningful components of a protein–ligand interaction: electrostatics (Eelst), exchange (Eexch), induction/polarization (Eind), and London dispersion (Edisp).
Fig. 2
Fig. 2. (A) Depiction of the Splinter dimer dataset. This dataset was constructed by exhaustively pairing small protein and ligand fragments. Between 50 and 500 dimer configurations were generated for each pair of fragments. (B) Distribution of interaction energies (left) and AP-Net errors with respect to interaction energies (right) over 150 000 validation dimers of the Splinter dataset, in kcal mol−1. The respective mean absolute interaction energies and mean absolute errors of the two sets of distributions are labeled.
Fig. 3
Fig. 3. An example dimer from the SAPT-PDB-13K dataset. A small molecule inhibitor interacts with the nearest amino acid, a tyrosine, of an Escherichia coli sliding clamp protein. This dimer was extracted from PDB entry 4PNU.
Fig. 4
Fig. 4. Correlation between AP-Net predicted interaction energies and computed SAPT0/aDZ interaction energies on the 13 216 dimers in the SAPT-PDB-13K dataset.
Fig. 5
Fig. 5. Example of an alchemical ΔΔEint experiment. The chlorine group of the P1 substructure of the Factor Xa inhibitor, BAY 59-7939, is mutated to a methyl. The structure is extracted from PDB entry 2W26.
Fig. 6
Fig. 6. (Top Right) Visualization of a two-dimensional interaction energy scan (kcal mol−1) for the NMA dimer, varying the hydrogen bond distance (r) and angle (θ). (Top Left) The NMA dimer interaction energy surface computed at the cheaper SAPT0/aDZ level of theory. (Bottom Left) The NMA dimer interaction energy surface computed at the expensive CCSD(T)/CBS level of theory. (Top Center) The NMA dimer interaction energy surface predicted by an AP-Net model trained on ∼1.7 million SAPT0/aDZ data points. (Bottom Center) The NMA dimer interaction energy surface predicted by an AP-Net model trained on one hundred CCSD(T)/CBS data points. (Bottom Right) The NMA dimer interaction energy surface predicted by an AP-Net model trained on ∼1.7 million SAPT0/aDZ data points and then re-trained with one hundred CCSD(T)/CBS data points. All energies in kcal mol−1.
Fig. 7
Fig. 7. (Left) The experimentally observed form II of the 5-fluorouracil crystal. (Center) Comparison between computed and predicted relative crystal lattice energies. (Right) Relative crystal lattice energies as a function of density.

References

    1. Strekowski L. Wilson B. Mutat. Res. 2007;623:3–13. doi: 10.1016/j.mrfmmm.2007.03.008. - DOI - PubMed
    1. Bodley A. Liu L. F. Israel M. Seshadri R. Koseki Y. Giuliani F. C. Kirschenbaum S. Silber R. Potmesil M. Cancer Res. 1989;49:5969–5978. - PubMed
    1. Davis H. J. Phipps R. J. Chem. Sci. 2017;8:864–877. doi: 10.1039/C6SC04157D. - DOI - PMC - PubMed
    1. Huang B. Muy S. Feng S. Katayama Y. Lu Y.-C. Chen G. Shao-Horn Y. Phys. Chem. Chem. Phys. 2018;20:15680–15686. doi: 10.1039/C8CP02512F. - DOI - PubMed
    1. Rybtchinski B. ACS Nano. 2011;5:6791–6818. doi: 10.1021/nn2025397. - DOI - PubMed

LinkOut - more resources