Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Sep 26;62(18):4295-4299.
doi: 10.1021/acs.jcim.2c00840. Epub 2022 Sep 13.

On the Choice of Active Site Sequences for Kinase-Ligand Affinity Prediction

Affiliations

On the Choice of Active Site Sequences for Kinase-Ligand Affinity Prediction

Jannis Born et al. J Chem Inf Model. .

Abstract

Recent work showed that active site rather than full-protein-sequence information improves predictive performance in kinase-ligand binding affinity prediction. To refine the notion of an "active site", we here propose and compare multiple definitions. We report significant evidence that our novel definition is superior to previous definitions and better models of ATP-noncompetitive inhibitors. Moreover, we leverage the discontiguity of the active site sequence to motivate novel protein-sequence augmentation strategies and find that combining them further improves performance.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
Overview of active site site definitions and representations. A) Visualization of cAMP-dependent protein kinase catalytic subunit alpha (P17612). Residues unique to the active site definitions of refs (8) and (10) are colored in orange and green, respectively. Residues contained in both definitions are shown in red. B) Partial amino acid sequence (residues 48–62) of the same kinase. The upper gray panel displays the four kinase sequence representations examined in this work. The lower gray panel visualizes three kinase augmentation strategies, exemplified on the “combined” active site definition: flipping (i.e., reversing) the entire sequence, flipping contiguous subsequences, and swapping neighboring subsequences. Residues affected by the augmentation are encircled in black.
Figure 2
Figure 2
RMSE in affinity prediction for kinase split on validation and test data. 10-fold cross-validation results on kinase data from BindingDB. Performance of validation (A) and test data (B) is shown. Statistically significant differences between the three different active site configurations are marked with a star.

References

    1. Abbasi K.; Razzaghi P.; Poso A.; Ghanbari-Ara S.; Masoudi-Nejad A. Deep learning in drug target interaction prediction: current and future perspectives. Curr. Med. Chem. 2021, 28, 2100–2113. 10.2174/0929867327666200907141016. - DOI - PubMed
    1. Jones D.; Kim H.; Zhang X.; Zemla A.; Stevenson G.; Bennett W. D.; Kirshner D.; Wong S. E.; Lightstone F. C.; Allen J. E. Improved protein–ligand binding affinity prediction with structure-based deep fusion inference. J. Chem. Inf. Model. 2021, 61, 1583–1592. 10.1021/acs.jcim.0c01306. - DOI - PubMed
    1. Hassan-Harrirou H.; Zhang C.; Lemmin T. RosENet: improving binding affinity prediction by leveraging molecular mechanics energies with an ensemble of 3D convolutional neural networks. J. Chem. Inf. Model. 2020, 60, 2791–2802. 10.1021/acs.jcim.0c00075. - DOI - PubMed
    1. Li S.; Wan F.; Shu H.; Jiang T.; Zhao D.; Zeng J. MONN: a multi-objective neural network for predicting compound-protein interactions and affinities. Cell Syst 2020, 10, 308–322. 10.1016/j.cels.2020.03.002. - DOI
    1. Karimi M.; Wu D.; Wang Z.; Shen Y. Deepaffinity: interpretable deep learning of compound–protein affinity through unified recurrent and convolutional neural networks. Bioinformatics 2019, 35, 3329–3338. 10.1093/bioinformatics/btz111. - DOI - PMC - PubMed