Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul 3;14(1):15315.
doi: 10.1038/s41598-024-66173-z.

Deriving general structure-activity/selectivity relationship patterns for different subfamilies of cyclin-dependent kinase inhibitors using machine learning methods

Affiliations

Deriving general structure-activity/selectivity relationship patterns for different subfamilies of cyclin-dependent kinase inhibitors using machine learning methods

Sara Kaveh et al. Sci Rep. .

Abstract

Cyclin-dependent kinases (CDKs) play essential roles in regulating the cell cycle and are among the most critical targets for cancer therapy and drug discovery. The primary objective of this research is to derive general structure-activity relationship (SAR) patterns for modeling the selectivity and activity levels of CDK inhibitors using machine learning methods. To accomplish this, 8592 small molecules with different binding affinities to CDK1, CDK2, CDK4, CDK5, and CDK9 were collected from Binding DB, and a diverse set of descriptors was calculated for each molecule. The supervised Kohonen networks (SKN) and counter propagation artificial neural networks (CPANN) models were trained to predict the activity levels and therapeutic targets of the molecules. The validity of models was confirmed through tenfold cross-validation and external test sets. Using selected sets of molecular descriptors (e.g. hydrophilicity and total polar surface area) we derived activity and selectivity maps to elucidate local regions in chemical space for active and selective CDK inhibitors. The SKN models exhibited prediction accuracies ranging from 0.75 to 0.94 for the external test sets. The developed multivariate classifiers were used for ligand-based virtual screening of 2 million random molecules of the PubChem database, yielding areas under the receiver operating characteristic curves ranging from 0.72 to 1.00 for the SKN model. Considering the persistent challenge of achieving CDK selectivity, this research significantly contributes to addressing the issue and underscores the paramount importance of developing drugs with minimized side effects.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
The Venn diagram of the VIP-selected descriptors used for developing active/inactive classifiers for five groups of CDK molecules: CDK1: blue, CDK2: crimson, CDK4: green, CDK5: yellow, CDK9: brown.
Figure 2
Figure 2
The density plot, box plot, and beeswarm plot (ac) for Hy molecular descriptor for the active and inactive groups of CDK1, CDK2, and CDK5 molecules (df) for nRCONHR molecular descriptor for the active and inactive groups of CDK1, CDK2, and CDK5 molecules (gi) for C-043 molecular descriptor for the active and inactive groups of CDK1, CDK4, and CDK9 molecules.
Figure 3
Figure 3
The density plot, box plot, and beeswarm plot (ac) for nBnz molecular descriptor for the active and inactive groups of CDK4, and CDK5 molecules (df) for TPSA(Tot) molecular descriptor for the active and inactive groups of CDK2 and CDK5 molecules.
Figure 4
Figure 4
One-against-one visualization of the abstract PC space made by the 31 VIP-selected molecular descriptors used for multiclass classification of active CDK molecules (Dark blue: active CDK1, magenta: active CDK2, light blue: active CDK4, gold: active CDK5, orange red: active CDK9).
Figure 5
Figure 5
The density plot, box plot, and beeswarm plot (ac) for Ms (df) for TPSA(Tot) (gi) for nCbH molecular descriptor(s) within five different groups of active CDK inhibitors.
Figure 6
Figure 6
The distribution and coordinates of the PubChem-R and active CDK molecules within the two-dimensional subspaces created by one-by-one comparison of four VIP-selected molecular descriptors (a) TPSA(Tot) versus T (N.. N), (b) TPSA(Tot) versus Ms, (c) TPSA(NO) versus T (N.. N), (d) TPSA(NO) versus T (N.. N).

Similar articles

Cited by

References

    1. Alberts B, et al. Essential Cell Biology. Garland Science; 2015. pp. 603–610.
    1. Pavletich NP. Mechanisms of cyclin-dependent kinase regulation: structures of cdks, their cyclin activators, and cip and INK4 inhibitors1, 211998 Awardee, Walter J. Johnson Prize for the Encouragement of Research in the Life Sciences 2 Edited by P. E. Wright. J. Mol. Biol. 1999;287:821–828. doi: 10.1006/jmbi.1999.2640. - DOI - PubMed
    1. Ayala-Aguilera CC, et al. Small molecule kinase inhibitor drugs (1995–2021): Medical indication, pharmacology, and synthesis. J. Med. Chem. 2022;65:1047–1131. doi: 10.1021/acs.jmedchem.1c00963. - DOI - PubMed
    1. Braal CL, et al. Inhibiting CDK4/6 in breast cancer with palbociclib, ribociclib, and abemaciclib: Similarities and differences. Drugs. 2021;81:317–331. doi: 10.1007/s40265-020-01461-2. - DOI - PMC - PubMed
    1. Xie Z, et al. Lessons learned from past cyclin-dependent kinase drug discovery efforts. J. Med. Chem. 2022;65:6356–6389. doi: 10.1021/acs.jmedchem.1c02190. - DOI - PubMed

MeSH terms

Substances