Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 9;65(11):5801-5810.
doi: 10.1021/acs.jcim.5c00506. Epub 2025 May 21.

iCliff Taylor's Version: Robust and Efficient Activity Cliff Determination

Affiliations

iCliff Taylor's Version: Robust and Efficient Activity Cliff Determination

Kenneth López-Pérez et al. J Chem Inf Model. .

Abstract

Activity cliffs represent an important challenge to tackle in cheminformatics and drug design. One of the most common indicators to quantify them is the Structure-Activity Landscape Index (SALI). Here, we expose the mathematical limitations of SALI's formulation, the most evident: it is undefined in instances where the similarity between two molecules is one. We show how using a simple Taylor's series can aid this main problem, yielding a defined expression that can capture the ranking information from the original SALI. The second issue to solve is the quadratic complexity of using SALI to describe the roughness of the activity landscape of a set. Here, we propose iCliff, an indicator that can quantify the roughness in linear complexity. For this, we leverage the iSIM framework to obtain the average similarity of the set and a rearrangement to obtain the average of the squared property differences. The calculations for 30 different AC-focused databases suggest that there is a strong correlation between iCliff and the average pairwise of SALI's pairwise Taylor Series. To further explore the individual effects of removing each molecule in the activity landscape, we propose complementary iCliff. With this tool, we were able to identify the molecules that have a high number of activity cliffs with the rest of the molecules in the set.

PubMed Disclaimer

Conflict of interest statement

CONFLICT OF INTEREST STATEMENT

The authors declare no competing financial interest.

Figures

Figure 1.
Figure 1.
Activity cliff example for a pair of molecules with peroxisome proliferator-activated receptor delta (PPARδ) activity. Tanimoto similarity calculated from RDKIT binary fingerprints (2048 bits).
Figure 2.
Figure 2.
Kendall’s tau correlations between the SALI and TS_SALI values for different truncation orders over the 30 studied libraries represented with ECFP4 (1024 bits), MACCS (166 bits), and RDKIT (2048 bits) binary fingerprints. In A) SALI is defined as in Eq. 1, in B) we use SALI with the squared property difference.
Figure 3.
Figure 3.
Distribution of pairwise TS_SALI values using 3rd order truncation for the A) ChEMBL264 and B) ChEMBL2835 libraries represented with ECFP4 fingerprints.
Figure 4.
Figure 4.
Relationship between the average TS SALI value (from the TS SALI pairwise matrix) and the iCliff value for the 30 ChEMBL’s databases represented with ECPF4 fingerprints (1024 bits). The tendency line is shown in gray.
Figure 5.
Figure 5.
iCliff values for the ChEMBL databases studied before and after removing activity cliffs (molecules in pairs with TS3_SALI > the 99th percentile). Molecules represented ECFP4 (1024-bit) fingerprints. The number of molecules annotated over the bars.
Figure 6.
Figure 6.
Kendall’s tau correlations between the cTS_SALI and iCliff values over the 30 studied libraries using ECFP4 fingerprints.
Figure 7.
Figure 7.
Variation of the Jaccard similarity between the ranking of cTS_SALI and complementary iCliff, when varying the fraction of molecules. A) CHEMBL2835_Ki and B) CHEMBL1862_Ki. Horizontal lines at 0.8 and 0.9, vertical line at 10% of the size of each set.
Figure 8.
Figure 8.
iCliff values for the studied ChEMBL databases before and after removing activity cliffs (5% of molecules with lowest complementary iCliff). Molecules are represented with ECFP4 (1024-bit) fingerprints.
Figure 9.
Figure 9.
Examples of separated pairs by the removal of the top 5% molecules with the lowest complementary iCliff values. Structures, pKi, and Tanimoto similarities between the pairs are shown. Molecules were represented using ECFP4 (1024-bit) fingerprints.

Update of

Similar articles

Cited by

References

    1. Silipo C; Vittoria A QSAR, Rational Approaches to the Design of Bioactive Compounds. In European Symposium on Quantitative Structure-Activity Relationships 1990: Sorrento, Italy); 1991.
    1. Stumpfe D; Bajorath J Exploring Activity Cliffs in Medicinal Chemistry. J Med Chem 2012, 55 (7), 2932–2942. 10.1021/jm201706b. - DOI - PubMed
    1. Stumpfe D; Hu H; Bajorath J Evolving Concept of Activity Cliffs. ACS Omega 2019, 4 (11), 14360–14368. 10.1021/acsomega.9b02221. - DOI - PMC - PubMed
    1. Stumpfe D; Hu Y; Dimova D; Bajorath J Recent Progress in Understanding Activity Cliffs and Their Utility in Medicinal Chemistry. J Med Chem 2014, 57 (1), 18–28. 10.1021/jm401120g. - DOI - PubMed
    1. Johnson MA; Maggiora GM Concepts and Applications of Molecular Similarity, 1st ed.; Wiley-Interscience, 1990.

MeSH terms

LinkOut - more resources