Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Feb 22;50(2):298-308.
doi: 10.1021/ci9004139.

Binding affinity prediction with property-encoded shape distribution signatures

Affiliations

Binding affinity prediction with property-encoded shape distribution signatures

Sourav Das et al. J Chem Inf Model. .

Abstract

We report the use of the molecular signatures known as "property-encoded shape distributions" (PESD) together with standard support vector machine (SVM) techniques to produce validated models that can predict the binding affinity of a large number of protein ligand complexes. This "PESD-SVM" method uses PESD signatures that encode molecular shapes and property distributions on protein and ligand surfaces as features to build SVM models that require no subjective feature selection. A simple protocol was employed for tuning the SVM models during their development, and the results were compared to SFCscore, a regression-based method that was previously shown to perform better than 14 other scoring functions. Although the PESD-SVM method is based on only two surface property maps, the overall results were comparable. For most complexes with a dominant enthalpic contribution to binding (DeltaH/-TDeltaS > 3), a good correlation between true and predicted affinities was observed. Entropy and solvent were not considered in the present approach, and further improvement in accuracy would require accounting for these components rigorously.

PubMed Disclaimer

Figures

Figure 1
Figure 1
(Left) Side view of EP mapped protein interaction surface of complex 1fbp. P1 and P2 are two points chosen from random locations on the surface. The properties of these two points and the Euclidean distance d between them determine which PESD signature bin they will occupy. The graphical representation of the PESD signature of 1fbp is shown as a two dimensional grid of bins (Right). Darker circles indicate greater bin populations. Each row corresponds to specific endpoint color combinations while each column represents point-pair distances that increase from left to right.
Figure 2
Figure 2
H1-4 depict protein and ligand interaction surfaces encoded with EP and Active LP maps. PESD signatures derived from these surfaces were used as features for building binding affinity SVM models.
Figure 3
Figure 3
Plot of experimental affinities versus predicted affinities for PESD-SVM regression models I, II and V applied to their respective test sets.
Figure 4
Figure 4
Plot of RP, RS and number of test cases against chi-squared cutoff distances for PESD-SVM regression Model II.
Figure 5
Figure 5
Correlation between ligand pose (rmsd from native pose) and PESD-DOCK SVM Score for docked conformations of L-Benzyl succinate in 1cbx. The score of the native pose (rmsd = 0) is shown as a dashed line. Since PESD-DOCK SVM models were trained on positive affinity values (pKd/pKi), higher scores indicate favorable interactions.
Figure 6
Figure 6
(a) Plot of free energy versus enthalpy for 322 entries from the SCORPIO database. (b) Plot of free energy versus enthalpy for 111 out of 322 entries from the SCORPIO database. The ΔH/-TΔS was greater than 3 for these complexes. (c) Difference plot of 6a and 6b (d) Plot of experimental versus predicted affinities of all entries in the core set for which ΔH/-TΔS was greater than 3 and could be obtained from the SCORPIO database. The complex 1adl is circled in red.

Similar articles

Cited by

References

    1. Gehlhaar DK, Verkhivker GM, Rejto PA, Sherman CJ, Fogel DR, Fogel LJ, Freer ST. Molecular Recognition of the Inhibitor AG-1343 by HIV-1 Protease: Conformationally Flexible Docking by Evolutionary Programming. Chem Biol. 1995;2:317–324. - PubMed
    1. Muegge I, Martin YC. A General and Fast Scoring Function for Protein-Ligand Interactions: A Simplified Potential Approach. J Med Chem. 1999;42:791–804. - PubMed
    1. Böhm H-J. The development of a simple empirical scoring function to estimate the binding constant for a protein-ligand complex of known three-dimensional structure. J Comput-Aided Mol Des. 1994;8:243–256. - PubMed
    1. Rarey M, Kramer B, Lengauer T, Klebe G. A Fast Flexible Docking Method using an Incremental Construction Algorithm. J Mol Biol. 1996;261:470–489. - PubMed
    1. Jones G, Willett P, Glen RC, Leach AR, Taylor R. Development and validation of a genetic algorithm for flexible docking. J Mol Biol. 1997;267:727–748. - PubMed

Publication types