Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2010 Feb 22:11:99.
doi: 10.1186/1471-2105-11-99.

A new protein binding pocket similarity measure based on comparison of clouds of atoms in 3D: application to ligand prediction

Affiliations
Comparative Study

A new protein binding pocket similarity measure based on comparison of clouds of atoms in 3D: application to ligand prediction

Brice Hoffmann et al. BMC Bioinformatics. .

Abstract

Background: Predicting which molecules can bind to a given binding site of a protein with known 3D structure is important to decipher the protein function, and useful in drug design. A classical assumption in structural biology is that proteins with similar 3D structures have related molecular functions, and therefore may bind similar ligands. However, proteins that do not display any overall sequence or structure similarity may also bind similar ligands if they contain similar binding sites. Quantitatively assessing the similarity between binding sites may therefore be useful to propose new ligands for a given pocket, based on those known for similar pockets.

Results: We propose a new method to quantify the similarity between binding pockets, and explore its relevance for ligand prediction. We represent each pocket by a cloud of atoms, and assess the similarity between two pockets by aligning their atoms in the 3D space and comparing the resulting configurations with a convolution kernel. Pocket alignment and comparison is possible even when the corresponding proteins share no sequence or overall structure similarities. In order to predict ligands for a given target pocket, we compare it to an ensemble of pockets with known ligands to identify the most similar pockets. We discuss two criteria to evaluate the performance of a binding pocket similarity measure in the context of ligand prediction, namely, area under ROC curve (AUC scores) and classification based scores. We show that the latter is better suited to evaluate the methods with respect to ligand prediction, and demonstrate the relevance of our new binding site similarity compared to existing similarity measures.

Conclusions: This study demonstrates the relevance of the proposed method to identify ligands binding to known binding pockets. We also provide a new benchmark for future work in this field. The new method and the benchmark are available at http://cbio.ensmp.fr/paris/.

PubMed Disclaimer

Figures

Figure 1
Figure 1
AUC score versus classification error as an evaluation of binding pocket similarity measure. Red circles represents pockets fixing ligand L1, blue squares represents pockets fixing ligand L2. The AUC score does not reflect the fact of good pocket clusterization, while the classification error does.
Figure 2
Figure 2
Relationship between AUC performances of the methods tested. (a) on the Kahraman dataset (b) on the Homogenous dataset. Each node corresponds to a particular method, parent nodes perform significantly better than child nodes according to the Wilcoxon signed-rank test.
Figure 3
Figure 3
Superposition of the binding pockets of two structurally different proteins binding ATP. A) overall structures of pdb: PDB: 1E8X in grey and PDB: 1DV2 in red superposed according to their binding sites using Sup-CK. ATP molecules are represented in blue. B) Superposition of the ATP molecules from PDB: 1DV2 and PDB: 1E8X when their binding sites are superposed. C) Positively charged protein regions around ATP molecules of PDB: 1E8X in grey and PDB: 1DV2 in red. D) Protein hydrophobic patches around ATP molecules of PDB: 1E8X in grey and PDB: 1DV2 in red.
Figure 4
Figure 4
Alignment two ATP binding pockets. Alignment of two ATP pockets made by sup-CK, atoms of each pockets are represented by blue and red points, two ATP ligands are traced in licorice.
Figure 5
Figure 5
Projection of the ext-KD dataset on the first two kernel principal components defined by the similarity measure sup-CK. Clustering of binding pockets according to their ligands, which illustrates the performance of this method for ligand prediction.
Figure 6
Figure 6
Performance on the HD dataset. (a) Mean AUC score and prediction error as functions of σ in the sup-CK method (pure geometrical version, λ = ∞), (b) mean AUC score and (c) classification error as functions of σ and λ when information on atoms partial charges is used.
Figure 7
Figure 7
Classification error of the sup-CKL algorithm. as a function of R and σ (λ = 0.25): (a) Kahraman dataset, (b) HD dataset.
Figure 8
Figure 8
Projection of ATP binding pockets on the two first kernel principal components of sup-CK. Repartition of the ATP binding pockets generated by the sup-CK similarity measure on the extended Kahraman dataset. Red squares represent ligases, blue stars represent transferases.

Similar articles

Cited by

References

    1. Xie L, Xie L, Bourne PE. A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery. Bioinformatics. 2009;25(12):i305–i312. doi: 10.1093/bioinformatics/btp220. - DOI - PMC - PubMed
    1. Kahraman A, Morris RJ, Laskowski RA, Thornton JM. Shape variation in protein binding pockets and their ligands. J Mol Biol. 2007;368:283–301. doi: 10.1016/j.jmb.2007.01.086. - DOI - PubMed
    1. Morris RJ, Najmanovich R, Kahraman A, Thornton J. Real spherical harmonic expansion coefficients as 3D shape descriptors for protein binding pocket and ligand comparisons. Bioinformatics. 2005;21(10):2347–2355. doi: 10.1093/bioinformatics/bti337. - DOI - PubMed
    1. Gold N, Jackson R. SitesBase: a database for structure-based protein-ligand binding site comparisons. Nucleic Acids Res. 2006;34:D231–D234. doi: 10.1093/nar/gkj062. - DOI - PMC - PubMed
    1. Shulman-Peleg A, Shatsky M, Nussinov R, Wolfson HJJ. MultiBind and MAPPIS: webservers for multiple alignment of protein 3D-binding sites and their interactions. Nucleic Acids Res. 2008;36:W260–W264. doi: 10.1093/nar/gkn185. - DOI - PMC - PubMed

Publication types

LinkOut - more resources