Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Apr 23;2014(0):bau029.
doi: 10.1093/database/bau029. Print 2014.

PLIC: protein-ligand interaction clusters

Affiliations

PLIC: protein-ligand interaction clusters

Praveen Anand et al. Database (Oxford). .

Abstract

Most of the biological processes are governed through specific protein-ligand interactions. Discerning different components that contribute toward a favorable protein- ligand interaction could contribute significantly toward better understanding protein function, rationalizing drug design and obtaining design principles for protein engineering. The Protein Data Bank (PDB) currently hosts the structure of ∼68 000 protein-ligand complexes. Although several databases exist that classify proteins according to sequence and structure, a mere handful of them annotate and classify protein-ligand interactions and provide information on different attributes of molecular recognition. In this study, an exhaustive comparison of all the biologically relevant ligand-binding sites (84 846 sites) has been conducted using PocketMatch: a rapid, parallel, in-house algorithm. PocketMatch quantifies the similarity between binding sites based on structural descriptors and residue attributes. A similarity network was constructed using binding sites whose PocketMatch scores exceeded a high similarity threshold (0.80). The binding site similarity network was clustered into discrete sets of similar sites using the Markov clustering (MCL) algorithm. Furthermore, various computational tools have been used to study different attributes of interactions within the individual clusters. The attributes can be roughly divided into (i) binding site characteristics including pocket shape, nature of residues and interaction profiles with different kinds of atomic probes, (ii) atomic contacts consisting of various types of polar, hydrophobic and aromatic contacts along with binding site water molecules that could play crucial roles in protein-ligand interactions and (iii) binding energetics involved in interactions derived from scoring functions developed for docking. For each ligand-binding site in each protein in the PDB, site similarity information, clusters they belong to and description of site attributes are provided as a relational database-protein-ligand interaction clusters (PLIC). Database URL: http://proline.biochem.iisc.ernet.in/PLIC.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
PLIC database workflow. The flowchart illustrates the different steps involved in the construction of the PLIC database. All the protein–ligand complexes are downloaded from the PDB, and binding sites (comprising all the residues that are within 4.5 Å of any ligand atom) are extracted. Only the biologically relevant ligands are selected that resulted in 84 846 binding sites. An exhaustive all-versus-all comparison of these 84 846 binding sites is performed using PocketMatch, and a binding site similarity network is constructed at a PMAX cutoff of 0.8. Network-based clustering of binding sites is performed using the MCL algorithm to obtain clusters of similar binding sites. All the different attributes that are calculated for the interactions within the clusters along with computational tools that were used to derive them are mentioned in the box.
Figure 2.
Figure 2.
The EER of the PLIC database. The EER of different data types in PLIC is shown. The database consists of 13 tables, and the relationship between these tables is depicted here. The logical partition indicating the type of information is highlighted and labeled with different colors.
Figure 3.
Figure 3.
Database statistics. (A) The frequency of different ligand-binding sites present in the database is represented in the form of a histogram. The most populated ligands are labeled along with their frequencies. (B) The number of interactions present per CATH superfamily is depicted in the form of a histogram. The CATH superfamilies associated with most number of ligands are labeled. The pie charts depict the distribution of different (C) enzyme classes and (D) SCOP classes present in the database.
Figure 4.
Figure 4.
PLIC database server. (A) Snapshot of the query page for the PLIC database. (B) The page displaying the results of the query in the tabular form containing information about the name of the binding site, protein, ligand, UniprotID, EC number and CATH superfamily ID. (C) The results page displayed after a specific binding site name is clicked. The results page consists of Jmol plug-in for visualization of interactions, clusters indicating high-energy interaction zones for different probes, alignment of binding sites within the cluster, similar sites with PocketMatch scores, cluster information and various attributes associated with the interaction. (D) Barplot illustrating the distribution of various residues within the binding site environment of the cluster and box plots indicating the variations observed in different attributes of interactions within the cluster are displayed on the cluster analysis page.

References

    1. Yang J., Roy A., Zhang Y. (2013) BioLiP: a semi-manually curated database for biologically relevant ligand-protein interactions. Nucleic Acids Res., 41, D1096–D1103 - PMC - PubMed
    1. Schreyer A.M., Blundell T.L. (2013) CREDO: a structural interactomics database for drug discovery. Database (Oxford), 2013, bat049. - PMC - PubMed
    1. Ito J., Tabei Y., Shimizu K., et al. . (2012) PoSSuM: a database of similar protein-ligand binding and putative pockets. Nucleic Acids Res, 40, D541–D548 - PMC - PubMed
    1. Kufareva I., Ilatovskiy A.V., Abagyan R. (2012) Pocketome: an encyclopedia of small-molecule binding sites in 4D. Nucleic Acids Res., 40, D535–D540 - PMC - PubMed
    1. Hendlich M., Bergner A., Gunther J., et al. . (2003) Relibase: design and development of a database for comprehensive analysis of protein-ligand interactions. J. Mol. Biol., 326, 607–620 - PubMed

Publication types

LinkOut - more resources