Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Oct 8:11:500.
doi: 10.1186/1471-2105-11-500.

Analysis of X-ray structures of matrix metalloproteinases via chaotic map clustering

Affiliations

Analysis of X-ray structures of matrix metalloproteinases via chaotic map clustering

Ilenia Giangreco et al. BMC Bioinformatics. .

Abstract

Background: Matrix metalloproteinases (MMPs) are well-known biological targets implicated in tumour progression, homeostatic regulation, innate immunity, impaired delivery of pro-apoptotic ligands, and the release and cleavage of cell-surface receptors. With this in mind, the perception of the intimate relationships among diverse MMPs could be a solid basis for accelerated learning in designing new selective MMP inhibitors. In this regard, decrypting the latent molecular reasons in order to elucidate similarity among MMPs is a key challenge.

Results: We describe a pairwise variant of the non-parametric chaotic map clustering (CMC) algorithm and its application to 104 X-ray MMP structures. In this analysis electrostatic potentials are computed and used as input for the CMC algorithm. It was shown that differences between proteins reflect genuine variation of their electrostatic potentials. In addition, the analysis has been also extended to analyze the protein primary structures and the molecular shapes of the MMP co-crystallised ligands.

Conclusions: The CMC algorithm was shown to be a valuable tool in knowledge acquisition and transfer from MMP structures. Based on the variation of electrostatic potentials, CMC was successful in analysing the MMP target family landscape and different subsites. The first investigation resulted in rational figure interpretation of both domain organization as well as of substrate specificity classifications. The second made it possible to distinguish the MMP classes, demonstrating the high specificity of the S1' pocket, to detect both the occurrence of punctual mutations of ionisable residues and different side-chain conformations that likely account for induced-fit phenomena. In addition, CMC demonstrated a potential comparable to the most popular UPGMA (Unweighted Pair Group Method with Arithmetic mean) method that, at present, represents a standard clustering bioinformatics approach. Interestingly, CMC and UPGMA resulted in closely comparable outcomes, but often CMC produced more informative and more easy interpretable dendrograms. Finally, CMC was successful for standard pairwise analysis (i.e., Smith-Waterman algorithm) of protein sequences and was used to convincingly explain the complementarity existing between the molecular shapes of the co-crystallised ligand molecules and the accessible MMP void volumes.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Choice of the parameter ϑ controlling the resolution at which the data are processed. (a) Plot of cluster entropy as a function of the mutual information (I). (b) Size of the three largest clusters obtained by the CMC algorithm as a function of the mutual information (I) whose value ranges from 0 to ln2 with a bin-width equal to 0.01.
Figure 2
Figure 2
MMP target family landscape. Dendrogram obtained from the partition data relative to the electrostatic analysis of the entire protein structures. Different background colour boxes distinguished MMP allocations classified according to the domain organization. Stable clusters at ϑ = 0.06 and K = 16 are labelled with their corresponding MMP class. Singletons are indicated with the PDB codes.
Figure 3
Figure 3
Analysis of the S1' specificity pocket. Dendrogram obtained from the partition data relative to the electrostatic analysis of the S1' subsite. Different background colour boxes were used to distinguish the main stable groups at ϑ = 0.08, by setting K = 10. Singletons are indicated with the PDB codes.
Figure 4
Figure 4
Representation of global electrostatic similarity at the S1' subsites. The intra-class and inter-class levels are indicated by the blue-coloured bars and solid red line, respectively. Specificity of S1' subsite demonstrated by its higher level of intra-class similarity compared with inter-class similarity.
Figure 5
Figure 5
High electrostatic similarity of S3-S1-S3' stretches apart from gelatinases subfamily. (a) Heat map relative to the pairwise electrostatic distances for S3-S1-S3' analysis. Colours ranging from red to blue indicate shifts from high to low similarity values. (b) Spy plot of the similarity values lower than 0.93; those greater than the threshold were dropped to zero.
Figure 6
Figure 6
Induced fit phenomena differentiates structures belonging to the same MMP class. Elucidation of differences between PDB:1CGL with PDB:1CGE and PDB:1CGF when comparing S3-S1-S3' regions. The Asn80 residue is coloured yellow for the apo-proteins (PDB:1CGF and PDB:1CGE) and cyan for the complex (PDB:1CGL) co-crystallised with a peptide inhibitor rendered as green sticks. The coordination of zinc with the three catalytic histidines and carboxylic group of the ligand is highlighted.
Figure 7
Figure 7
CMC analysis of MMP sequences. Sequence-based clustering of the MMP family. The level of sequence distance is indicated by the colour shift from blue to red. Proteins classes were labelled, PDB codes were referred to those sequences assigned as singletons during CMC analysis.
Figure 8
Figure 8
Analysis of co-crystallised ligands via molecular shape similarity. Dendrogram obtained from the partition data relative to shape similarity analysis of the 84 co-crystallised ligand molecules. Different colours were used to distinguish the main stable groups at ϑ = 0.08, by setting K = 2.

Similar articles

Cited by

References

    1. Burzlaff N. Concepts and Models in Bioinorganic Chemistry. Wiley-Vch; 2006. From Model Complexes for Zinc-Containing Enzymes; pp. 397–429.
    1. Overall CM, Kleifeld O. Tumor Microenvironment - Opinion: Validating Matrix Metalloproteinases as Drug Targets and Anti-Targets for Cancer Therapy. Nat Rev Cancer. 2006;6:227–239. doi: 10.1038/nrc1821. - DOI - PubMed
    1. Terp GE, Cruciani G, Christensen IT, Jorgensen FS. Structural Differences of Matrix Metalloproteinases with Potential Implication for Selectivity Examined by the GRID/CPCA Approach. J Med Chem. 2002;45:675–2684. doi: 10.1021/jm0109053. - DOI - PubMed
    1. Overall CM, Lopez-Otin C. Strategies for MMP Inhibition in Cancer: Innovations for the Post-Trial Era. Nat Rev Cancer. 2002;2:657–672. doi: 10.1038/nrc884. - DOI - PubMed
    1. Willett P. Similarity and Clustering in Chemical Information Systems. New York: John Wiley & Sons; 1987.

Publication types

LinkOut - more resources