Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2025 May 26;65(10):5205-5215.
doi: 10.1021/acs.jcim.5c00549. Epub 2025 May 15.

Benchmarking Zinc-Binding Site Predictors: A Comparative Analysis of Structure-Based Approaches

Affiliations
Comparative Study

Benchmarking Zinc-Binding Site Predictors: A Comparative Analysis of Structure-Based Approaches

Cosimo Ciofalo et al. J Chem Inf Model. .

Erratum in

Abstract

Metalloproteins play crucial physiological roles across all domains of life, relying on metal ions for structural stability and catalytic activity. In recent years, computational approaches have emerged as powerful and increasingly reliable tools for predicting metal-binding sites in metalloproteins, enabling their application in the challenging field of metalloproteomics. Given the growing number of available tools, it is timely to design a reproducible approach to characterize their performance in specific usage scenarios. Thus, in this study, we selected some state-of-the-art structure-based predictors for zinc-binding sites and evaluated their performance on two data sets: experimental apoprotein structures and structural models generated by AlphaFold. Our results indicate that apoprotein structures pose significant challenges for predicting metal-binding sites. For these systems, the predictors achieved lower-than-expected performance due to the structural rearrangements occurring upon metalation. Conversely, predictions based on AlphaFold models yielded significantly better results, suggesting that they more closely resemble the holo forms of metalloproteins. Our findings highlight the great potential of metal-binding site predictions for advancing research in the field of metalloproteomics.

PubMed Disclaimer

Figures

1
1
Percentage distribution of sites across CLESs and structures. (Top) The histogram shows the percentage distribution of the number of sites across CLESs. The number above each bar indicates how many CLESs have the corresponding number of sites. (Bottom) The histogram shows the percentage distribution of the number of sites per structure. The number above each bar indicates how many structures harbor the corresponding number of sites.
2
2
Violin plot for CLES 22022, composed of 31 sites. Each violin represents the distribution of the distance between two selected donor atoms in the binding site. The white line at the center of each violin indicates the median value, while the bars around it mark the first and third quartiles. The circles and the triangles of the same color correspond to the sites whose distance matrices contain at least two elements with values beyond two standard deviations from the mean. the site of this cluster consists of 1 cysteine (Sγ), 1 aspartate (Cγ) and 1 histidine (Cε1), where Sγ, Cγ and Cε1 are the donor atom or donor atom proxy.
3
3
Performance of the predictors, based on the top 5 outputs. True positive (TP), redundant true positive (RTP), false positive (FP) and recall (R) values are displayed for all the predictors.
4
4
Percentage of CLESs identified by the predictors.
5
5
(A) Percentage of correctly predicted sites in CLESs featuring large rearrangements of the MBS. The performance of the three best predictors, each shown in a different color, is computed as the percentage of correctly identified sites over the structures in each CLES. Five different CLESs characterized by a high spread of side chains were selected. (B) Violin plot of CLES 18431. The site of this cluster consists of 4 cysteines, whose donor atoms are the Sγ atoms. (C) View of the 3D structure of the sites in CLES 18431. This panel shows the spread of side chains for three different apo-sites belonging to CLES 18431. Each apo-site is represented by a different color.
6
6
Precision and recall (R­(top)) of the selected predictors as a function of the number of outputs considered.
7
7
Precision and recall (R (top)) of the selected predictors as a function of the number of nonredundant outputs.
8
8
F1-score of the selected predictors as a function of the number of considered outputs. The F1-score has been calculated using the data of Figure .
9
9
Venn diagram representing the agreement of the predictors for the second benchmark. The diagrams represent the intersections of the true positives (TP) identified by the four tools. The percentages in the left panel were computed with respect to the total of 62 sites identified by at least two predictors.
10
10
Recall compared with precision for all the predictors. Recall is defined as the ratio between TP and the total number of expected sites in the data set.

Similar articles

References

    1. Waldron K. J., Rutherford J. C., Ford D., Robinson N. J.. Metalloproteins and Metal Sensing. Nature. 2009;460(7257):823–830. doi: 10.1038/nature08300. - DOI - PubMed
    1. Andreini C., Banci L., Bertini I., Rosato A.. Zinc through the Three Domains of Life. J. Proteome Res. 2006;5(11):3173–3178. doi: 10.1021/pr0603699. - DOI - PubMed
    1. Harding M. M.. The Architecture of Metal Coordination Groups in Proteins. Acta Crystallogr., Sect. D:Biol. Crystallogr. 2004;60(5):849–859. doi: 10.1107/S0907444904004081. - DOI - PubMed
    1. Jakubowski N., Lobinski R., Moens L.. Metallobiomolecules. The Basis of Life, the Challenge of Atomic Spectroscopy. J. Anal. At. Spectrom. 2004;19:1–4. doi: 10.1039/b313299b. - DOI
    1. Zhang Y., Gladyshev V. N.. Comparative Genomics of Trace Element Dependence in Biology. J. Biol. Chem. 2011;286(27):23623–23629. doi: 10.1074/jbc.R110.172833. - DOI - PMC - PubMed

Publication types

LinkOut - more resources