Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2025 Mar 25:2025.03.21.644685.
doi: 10.1101/2025.03.21.644685.

SuperMetal: A Generative AI Framework for Rapid and Precise Metal Ion Location Prediction in Proteins

Affiliations

SuperMetal: A Generative AI Framework for Rapid and Precise Metal Ion Location Prediction in Proteins

Xiaobo Lin et al. bioRxiv. .

Update in

Abstract

Metal ions, as abundant and vital cofactors in numerous proteins, are crucial for enzymatic activities and protein interactions. Given their pivotal role and catalytic efficiency, accurately and efficiently identifying metal-binding sites is fundamental to elucidating their biological functions and has significant implications for protein engineering and drug discovery. To address this challenge, we present SuperMetal, a generative AI framework that leverages a score-based diffusion model coupled with a confidence model to predict metal-binding sites in proteins with high precision and efficiency. Using zinc ions as an example, SuperMetal outperforms existing state-of-the-art models, achieving a precision of 94 % and coverage of 90 %, with zinc ions localization within 0.52 ± 0.55 Å of experimentally determined positions, thus marking a substantial advance in metal-binding site prediction. Furthermore, SuperMetal demonstrates rapid prediction capabilities (under 10 seconds for proteins with ∼ 2000 residues) and remains minimally affected by increases in protein size. Notably, SuperMetal does not require prior knowledge of the number of metal ions-unlike AlphaFold 3, which depends on this information. Additionally, SuperMetal can be readily adapted to other metal ions or repurposed as a probe framework to identify other types of binding sites, such as protein-binding pockets.

Keywords: Generative AI; diffusion model; metal-binding sites; metalloprotein.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Workflow of SuperMetal. The orange spheres represent the sampled Zn ions. The protein, shown in blue, is from 2J9R in the RCSB Protein Data Bank.
Fig. 2
Fig. 2. Precision Versus Coverage for SuperMetal and Metal3D at Different Probability Cutoffs.
The graph compares SuperMetal (purple line) and Metal3D (green line), illustrating how precision varies with coverage at different cutoff values. The labels indicate the probability cutoffs, p for the confidence model in SuperMetal and t for Metal3D.
Fig. 3
Fig. 3. Distribution of Mean Absolute Deviation (MAD) for SuperMetal and Metal3D at Increasing Probability Cutoffs.
This violin plot illustrates the MAD for metal ion location predictions by SuperMetal and Metal3D, with cutoffs ranging from lower to higher values for each model. SuperMetal is shown in purple and Metal3D in green. The plot employs kernel density estimation to display the data distribution, with the white circle indicating the median, the black box defining the first and third quartiles, and whiskers extending up to 1.5 times the interquartile range to capture the spread of typical data values.
Fig. 4
Fig. 4. Computational Time Analysis for SuperMetal and Metal3D Across Protein Sizes.
This scatter plot compares the computational time required by SuperMetal and Metal3D to predict metal-binding sites against the number of protein residues. Polynomial regression curves (purple and green dashed lines) are only used to clarify the trends.
Fig. 5
Fig. 5. Comparative visualization of Zn ion binding site predictions for the proteins 5IN2 (1) and 6BTP (2).
Zn ions are color-coded as follows: grey for experimentally determined Zn ions, cyan for Metal3D predictions, orange for SuperMetal predictions, and blue for AlphaFold 3 predictions. The protein structure is shown in green for the same input PDB file used in Metal3D and SuperMetal, while it is shown in yellow for AlphaFold 3. The transparent green region around the Zn ions highlights the protein’s atomic structure within a 5 Å radius of the metal ions. From left to right, the figure shows varying numbers of Zn ions specified in the AlphaFold 3 input, ranging from 1 Zn ion, to 2, and finally to 6 Zn ions.
Fig. 6
Fig. 6. Fundamental Theory of the Score-based Generative Diffusion Model for Metal Ions in Proteins.
The forward continuous-time stochastic differential equation (SDE) transitions the true locations of metal ions (top left) to random locations (top right). The score at each intermediate time step is predicted by a deep learning neural network, enabling the reverse process of the SDE. The grey part of the protein (top) represents the atomic structure of the protein surrounding the metal ions at their original positions.

Similar articles

  • SuperMetal: a generative AI framework for rapid and precise metal ion location prediction in proteins.
    Lin X, Su Z, Liu YL, Liu J, Kuang X, Cummings PT, Spencer-Smith J, Meiler J. Lin X, et al. J Cheminform. 2025 Jul 15;17(1):107. doi: 10.1186/s13321-025-01038-9. J Cheminform. 2025. PMID: 40665445 Free PMC article.
  • Short-Term Memory Impairment.
    Cascella M, Al Khalili Y. Cascella M, et al. 2024 Jun 8. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2025 Jan–. 2024 Jun 8. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2025 Jan–. PMID: 31424720 Free Books & Documents.
  • Management of urinary stones by experts in stone disease (ESD 2025).
    Papatsoris A, Geavlete B, Radavoi GD, Alameedee M, Almusafer M, Ather MH, Budia A, Cumpanas AA, Kiremi MC, Dellis A, Elhowairis M, Galán-Llopis JA, Geavlete P, Guimerà Garcia J, Isern B, Jinga V, Lopez JM, Mainez JA, Mitsogiannis I, Mora Christian J, Moussa M, Multescu R, Oguz Acar Y, Petkova K, Piñero A, Popov E, Ramos Cebrian M, Rascu S, Siener R, Sountoulides P, Stamatelou K, Syed J, Trinchieri A. Papatsoris A, et al. Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085. Epub 2025 Jun 30. Arch Ital Urol Androl. 2025. PMID: 40583613 Review.
  • Sexual Harassment and Prevention Training.
    Cedeno R, Bohlen J. Cedeno R, et al. 2024 Mar 29. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2025 Jan–. 2024 Mar 29. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2025 Jan–. PMID: 36508513 Free Books & Documents.
  • Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.
    Struyf T, Deeks JJ, Dinnes J, Takwoingi Y, Davenport C, Leeflang MM, Spijker R, Hooft L, Emperador D, Domen J, Tans A, Janssens S, Wickramasinghe D, Lannoy V, Horn SRA, Van den Bruel A; Cochrane COVID-19 Diagnostic Test Accuracy Group. Struyf T, et al. Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3. Cochrane Database Syst Rev. 2022. PMID: 35593186 Free PMC article.

References

    1. Shu Nanjiang, Zhou Tuping, and Hovmöller Sven. Prediction of zinc-binding sites in proteins from sequence. Bioinformatics, 24(6):775–782, 2008. - PubMed
    1. Andreini Claudia, Banci Lucia, Bertini Ivano, and Rosato Antonio. Counting the zinc-proteins encoded in the human genome. Journal of proteome research, 5(1):196–201, 2006. - PubMed
    1. McCall Keith A, Huang Chih-chin, and Fierke Carol A. Function and mechanism of zinc metalloenzymes. The Journal of nutrition, 130(5):1437S–1446S, 2000. - PubMed
    1. Berg Jeremy Mand Shi Yigong. The galvanization of biology: a growing appreciation for the roles of zinc. Science, 271(5252):1081–1085, 1996. - PubMed
    1. Costa Maria Inês, Sarmento-Ribeiro Ana Bela, and Gonçalves Ana Cristina. Zinc: from biological functions to therapeutic potential. International Journal of Molecular Sciences, 24(5):4822, 2023. - PMC - PubMed

Publication types

LinkOut - more resources