Abstract
Metal ions, as abundant and vital cofactors in numerous proteins, are crucial for enzymatic activities and protein interactions. Given their pivotal role and catalytic efficiency, accurately and efficiently identifying metal-binding sites is fundamental to elucidating their biological functions and has significant implications for protein engineering and drug discovery. To address this challenge, we present SuperMetal, a generative AI framework that leverages a score-based diffusion model coupled with a confidence model to predict metal-binding sites in proteins with high precision and efficiency. Using zinc ions as an example, SuperMetal outperforms existing state-of-the-art models, achieving a precision of 94 % and coverage of 90 %, with zinc ions localization within 0.52 ± 0.55 Å of experimentally determined positions, thus marking a substantial advance in metal-binding site prediction. Furthermore, SuperMetal demonstrates rapid prediction capabilities (under 10 s for proteins with ∼ 2000 residues) and remains minimally affected by increases in protein size. Notably, SuperMetal does not require prior knowledge of the number of metal ions—unlike AlphaFold 3, which depends on this information. Additionally, SuperMetal can be readily adapted to other metal ions or repurposed as a probe framework to identify other types of binding sites, such as protein-binding pockets. Scientific contribution SuperMetal introduces a diffusion-based, SE(3)-equivariant generative model that places metal ions in proteins with 94 % precision, 90 % coverage, and sub-ångström (0.52 Å) accuracy in under 10 s, surpassing current methods and accelerating metal-aware protein engineering and drug discovery.
| Original language | English |
|---|---|
| Article number | 107 |
| Journal | Journal of Cheminformatics |
| Volume | 17 |
| Issue number | 1 |
| DOIs | |
| State | Published - Dec 2025 |
Funding
Z.S. was supported by the Vanderbilt Data Science Postdoctoral Fellowship. X.L. and X.K. received research support from the Vanderbilt Data Science Institute, while X.L. and P.T.C. were supported by the John R. Hall Professorship Endowment in Chemical Engineering. J.M. received a Humboldt Professorship from the Alexander von Humboldt Foundation and funding from the Deutsche Forschungsgemeinschaft (DFG) through SFB1423 (421152132), SFB 1664 (514901783), TRR (514664767), and SPP 2363 (460865652). Additionally, J.M. was supported by the Federal Ministry of Education and Research (BMBF) via the Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), the German Network for Bioinformatics Infrastructure (de.NBI), and the German Academic Exchange Service (DAAD) through the School of Embedded Composite AI (SECAI 15766814), with further support provided by the National Institutes of Health (NIH) through grants R01 HL122010, R01 DA046138, R01 AG068623, U01 AI150739, R01 CA227833, R01 LM013434, S10 OD016216, S10 OD020154, and S10 OD032234.
Keywords
- Diffusion model
- Generative AI
- Metal-binding sites
- Metalloprotein
Fingerprint
Dive into the research topics of 'SuperMetal: a generative AI framework for rapid and precise metal ion location prediction in proteins'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver