Abstract
With approximately 90% of industrial reactions occurring on surfaces, the role of heterogeneous catalysts is paramount. Currently, accurate surface exposure prediction is vital for heterogeneous catalyst design, but it is hindered by the high costs of experimental and computational methods. Here we introduce a foundation force-field-based model for predicting surface exposure and synthesizability (SurFF) across intermetallic crystals, which are essential materials for heterogeneous catalysts. We created a comprehensive intermetallic surface database using an active learning method and high-throughput density functional theory calculations, encompassing 12,553 unique surfaces and 344,200 single points. SurFF achieves density-functional-theory-level precision with a prediction error of 3 meV Å−2 and enables large-scale surface exposure prediction with a 105-fold acceleration. Validation against computational and experimental data both show strong alignment. We applied SurFF for large-scale predictions of surface energy and Wulff shapes for over 6,000 intermetallic crystals, providing valuable data for the community.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$32.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 digital issues and online access to articles
$119.00 per year
only $9.92 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to the full article PDF.
USD 39.95
Prices may be subject to local taxes which are calculated during checkout





Similar content being viewed by others
Data availability
All of the datasets used in this work are also available via figshare47. The minimum data that are necessary to interpret, verify and extend the research in the article are provided via SurFF_CoreDataFiles.zip, which contains the core data files for the SurFF project, allowing the user to run the SurFF prediction for intermetalic crystals. All of this work’s data are provided via SurFF_DataFiles.zip, which contains all of the generated DFT datasets. Please refer to the GitHub link below for more detailed instructions on how to reproduce the main results and apply SurFF to prediction. Source data are provided with this paper.
Code availability
All of the codes in this work could be retrieved from GitHub repository via https://github.com/Long1Corn/SurFF or Zenodo (ref. 48). A step-by-step guide to use SurFF for prediction and a web user interface are also provided.
References
Nørskov, J. K. & Bligaard, T. The catalyst genome. Angew. Chem. Int. Ed. 52, 776–777 (2013).
Nørskov, J. K., Studt, F., Abild-Pedersen, F. & Bligaard, T. Fundamental concepts in heterogeneous catalysis (Wiley, 2015).
Jenewein, K. J., Akkoc, G. D., Kormányos, A. & Cherevko, S. Automated high-throughput activity and stability screening of electrocatalysts. Chem. Catal. 2, 2778–2794 (2022).
Back, S. et al. Convolutional neural network of atomic surface structures to predict binding energies for high-throughput screening of catalysts. J. Phys. Chem. Lett. 10, 4401–4408 (2019).
Fung, V., Hu, G., Ganesh, P. & Sumpter, B. G. Machine learned features from density of states for accurate adsorption energy prediction. Nat. Commun. https://doi.org/10.1038/s41467-020-20342-6 (2021).
Lee, J. & Jinnouchi, R. Machine learning-based screening of highly stable and active ternary Pt alloys for oxygen reduction reaction. J. Phys. Chem. C 125, 16963–16974 (2021).
Zhong, M. et al. Accelerated discovery of CO2 electrocatalysts using active machine learning. Nature 581, 178–183 (2020).
Tran, K. & Ulissi, Z. W. Active learning across intermetallics to guide discovery of electrocatalysts for CO2 reduction and H2 evolution. Nat. Catal. 1, 696–703 (2018).
Ge, X. et al. Atomic design of alkyne semihydrogenation catalysts via active learning. J. Am. Chem. Soc. 146, 4993–5004 (2024).
Jones, G., Bligaard, T., Abild-Pedersen, F. & Nørskov, J. K. Using scaling relations to understand trends in the catalytic activity of transition metals. J. Phys. Condens. Matter 20, 064239 (2008).
Getsoian, A. B., Zhai, Z. & Bell, A. T. Band-gap energy as a descriptor of catalytic activity for propene oxidation over mixed metal oxide catalysts. J. Am. Chem. Soc. 136, 13684–13697 (2014).
Tran, R. et al. Surface energies of elemental crystals. Sci. Data 3, 160080 (2016).
Cheula, R., Maestri, M. & Mpourmpakis, G. Modeling morphology and catalytic activity of nanoparticle ensembles under reaction conditions. ACS Catal. 10, 6149–6158 (2020).
Gilman, J. J. Direct measurements of the surface energies of crystals. J. Appl. Phys. 31, 2208–2218 (1960).
Eaglesham, D., White, A., Feldman, L., Moriya, N. & Jacobson, D. Equilibrium shape of Si. Phys. Rev. Lett. 70, 1643 (1993).
Ringe, E., Van Duyne, R. P. & Marks, L. Wulff construction for alloy nanoparticles. Nano Lett. 11, 3399–3403 (2011).
Zhang, K. et al. Spin-mediated promotion of Co catalysts for ammonia synthesis. Science 383, 1357–1363 (2024).
Li, Y. et al. Electrolyte-assisted polarization leading to enhanced charge separation and solar-to-hydrogen conversion efficiency of seawater splitting. Nat. Catal. 7, 77–88 (2024).
Henry, C. R. Morphology of supported nanoparticles. Prog. Surf. Sci. 80, 92–116 (2005).
Palizhati, A., Zhong, W., Tran, K., Back, S. & Ulissi, Z. W. Toward predicting intermetallics surface properties with high-throughput DFT and convolutional neural networks. J. Chem. Inf. Model. 59, 4742–4749 (2019).
Unke, O. T. et al. Machine learning force fields. Chem. Rev. 121, 10142–10186 (2021).
Chanussot, L. et al. Open Catalyst 2020 (OC20) dataset and community challenges. ACS Catal. 11, 6059–6072 (2021).
Tran, R. et al. The Open Catalyst 2022 (OC22) dataset and challenges for oxide electrocatalysts. ACS Catal. 13, 3066–3084 (2023).
Ben Mahmoud, C., Gardner, J. L. A. & Deringer, V. L. Data as the next challenge in atomistic machine learning. Nat. Comput. Sci. 4, 384–387 (2024).
Peverati, R. & Truhlar, D. G. Quest for a universal density functional: the accuracy of density functionals across a broad spectrum of databases in chemistry and physics. Phil. Trans. R. Soc. A 372, 20120476 (2014).
Jain, A. et al. Commentary: The Materials Project: a materials genome approach to accelerating materials innovation. APL Mater. https://doi.org/10.1063/1.4812323 (2013).
Lookman, T., Balachandran, P. V., Xue, D. & Yuan, R. Active learning in materials science with emphasis on adaptive sampling using uncertainties for targeted design. NPJ Comput. Mater. 5, 21 (2019).
Ren, P. et al. A survey of deep active learning. ACM Comput. Surveys 54, 1–40 (2021).
Li, J. et al. Deep learning accelerated gold nanocluster synthesis. Adv. Intel. Syst. 1, 1900029 (2019).
Liao, Y.-L., Wood, B. M., Das, A. & Smidt, T. EquiformerV2: improved equivariant transformer for scaling to higher-degree representations. In Proc. 12th International Conference on Learning Representations (ICLR, 2024).
Zitnick, L. et al. Spherical channels for modeling atomic interactions. In Proc. 36th Conference on Neural Information Processing Systems (NeurIPS, 2022).
DeepSeek, A. I. et al. DeepSeek-V2: a strong, economical, and efficient mixture-of-experts language model. Preprint at https://arxiv.org/abs/2405.04434 (2024).
Lan, X., Wang, Y., Liu, B., Kang, Z. & Wang, T. Thermally induced intermetallic Rh1Zn1 nanoparticles with high phase-purity for highly selective hydrogenation of acetylene. Chem. Sci. 15, 1758–1768 (2024).
Wang, J. et al. Redirecting dynamic surface restructuring of a layered transition metal oxide catalyst for superior water oxidation. Nat. Catal. 4, 212–222 (2021).
Feng, K. et al. Dual functionalized interstitial N atoms in Co3Mo3N enabling CO2 activation. ACS Catal. 12, 4696–4706 (2022).
Pajares, A. et al. Critical effect of carbon vacancies on the reverse water gas shift reaction over vanadium carbide catalysts. Appl. Catal. B 267, 118719 (2020).
Sanspeur, R. Y., Heras-Domingo, J., Kitchin, J. R. & Ulissi, Z. WhereWulff: a semiautonomous workflow for systematic catalyst surface reactivity under reaction conditions. J. Chem. Inf. Model. 63, 2427–2437 (2023).
Sun, W. & Ceder, G. Efficient creation and convergence of surface slabs. Surf. Sci. 617, 53–59 (2013).
Ong, S. P. et al. Python materials genomics (PyMatGen): a robust, open-source Python library for materials analysis. Comput. Mater. Sci. 68, 314–319 (2013).
Konyushkova, K., Sznitman, R. & Fua, P. Learning active learning from data. In Proc. 31st Conference on Neural Information Processing Systems (NeurIPS, 2017)
Lakshminarayanan, B., Pritzel, A. & Blundell, C. Simple and scalable predictive uncertainty estimation using deep ensembles. In Proc. 31st Conference on Neural Information Processing Systems (NeurIPS, 2017).
Yang, Y., Ma, Z., Nie, F., Chang, X. & Hauptmann, A. G. Multi-class active learning by uncertainty sampling with diversity maximization. Int. J. Comput. Vision 113, 113–127 (2015).
Passaro, S. & Zitnick, C. L. Reducing SO(3) convolutions to SO(2) for efficient equivariant GNNs. In Proc. 40th International Conference on Machine Learning (PMLR, 2023).
Geiger, M. & Smidt, T. e3nn: Euclidean neural networks. Preprint at https://arxiv.org/abs/2207.09453 (2022).
Klicpera, J., Becker, F. & Günnemann, S. GemNet: universal directional graph neural networks for molecules. In Proc. 35th Conference on Neural Information Processing Systems (NeurIPS, 2021).
Ying, C. et al. Do transformers really perform badly for graph representation? In Proc. 35th Conference on Neural Information Processing Systems (NeurIPS, 2021).
Yin, J. et al. SurFF_coredatafiles. figshare https://doi.org/10.6084/m9.figshare.26395756 (2025).
Yin, J. et al. SurFF: 1.0. Zenodo https://doi.org/10.5281/zenodo.15480651 (2025).
Ji, Y. et al. Selective CO-to-acetate electroreduction via intermediate adsorption tuning on ordered Cu–Pd sites. Nat. Catal. 5, 251–258 (2022).
Acknowledgements
This work is supported by the National Key R&D Program of China (grant no. 2022ZD0117501), the Scientific Research Innovation Capability Support Project for Young Faculty (grant no. ZYGXQNJSKYCXNLZCXM-E7), the Tsinghua University Initiative Scientific Research Program and the Carbon Neutrality and Energy System Transformation (CNEST) Program led by Tsinghua University. The primary affiliation of J.Y. is the Department of Chemical and Biomolecular Engineering, National University of Singapore.
Author information
Authors and Affiliations
Contributions
X.W. supervised the project. J.Y., X.W. and I.A.K. conceptualized this project. J.Y., H.C. and P.H. performed DFT calculations. J.Y., W.L. and J.L. trained the machine learning models. J.Q. collected experimental data from literature. X.L. and T.W. collected experimental data by experiments. All authors contributed towards writing the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Computational Science thanks the anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Kaitlin McCardle, in collaboration with the Nature Computational Science team. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary Information
Supplementary Sections 1–7, Tables 1–18 and 20–21, and Figs. 1–39.
Supplementary Table 19
Experimental observations from the literature. Observed surfaces from the literature are compared with SurFF-predicted surface facets, energies and exposure values for selected intermetallics, highlighting qualitative matches.
Source data
Source Data Fig. 2
All data points to generate Fig. 2c; Source data for Fig. 2c–e.
Source Data Fig. 3
Source data for Fig. 3a–c,e,g,h; Source data to draw 3D crystals for Fig. 3d,f.
Source Data Fig. 4
Source data to draw 3D crystals for Fig. 4b.
Source Data Fig. 5
Source data for Fig. 5b.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Yin, J., Chen, H., Qiu, J. et al. SurFF: a foundation model for surface exposure and morphology across intermetallic crystals. Nat Comput Sci 5, 782–792 (2025). https://doi.org/10.1038/s43588-025-00839-0
Received:
Accepted:
Published:
Version of record:
Issue date:
DOI: https://doi.org/10.1038/s43588-025-00839-0


