Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

SurFF: a foundation model for surface exposure and morphology across intermetallic crystals

A preprint version of the article is available at Research Square.

Abstract

With approximately 90% of industrial reactions occurring on surfaces, the role of heterogeneous catalysts is paramount. Currently, accurate surface exposure prediction is vital for heterogeneous catalyst design, but it is hindered by the high costs of experimental and computational methods. Here we introduce a foundation force-field-based model for predicting surface exposure and synthesizability (SurFF) across intermetallic crystals, which are essential materials for heterogeneous catalysts. We created a comprehensive intermetallic surface database using an active learning method and high-throughput density functional theory calculations, encompassing 12,553 unique surfaces and 344,200 single points. SurFF achieves density-functional-theory-level precision with a prediction error of 3 meV Å−2 and enables large-scale surface exposure prediction with a 105-fold acceleration. Validation against computational and experimental data both show strong alignment. We applied SurFF for large-scale predictions of surface energy and Wulff shapes for over 6,000 intermetallic crystals, providing valuable data for the community.

This is a preview of subscription content, access via your institution

Access options

Buy this article

USD 39.95

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: The overall framework of SurFF for predicting surface exposure.
Fig. 2: Dataset development via an active learning strategy.
Fig. 3: MLFF training and testing results.
Fig. 4: Evaluation of existing intermetallic crystals of known structures by the pre-training model and comparisons with literature experimental results.
Fig. 5: Model fine-tuning and transferring.

Similar content being viewed by others

Data availability

All of the datasets used in this work are also available via figshare47. The minimum data that are necessary to interpret, verify and extend the research in the article are provided via SurFF_CoreDataFiles.zip, which contains the core data files for the SurFF project, allowing the user to run the SurFF prediction for intermetalic crystals. All of this work’s data are provided via SurFF_DataFiles.zip, which contains all of the generated DFT datasets. Please refer to the GitHub link below for more detailed instructions on how to reproduce the main results and apply SurFF to prediction. Source data are provided with this paper.

Code availability

All of the codes in this work could be retrieved from GitHub repository via https://github.com/Long1Corn/SurFF or Zenodo (ref. 48). A step-by-step guide to use SurFF for prediction and a web user interface are also provided.

References

  1. Nørskov, J. K. & Bligaard, T. The catalyst genome. Angew. Chem. Int. Ed. 52, 776–777 (2013).

    Article  Google Scholar 

  2. Nørskov, J. K., Studt, F., Abild-Pedersen, F. & Bligaard, T. Fundamental concepts in heterogeneous catalysis (Wiley, 2015).

  3. Jenewein, K. J., Akkoc, G. D., Kormányos, A. & Cherevko, S. Automated high-throughput activity and stability screening of electrocatalysts. Chem. Catal. 2, 2778–2794 (2022).

    Google Scholar 

  4. Back, S. et al. Convolutional neural network of atomic surface structures to predict binding energies for high-throughput screening of catalysts. J. Phys. Chem. Lett. 10, 4401–4408 (2019).

    Article  Google Scholar 

  5. Fung, V., Hu, G., Ganesh, P. & Sumpter, B. G. Machine learned features from density of states for accurate adsorption energy prediction. Nat. Commun. https://doi.org/10.1038/s41467-020-20342-6 (2021).

  6. Lee, J. & Jinnouchi, R. Machine learning-based screening of highly stable and active ternary Pt alloys for oxygen reduction reaction. J. Phys. Chem. C 125, 16963–16974 (2021).

    Article  Google Scholar 

  7. Zhong, M. et al. Accelerated discovery of CO2 electrocatalysts using active machine learning. Nature 581, 178–183 (2020).

    Article  Google Scholar 

  8. Tran, K. & Ulissi, Z. W. Active learning across intermetallics to guide discovery of electrocatalysts for CO2 reduction and H2 evolution. Nat. Catal. 1, 696–703 (2018).

    Article  Google Scholar 

  9. Ge, X. et al. Atomic design of alkyne semihydrogenation catalysts via active learning. J. Am. Chem. Soc. 146, 4993–5004 (2024).

    Article  Google Scholar 

  10. Jones, G., Bligaard, T., Abild-Pedersen, F. & Nørskov, J. K. Using scaling relations to understand trends in the catalytic activity of transition metals. J. Phys. Condens. Matter 20, 064239 (2008).

    Article  Google Scholar 

  11. Getsoian, A. B., Zhai, Z. & Bell, A. T. Band-gap energy as a descriptor of catalytic activity for propene oxidation over mixed metal oxide catalysts. J. Am. Chem. Soc. 136, 13684–13697 (2014).

    Article  Google Scholar 

  12. Tran, R. et al. Surface energies of elemental crystals. Sci. Data 3, 160080 (2016).

    Article  Google Scholar 

  13. Cheula, R., Maestri, M. & Mpourmpakis, G. Modeling morphology and catalytic activity of nanoparticle ensembles under reaction conditions. ACS Catal. 10, 6149–6158 (2020).

    Article  Google Scholar 

  14. Gilman, J. J. Direct measurements of the surface energies of crystals. J. Appl. Phys. 31, 2208–2218 (1960).

    Article  Google Scholar 

  15. Eaglesham, D., White, A., Feldman, L., Moriya, N. & Jacobson, D. Equilibrium shape of Si. Phys. Rev. Lett. 70, 1643 (1993).

    Article  Google Scholar 

  16. Ringe, E., Van Duyne, R. P. & Marks, L. Wulff construction for alloy nanoparticles. Nano Lett. 11, 3399–3403 (2011).

    Article  Google Scholar 

  17. Zhang, K. et al. Spin-mediated promotion of Co catalysts for ammonia synthesis. Science 383, 1357–1363 (2024).

    Article  Google Scholar 

  18. Li, Y. et al. Electrolyte-assisted polarization leading to enhanced charge separation and solar-to-hydrogen conversion efficiency of seawater splitting. Nat. Catal. 7, 77–88 (2024).

    Article  Google Scholar 

  19. Henry, C. R. Morphology of supported nanoparticles. Prog. Surf. Sci. 80, 92–116 (2005).

    Article  Google Scholar 

  20. Palizhati, A., Zhong, W., Tran, K., Back, S. & Ulissi, Z. W. Toward predicting intermetallics surface properties with high-throughput DFT and convolutional neural networks. J. Chem. Inf. Model. 59, 4742–4749 (2019).

    Article  Google Scholar 

  21. Unke, O. T. et al. Machine learning force fields. Chem. Rev. 121, 10142–10186 (2021).

    Article  Google Scholar 

  22. Chanussot, L. et al. Open Catalyst 2020 (OC20) dataset and community challenges. ACS Catal. 11, 6059–6072 (2021).

    Article  Google Scholar 

  23. Tran, R. et al. The Open Catalyst 2022 (OC22) dataset and challenges for oxide electrocatalysts. ACS Catal. 13, 3066–3084 (2023).

    Article  Google Scholar 

  24. Ben Mahmoud, C., Gardner, J. L. A. & Deringer, V. L. Data as the next challenge in atomistic machine learning. Nat. Comput. Sci. 4, 384–387 (2024).

    Article  Google Scholar 

  25. Peverati, R. & Truhlar, D. G. Quest for a universal density functional: the accuracy of density functionals across a broad spectrum of databases in chemistry and physics. Phil. Trans. R. Soc. A 372, 20120476 (2014).

    Article  MathSciNet  MATH  Google Scholar 

  26. Jain, A. et al. Commentary: The Materials Project: a materials genome approach to accelerating materials innovation. APL Mater. https://doi.org/10.1063/1.4812323 (2013).

  27. Lookman, T., Balachandran, P. V., Xue, D. & Yuan, R. Active learning in materials science with emphasis on adaptive sampling using uncertainties for targeted design. NPJ Comput. Mater. 5, 21 (2019).

    Article  Google Scholar 

  28. Ren, P. et al. A survey of deep active learning. ACM Comput. Surveys 54, 1–40 (2021).

    Google Scholar 

  29. Li, J. et al. Deep learning accelerated gold nanocluster synthesis. Adv. Intel. Syst. 1, 1900029 (2019).

    Article  Google Scholar 

  30. Liao, Y.-L., Wood, B. M., Das, A. & Smidt, T. EquiformerV2: improved equivariant transformer for scaling to higher-degree representations. In Proc. 12th International Conference on Learning Representations (ICLR, 2024).

  31. Zitnick, L. et al. Spherical channels for modeling atomic interactions. In Proc. 36th Conference on Neural Information Processing Systems (NeurIPS, 2022).

  32. DeepSeek, A. I. et al. DeepSeek-V2: a strong, economical, and efficient mixture-of-experts language model. Preprint at https://arxiv.org/abs/2405.04434 (2024).

  33. Lan, X., Wang, Y., Liu, B., Kang, Z. & Wang, T. Thermally induced intermetallic Rh1Zn1 nanoparticles with high phase-purity for highly selective hydrogenation of acetylene. Chem. Sci. 15, 1758–1768 (2024).

    Article  Google Scholar 

  34. Wang, J. et al. Redirecting dynamic surface restructuring of a layered transition metal oxide catalyst for superior water oxidation. Nat. Catal. 4, 212–222 (2021).

    Article  Google Scholar 

  35. Feng, K. et al. Dual functionalized interstitial N atoms in Co3Mo3N enabling CO2 activation. ACS Catal. 12, 4696–4706 (2022).

    Article  Google Scholar 

  36. Pajares, A. et al. Critical effect of carbon vacancies on the reverse water gas shift reaction over vanadium carbide catalysts. Appl. Catal. B 267, 118719 (2020).

    Article  Google Scholar 

  37. Sanspeur, R. Y., Heras-Domingo, J., Kitchin, J. R. & Ulissi, Z. WhereWulff: a semiautonomous workflow for systematic catalyst surface reactivity under reaction conditions. J. Chem. Inf. Model. 63, 2427–2437 (2023).

    Article  Google Scholar 

  38. Sun, W. & Ceder, G. Efficient creation and convergence of surface slabs. Surf. Sci. 617, 53–59 (2013).

    Article  Google Scholar 

  39. Ong, S. P. et al. Python materials genomics (PyMatGen): a robust, open-source Python library for materials analysis. Comput. Mater. Sci. 68, 314–319 (2013).

    Article  Google Scholar 

  40. Konyushkova, K., Sznitman, R. & Fua, P. Learning active learning from data. In Proc. 31st Conference on Neural Information Processing Systems (NeurIPS, 2017)

  41. Lakshminarayanan, B., Pritzel, A. & Blundell, C. Simple and scalable predictive uncertainty estimation using deep ensembles. In Proc. 31st Conference on Neural Information Processing Systems (NeurIPS, 2017).

  42. Yang, Y., Ma, Z., Nie, F., Chang, X. & Hauptmann, A. G. Multi-class active learning by uncertainty sampling with diversity maximization. Int. J. Comput. Vision 113, 113–127 (2015).

    Article  MathSciNet  Google Scholar 

  43. Passaro, S. & Zitnick, C. L. Reducing SO(3) convolutions to SO(2) for efficient equivariant GNNs. In Proc. 40th International Conference on Machine Learning (PMLR, 2023).

  44. Geiger, M. & Smidt, T. e3nn: Euclidean neural networks. Preprint at https://arxiv.org/abs/2207.09453 (2022).

  45. Klicpera, J., Becker, F. & Günnemann, S. GemNet: universal directional graph neural networks for molecules. In Proc. 35th Conference on Neural Information Processing Systems (NeurIPS, 2021).

  46. Ying, C. et al. Do transformers really perform badly for graph representation? In Proc. 35th Conference on Neural Information Processing Systems (NeurIPS, 2021).

  47. Yin, J. et al. SurFF_coredatafiles. figshare https://doi.org/10.6084/m9.figshare.26395756 (2025).

  48. Yin, J. et al. SurFF: 1.0. Zenodo https://doi.org/10.5281/zenodo.15480651 (2025).

  49. Ji, Y. et al. Selective CO-to-acetate electroreduction via intermediate adsorption tuning on ordered Cu–Pd sites. Nat. Catal. 5, 251–258 (2022).

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by the National Key R&D Program of China (grant no. 2022ZD0117501), the Scientific Research Innovation Capability Support Project for Young Faculty (grant no. ZYGXQNJSKYCXNLZCXM-E7), the Tsinghua University Initiative Scientific Research Program and the Carbon Neutrality and Energy System Transformation (CNEST) Program led by Tsinghua University. The primary affiliation of J.Y. is the Department of Chemical and Biomolecular Engineering, National University of Singapore.

Author information

Authors and Affiliations

Authors

Contributions

X.W. supervised the project. J.Y., X.W. and I.A.K. conceptualized this project. J.Y., H.C. and P.H. performed DFT calculations. J.Y., W.L. and J.L. trained the machine learning models. J.Q. collected experimental data from literature. X.L. and T.W. collected experimental data by experiments. All authors contributed towards writing the manuscript.

Corresponding author

Correspondence to Xiaonan Wang  (王笑楠).

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Computational Science thanks the anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Kaitlin McCardle, in collaboration with the Nature Computational Science team. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Sections 1–7, Tables 1–18 and 20–21, and Figs. 1–39.

Peer Review File

Supplementary Table 19

Experimental observations from the literature. Observed surfaces from the literature are compared with SurFF-predicted surface facets, energies and exposure values for selected intermetallics, highlighting qualitative matches.

Source data

Source Data Fig. 2

All data points to generate Fig. 2c; Source data for Fig. 2c–e.

Source Data Fig. 3

Source data for Fig. 3a–c,e,g,h; Source data to draw 3D crystals for Fig. 3d,f.

Source Data Fig. 4

Source data to draw 3D crystals for Fig. 4b.

Source Data Fig. 5

Source data for Fig. 5b.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yin, J., Chen, H., Qiu, J. et al. SurFF: a foundation model for surface exposure and morphology across intermetallic crystals. Nat Comput Sci 5, 782–792 (2025). https://doi.org/10.1038/s43588-025-00839-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • Issue date:

  • DOI: https://doi.org/10.1038/s43588-025-00839-0

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing