Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

npj Computational Materials
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. npj computational materials
  3. articles
  4. article
Constructing machine learning interatomic potentials with minimum amount of ab initio data
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 17 March 2026

Constructing machine learning interatomic potentials with minimum amount of ab initio data

  • Wentao Zhang1,
  • Xingxing Wu2,
  • Chen Wang3,
  • Siyu Hu3,
  • Yueyang Liu4 &
  • …
  • Lin-Wang Wang1 

npj Computational Materials , Article number:  (2026) Cite this article

  • 3367 Accesses

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Chemistry
  • Materials science
  • Mathematics and computing
  • Physics

Abstract

Machine learning interatomic potentials (MLIP) are powerful tools for using large-scale molecular dynamics (MD) to evaluate material properties, including the performance of solid-state electrolytes (SSEs). While there are many efforts for constructing universal big MLIP models, their accuracies and speeds of inference still need to be improved for many practical applications. Another approach is to develop a system-specific MLIP model relying on active learning strategy. Although much cheaper than training a big model, using the conventional procedure, it still requires large numbers of active learning loops and the corresponding DFT calculations to ensure convergency. Here, we propose a single-shot workflow that significantly accelerates small MLIP model development by leveraging the capabilities of the big model (using MACE as one example) and requiring only a few hundred additional DFT calculations. Our workflow comprises two stages, first the MACE model itself is fine-tuned to make it more accurate for the given system, second a smaller MLIP model (using NEP as one example) is distilled from the fine-tuned MACE model. We employed a MACE-driven sampling strategy, carried out additional DFT calculations without relying on active learning iterations. We show that fine-tuned MACE model can inherit the stability of the pretrained model, and fine-tuning the pretrained MACE model is much more DFT data efficient comparing to training a start-from-scratch NEP model. In the second stage, the fine-tuned MACE model provides the dataset to train the NEP model, allows the final NEP model to carry out large scale MD simulations with competitive accuracy. This integrated workflow establishes a systematic pathway for rapid MLIP development via small additional DFT dataset, with potential applications to many material systems.

Similar content being viewed by others

Learning together: Towards foundation models for machine learning interatomic potentials with meta-learning

Article Open access 17 July 2024

Accurate machine learning force fields via experimental and simulation data fusion

Article Open access 05 April 2024

Efficient and accurate spatial mixing of machine learned interatomic potentials for materials science

Article Open access 06 February 2026

Data availability

The computational data to support the findings of this study is available from the corresponding author on reasonable request.

Code availability

The code used for the MACE training is accessible from https://github.com/hyjwpk/ELoRA. The code used for NEP training is available from https://github.com/LonxunQuantum/MatPL.

References

  1. Janek, J. & Zeier, W. G. A solid future for battery development. Nat. Energy 1, 16141 (2016).

    Google Scholar 

  2. Chen, D. et al. Superionic ionic conductor discovery via multiscale topological learning. J. Am. Chem. Soc. 147, 20888–20898 (2025).

    Google Scholar 

  3. Zhang, W. et al. Rapid mining of fast ion conductors via subgraph isomorphism matching. J. Am. Chem. Soc. 146, 18535–18543 (2024).

    Google Scholar 

  4. He, X. et al. Crystal structural framework of lithium super-ionic conductors. Adv. Energy Mater. 9, 1902078 (2019).

    Google Scholar 

  5. Jun, K. et al. Lithium superionic conductors with corner-sharing frameworks. Nat. Mater. 21, 924–931 (2022).

    Google Scholar 

  6. Jun, K., Chen, Y., Wei, G., Yang, X. & Ceder, G. Diffusion mechanisms of fast lithium-ion conductors. Nat. Rev. Mater. 9, 887–905 (2024).

    Google Scholar 

  7. Zhang, W. et al. Revealing morphology evolution of lithium dendrites by large-scale simulation based on machine learning force field. Adv. Energy Mater. 13, 2202892 (2023).

    Google Scholar 

  8. Behler, J. & Parrinello, M. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 98, 146401 (2007).

    Google Scholar 

  9. Bartók, A. P., Payne, M. C., Kondor, R. & Csányi, G. Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. Phys. Rev. Lett. 104, 136403 (2010).

    Google Scholar 

  10. Zhang, Y. et al. DP-GEN: a concurrent learning platform for the generation of reliable deep learning based potential energy models. Comput. Phys. Commun. 253, 107206 (2020).

    Google Scholar 

  11. Batatia, I., Kovacs, D. P., Simm, G., Ortner, C. & Csanyi, G. MACE: higher order equivariant message passing neural networks for fast and accurate force fields. Adv. Neural Inf. Process. Syst. 35, 11423–11436 (2022).

    Google Scholar 

  12. Batatia, I. et al. The design space of E(3)-equivariant atom-centred interatomic potentials. Nat. Mach. Intell. 7, 56–67 (2025).

    Google Scholar 

  13. Deng, B. et al. CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling. Nat. Mach. Intell. 5, 1031–1041 (2023).

    Google Scholar 

  14. Chen, C. & Ong, S. P. A universal graph deep learning interatomic potential for the periodic table. Nat. Comput. Sci. 2, 718–728 (2022).

    Google Scholar 

  15. Wang, R., Gao, Y., Wu, H. & Zhong, Z. Pre-training, fine-tuning, and distillation (PFD): automatically generating machine learning force fields from universal models. Phys. Rev. Mater. 9, 113802 (2025).

    Google Scholar 

  16. Radova, M., Stark, W. G., Allen, C. S., Maurer, R. J. & Bartók, A. P. Fine-tuning foundation models of materials interatomic potentials with frozen transfer learning. npj Comput. Mater. 11, 237 (2025).

    Google Scholar 

  17. Jain, A. et al. Commentary: The materials project: a materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013).

    Google Scholar 

  18. Batzner, S. et al. E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials. Nat. Commun. 13, 2453 (2022).

  19. Fu, X. et al. Learning smooth and expressive interatomic potentials for physical property prediction. in Proceedings of the 42nd International Conference on Machine Learning 17875–17893 (PMLR, 2025).

  20. Rhodes, B. et al. Orb-v3: atomistic simulation at scale. Preprint at https://doi.org/10.48550/arXiv.2504.06231 (2025).

  21. Bochkarev, A., Lysogorskiy, Y. & Drautz, R. Graph atomic cluster expansion for semilocal interactions beyond equivariant message passing. Phys. Rev. X 14, 021036 (2024).

    Google Scholar 

  22. Batatia, I. et al. A foundation model for atomistic materials chemistry. J. Chem. Phys. 163, 184110 (2025).

    Google Scholar 

  23. Riebesell, J. et al. A framework to evaluate machine learning crystal stability predictions. Nat. Mach. Intell. 7, 836–847 (2025).

    Google Scholar 

  24. Kamaya, N. et al. A lithium superionic conductor. Nat. Mater. 10, 682–686 (2011).

    Google Scholar 

  25. Aono, H., Sugimoto, E., Sadaoka, Y., Imanaka, N. & Adachi, G. Ionic conductivity of the lithium titanium phosphate (Li1 + XMXTi2 − X(PO4)3, M = Al, Sc, Y, and La) systems. J. Electrochem. Soc. 136, 590 (1989).

    Google Scholar 

  26. Asano, T. et al. Solid halide electrolytes with high lithium-ion conductivity for application in 4 V class bulk-type all-solid-state batteries. Adv. Mater. 30, 1803075 (2018).

    Google Scholar 

  27. Qi, J., Ko, T. W., Wood, B. C., Pham, T. A. & Ong, S. P. Robust training of machine learning interatomic potentials with dimensionality reduction and stratified sampling. npj Comput. Mater. 10, 43 (2024).

  28. Henkelman, G., Uberuaga, B. P. & Jónsson, H. A climbing image nudged elastic band method for finding saddle points and minimum energy paths. J. Chem. Phys. 113, 9901–9904 (2000).

    Google Scholar 

  29. Deng, B. et al. Systematic softening in universal machine learning interatomic potentials. npj Comput. Mater. 11, 1–9 (2025).

    Google Scholar 

  30. Frey, N. C. et al. Neural scaling of deep chemical models. Nat. Mach. Intell. 5, 1297–1305 (2023).

    Google Scholar 

  31. Dunn, A., Wang, Q., Ganose, A., Dopp, D. & Jain, A. Benchmarking materials property prediction methods: the matbench test set and automatminer reference algorithm. npj Comput. Mater. 6, 138 (2020).

    Google Scholar 

  32. Fan, Z. et al. GPUMD: a package for constructing accurate machine-learned potentials and performing highly efficient atomistic simulations. J. Chem. Phys. 157, 114801(2022).

  33. Wang, C., Hu, S., Tan, G. & Jia, W. ELoRA: low-rank adaptation for equivariant GNNs. in Forty-Second International Conference on Machine Learning (ICML, 2025).

  34. Fu, X. et al. Forces are not enough: benchmark and critical evaluation for machine learning force fields with molecular simulations. in Transactions on Machine Learning Research (TMLR, 2022).

  35. He, X., Zhu, Y. & Mo, Y. Origin of fast ion diffusion in super-ionic conductors. Nat. Commun. 8, 15893 (2017).

    Google Scholar 

  36. Qi, J. et al. Bridging the gap between simulated and experimental ionic conductivities in lithium superionic conductors. Mater. Today Phys. 21, 100463 (2021).

    Google Scholar 

  37. Zhang, Y., Luo, J.-D., Yao, H.-B. & Jiang, B. Size dependent lithium-ion conductivity of solid electrolytes in machine learning molecular dynamics simulations. Artif. Intell. Chem. 2, 100051 (2024).

    Google Scholar 

  38. Wang, S., Liu, Y. & Mo, Y. Frustration in super-ionic conductors unraveled by the density of atomistic states. Angew. Chem. Int. Ed. 62, e202215544 (2023).

    Google Scholar 

  39. Yang, G. et al. Li-rich channels as the material gene for facile lithium diffusion in halide solid electrolytes. eScience 2, 79–86 (2022).

    Google Scholar 

  40. Hjorth Larsen, A. et al. The atomic simulation environment—a Python library for working with atoms. J. Phys. Condens. Matter 29, 273002 (2017).

    Google Scholar 

  41. Thompson, A. P. et al. LAMMPS - a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales. Comput. Phys. Commun. 271, 108171 (2022).

    Google Scholar 

  42. Fan, Z., Siro, T. & Harju, A. Accelerated molecular dynamics force evaluation on graphics processing units for thermal conductivity calculations. Comput. Phys. Commun. 184, 1414–1425 (2013).

    Google Scholar 

  43. Ko, T. W., Finkler, J. A., Goedecker, S. & Behler, J. A fourth-generation high-dimensional neural network potential with accurate electrostatics including non-local charge transfer. Nat. Commun. 12, 398 (2021).

    Google Scholar 

  44. Kresse, G. & Furthmüller, J. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B 54, 11169–11186 (1996).

    Google Scholar 

  45. Kresse, G. & Joubert, D. From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B 59, 1758–1775 (1999).

    Google Scholar 

  46. Perdew, J. P., Ernzerhof, M. & Burke, K. Rationale for mixing exact exchange with density functional approximations. J. Chem. Phys. 105, 9982–9985 (1996).

    Google Scholar 

  47. Zhang, B., Lin, Z., Dong, H., Wang, L.-W. & Pan, F. Revealing cooperative Li-ion migration in Li1+xAlxTi2−x(PO4)3 solid state electrolytes with high Al doping. J. Mater. Chem. A 8, 342–348 (2020).

    Google Scholar 

  48. Mo, Y., Ong, S. P. & Ceder, G. First principles study of the Li10GeP2S12 lithium super ionic conductor material. Chem. Mater. 24, 15–17 (2012).

    Google Scholar 

  49. Fu, Z.-H. et al. The chemical origin of temperature-dependent lithium-ion concerted diffusion in sulfide solid electrolyte Li10GeP2S12. J. Energy Chem. 70, 59–66 (2022).

    Google Scholar 

  50. Huang, J. et al. Deep potential generation scheme and simulation protocol for the Li10GeP2S12-type superionic conductors. J. Chem. Phys. 154, 094703 (2021).

    Google Scholar 

  51. Klimeš, J., Bowler, D. R. & Michaelides, A. Chemical accuracy for the van der Waals density functional. J. Phys. Condens. Matter 22, 22201 (2009).

    Google Scholar 

  52. Hoover, W. G. Canonical dynamics: equilibrium phase-space distributions. Phys. Rev. A 31, 1695–1697 (1985).

    Google Scholar 

  53. Nosé, S. A unified formulation of the constant temperature molecular dynamics methods. J. Chem. Phys. 81, 511–519 (1984).

    Google Scholar 

Download references

Acknowledgements

This research was funded by the National Key Research and Development Program of China (Grant No. 2024YFA1408200).

Author information

Authors and Affiliations

  1. State Key Laboratory of Optoelectronic Materials and Devices, Institute of Semiconductors, Chinese Academy of Sciences, Beijing, China

    Wentao Zhang & Lin-Wang Wang

  2. Technology Department, Beijing Lonxun Quantum Co. Ltd, Beijing, China

    Xingxing Wu

  3. Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China

    Chen Wang & Siyu Hu

  4. State Key Laboratory of Semiconductor Physics and Chip Technologies, Institute of Semiconductors, Chinese Academy of Sciences, Beijing, China

    Yueyang Liu

Authors
  1. Wentao Zhang
    View author publications

    Search author on:PubMed Google Scholar

  2. Xingxing Wu
    View author publications

    Search author on:PubMed Google Scholar

  3. Chen Wang
    View author publications

    Search author on:PubMed Google Scholar

  4. Siyu Hu
    View author publications

    Search author on:PubMed Google Scholar

  5. Yueyang Liu
    View author publications

    Search author on:PubMed Google Scholar

  6. Lin-Wang Wang
    View author publications

    Search author on:PubMed Google Scholar

Contributions

W.Z.: Conceptualization, model development, data analysis, writing-original draft. X.W. and C.W.: Model development. S.H. and Y. L.: Data analysis. L.W.: Writing and editing, data analysis, and funding acquisition.

Corresponding author

Correspondence to Lin-Wang Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary information (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, W., Wu, X., Wang, C. et al. Constructing machine learning interatomic potentials with minimum amount of ab initio data. npj Comput Mater (2026). https://doi.org/10.1038/s41524-026-02023-y

Download citation

  • Received: 24 September 2025

  • Accepted: 19 February 2026

  • Published: 17 March 2026

  • DOI: https://doi.org/10.1038/s41524-026-02023-y

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Associated content

Collection

Machine Learning Interatomic Potentials in Computational Materials

Advertisement

Explore content

  • Research articles
  • Reviews & Analysis
  • News & Comment
  • Collections
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims & Scope
  • Content types
  • Journal Information
  • Open Access
  • About the Editors
  • Contact
  • Editorial policies
  • Journal Metrics
  • About the partner

Publish with us

  • For Authors and Referees
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

npj Computational Materials (npj Comput Mater)

ISSN 2057-3960 (online)

nature.com footer links

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics