A foundation machine learning potential with polarizable long-range interactions for materials modelling

Gao, Rongzhi; Yam, ChiYung; Mao, Jianjun; Chen, Shuguang; Chen, GuanHua; Hu, Ziyang

doi:10.1038/s41467-025-65496-3

Download PDF

Article
Open access
Published: 25 November 2025

A foundation machine learning potential with polarizable long-range interactions for materials modelling

Nature Communications volume 16, Article number: 10484 (2025) Cite this article

7750 Accesses
5 Citations
5 Altmetric
Metrics details

Subjects

Abstract

Long-range interactions are essential determinants of chemical system behavior across diverse environments. We present a foundation framework that integrates explicit polarizable long-range physics with an equivariant graph neural network potential. It employs a physically motivated polarizable charge equilibration scheme that directly optimizes electrostatic interaction energies rather than partial charges. The foundation model, trained across the periodic table up to Pu, demonstrates strong performance across key materials modeling challenges. It effectively captures long-range interactions that are challenging for traditional message-passing mechanisms and accurately reproduces polarization effects under external electric fields. We have applied the model to mechanical properties, ionic diffusivity in solid-state electrolytes, ferroelectric phase transitions, and reactive dynamics at electrode-electrolyte interfaces, highlighting the model’s capacity to balance accuracy and computational efficiency. Furthermore, we show that as a foundation model, it can be efficiently finetuned to achieve high-level accuracy for specific challenging systems.

Incorporating long-range electrostatics in neural network potentials via variational charge equilibration from shortsighted ingredients

Article Open access 04 March 2024

Machine learning potentials for metal-organic frameworks using an incremental learning approach

Article Open access 06 February 2023

Pushing charge equilibration-based machine learning potentials to their limits

Article Open access 16 September 2025

Introduction

Molecular dynamics (MD) simulations are indispensable in describing phenomena at the atomic level in disciplines such as chemistry, materials science, and biology^1,2,3,4. While ab initio MD (AIMD)⁵ simulation approaches offer unparalleled accuracy, their computational demands constrain their application to small timescales and length scales. On the other hand, classical MD (CMD)^6,7 simulations offer computational efficiency, but their accuracy is contingent upon empirical parameters. Machine learning interatomic potentials (MLIPs)^8,9,10,11 provide a solution that balances accuracy and computational efficiency. In particular, MLIPs based on equivariant graph neural networks, such as NequIP¹², DimeNet¹³, and MACE¹⁴, achieve excellent performance by introducing equivariant and invariant symmetries. Moreover, universal foundation models trained on the periodic table, such as M3GNet¹⁵, CHGNet¹⁶, GNoME¹⁷, MACE-MP-0¹⁸, MatterSim¹⁹, SevenNet²⁰, and ORB²¹ have emerged, showing remarkable prospects for materials discovery.

While existing local MLIPs with a cut-off of around 5 Å perform well in simulating interactions within localized chemical environments, they may fail to capture long-range phenomena. This hinders their ability to understand and elucidate the behaviors of complex materials²². One approach is to implicitly incorporate the effects of long-range interactions through message-passing mechanisms²³, which can extend the cut-off through layer-wise propagation. However, if certain parts of the system are disconnected on the graph, such as two molecules separated by a distance beyond the cut-off value, the message-passing scheme becomes ineffective²⁴. A promising direction involves investigating the potential advantages of explicitly including long-range interactions in the model formulation. These behaviors encompass electrostatic interactions, involving forces between charged particles, and dispersion terms, which arise from dynamic fluctuations in electron distribution within molecules or atoms.

Various explicit models have been proposed to address the challenges associated with long-range interactions by including electrostatic interactions. In these methods, charges are usually obtained through charge equilibration (QEq)²⁵ principles, including the Charge Equilibration via Neural Network Technique (CENT) method²⁶, Fourth-generation High-Dimensional Neural Network Potential (4G-HDNNP)^27,28, and Charge Equilibration Layer for Long-range Interactions (CELLI)²⁹ method. Generally, electrostatic parameters were trained to minimize the difference between QEq charges and reference partial charges derived from quantum mechanics (QM) calculations. Notably, partial charges obtained from QM calculations are either derived from the direct partitioning of the molecular wave function into atomic contributions or calculated based on the analysis of physical observables. However, due to the incompleteness of basis sets and variances in the partitioning methods, properties calculated based on partial charges may not always be reliable^16,30. The ambiguity in DFT-assigned charges suggests that directly learning of them may be inessential for constructing accurate interatomic potentials^26,31,32. On the contrary, the polarizable charge equilibration (PQEq)^{30,33,34,35,36} method proposed by Naserifar et al. enhances the QEq method by using QM electrostatic interaction energies instead of partial charges as targets for evaluating interatomic potentials. Additionally, PQEq has proven to be effective in capturing polarization effects by explicitly introducing core-shell displacement. Another way to solve the partition-dependent issue is to take the dipole moment into the loss function, as done by SpookyNet³⁷. However, it still uses point-like charges, making it behaves like QEq-based methods when dealing with polarization effects.

In this work, we introduce a framework that integrates the equivariant message-passing neural network potentials with the explicit polarizable long-range potential. We initially conducted benchmark tests on datasets containing systems with various charge states proposed by references^28,38, showcasing the efficacy of our framework in handling both long-range interactions and varying charge environments. Building on these results, we extended the framework to develop a foundation model encompassing the periodic table up to Pu, while maintaining computational efficiency. We have applied this foundation model across different areas of applications, including the predictions of mechanical properties of materials, phase transitions in ferroelectric materials, and reactive molecular dynamics of solid-state batteries interface, thus leading to significant advancements in the fields of materials science and chemical simulations. The model also demonstrates transferability, accurately predicting interactions between clusters in various charge states and molecular polarizability. Furthermore, we show that the foundation model can be efficiently finetuned to achieve ab initio accuracy for specific challenging systems.

Results

Theoretical framework

Our framework, which integrates neural network potential and polarizable long-range interactions, is depicted in Fig. 1. For a given chemical system and boundary conditions, our goal is to construct a mapping from atomic coordinates ${{{{\bf{r}}}}}_{i}$ and atomic types ${Z}_{i}$ to the total potential energy ${E}_{{{{\rm{pot}}}}}$. The potential energy is expressed as the sum of the second-order expansion³⁰ with respect to charge fluctuations and the density functional theory³⁹ D3 (DFT-D3) van der Waals dispersion energies correction term^40,41 ${E}_{{{{\rm{D}}}}3}$,

$${E}_{{{{\rm{pot}}}}} ={\sum }_{i}\left(\,{E}_{0}^{i}\left({{{{\bf{r}}}}}_{i},\,{Z}_{i}\right)+{\chi }_{i}^{0}{q}_{i}+\frac{1}{2}{\eta }_{i}^{0}{q}_{i}^{2}+\frac{1}{2}{K}_{s}^{i}{\left|{{{{\bf{r}}}}}_{{ic}}-{{{{\bf{r}}}}}_{{is}}\right|}^{2}\right)\\ \;\;\;+{\sum }_{{ik} > {jl}}{C}_{{ik},{jl}}\left({{{{\bf{r}}}}}_{{ik},{jl}}\right){q}_{{ik}}{q}_{{jl}}+{E}_{{{{\rm{D}}}}3}\\ ={\sum }_{i}{E}_{0}^{i}+{E}_{{{{\rm{PQEq}}}}}+{E}_{{{{\rm{D}}}}3}.$$

(1)

The zeroth-order atomic energy ${E}_{0}^{i}$ corresponds to the final layer scalar output of the equivariant graph neural networks, while the higher-order self- and interatomic Coulomb interactions represent the polarizable long-range electrostatic interactions ${E}_{{{{\rm{PQEq}}}}}$. To account for charge transfer and polarization effects, the partial atomic charge ${q}_{i}$ of atom i is the sum of the nuclear core charge ${q}_{{ic}}$ and shell charge ${q}_{{is}}$, both of which assume a Gaussian charge distribution form. The first-order coefficients ${\chi }^{0}$ are electronegativities, commonly defined as half of the sum of ionization potential (IP) and electron affinities (EA). The second-order coefficients ${\eta }^{0}$ signifies idempotential or chemical hardness, defined as ${{{\rm{IP}}}}-{{{\rm{EA}}}}$. The spring constant ${K}_{s}^{i}$ denotes the isotropic harmonic connectivity between the shell position ${{{{\bf{r}}}}}_{{is}}$ and core position ${{{{\bf{r}}}}}_{{ic}}$ (equal to ${{{{\bf{r}}}}}_{i}$) of atom i. The Gaussian electrostatic energy is given by $C\left({{{{\bf{r}}}}}_{{ik},{jl}}\right){q}_{{ik}}{q}_{{jl}}$, where i and j are the atomic indices, and k and l represent the core (c) or shell (s), respectively. The displacement vector is given by ${{{{\bf{r}}}}}_{{ik},{jl}}={{{{\bf{r}}}}}_{{ik}}-{{{{\bf{r}}}}}_{{jl}}$. The detailed derivation of the response to the external electric field can be found in the Supplementary Note 1. In previous works, PQEq parameters for 102 elements have been derived from experimental data or high-level QM calculations^25,42, and have accurately reproduced QM electrostatic interaction energies³⁰. In the equivariant neural networks¹², the initial features are generated using a trainable one-hot embedding that operates on the atomic types. The interatomic distance of atom i and atom j, denoted as $|{{{{\bf{R}}}}}_{{ij}}|$, is expanded by radial basis functions. Concurrently, the directional component of the interatomic vectors, expressed as ${{{{\bf{R}}}}}_{{ij}}/|{{{{\bf{R}}}}}_{{ij}}|$, is expanded by spherical harmonics functions. These features are utilized to construct the atomic graph. Then, through the equivariant message passing²³ schemes, the atomic features are updated. A multilayer perceptron layer is then connected to derive the scalar output. It should be noted that the neural network potential component is specifically designed to capture energy contributions distinct from electrostatic interactions, thereby ensuring no overlap in energy accounting between different components. Subject to the conservation of the net charge and the equality of chemical potentials for all atoms, the linear equations can be used to update partial charges (${q}_{i}$) and shell positions (${{{{\bf{r}}}}}_{{is}}$). The partial charges can be obtained self-consistently in each MD step. The forces on atoms and virial stress on the cell can be calculated via automatic differentiation after partial charges are determined, as detailed in Supplementary Note 2.

Validation on diverse charge-state datasets

To evaluate the capability of our framework in capturing different charge states and long-range interactions, we validated our framework against the dataset developed by Ko et al.²⁸ and Maruf et al.³⁸, which encompass various charge states and charge transfer systems. They contain six distinct subsets: Ag cluster with positive and negative total charge, (Ag₃^+/−), Na-Cl ionic cluster with one neutral Na removed (Na_8/9Cl₈⁺), hydrogenated carbon chains in both neutral and cationic states (C₁₀H₂/C₁₀H₃⁺), a periodic system consisting of Au clusters adsorbed on an MgO-(001) surface, and benzotriazole interactions with Cu-(111) surfaces in dry (BTA-Cu) and aqueous (BTA (H₂O)-Cu) environments. We demonstrate the advantage of integrating equivariant message-passing networks with explicitly polarizable long-range interaction models by benchmarking against both 4G-HDNNP, a local MLIP with explicit long-range interactions, and NequIP, which implicitly captures long-range effects through its message-passing mechanism. 4G-HDNNP differentiates charged systems by explicitly training DFT partial charges and incorporating them as descriptors into neural networks for short-range interactions. We employed the NequIP models trained in the reference³⁸ (Maruf’s NequIP) as our baseline equivariant model. Our framework explicitly incorporates long-range interactions while adopts Maruf’s NequIP as the E₀ component without touching its settings, thereby ensuring a fair comparative analysis. The results are presented in Table 1, with detailed neural network architectural specifications provided in the Supplementary Table 3.

Table 1 A comparison of root-mean-square test error metrics across different charge-state datasets^28,38 was performed among 4G-HDNNP²⁸, Marfu’s NequIP³⁸, and our work

Full size table

For the carbon chains, the introduction of additional H⁺ causes some C atoms to exhibit opposite charge states⁴³, as shown in Supplementary Fig. 1a. Although we did not fit partial charges, our model qualitatively captures the distribution of DFT partial charges, as shown also in Supplementary Fig. 1a. In the linear C₁₀H₂ molecule, our model demonstrates symmetric charge distribution around the center which shows agreement with DFT. After explicitly incorporating physical long-range interactions, our framework achieved additional improvements in error metrics compared to NequIP and 4G-HDNNP. For positively charged NaCl clusters, the predicted energy and forces by our model are still more accurate compared to 4G-HDNNP, as well as the potential energy surface shown in Supplementary Fig. 1b. For Au₂ adsorption on MgO surfaces and benzotriazole interactions with Cu-(111) surfaces in dry and aqueous environments, our model outperforms NequIP significantly. Additionally, our model correctly predicted the preferential adsorption geometries for both doped and undoped MgO, as listed in Supplementary Table 1. As for the charged Ag clusters, due to their small system size and lacking long-range effects, the entire structure falls within the typical cutoff radius used by the MLIP. The energy dependence on the overall charge state of clusters leads to degeneracy between atomic structures and potential energy surfaces, resulting in poor performance of the charge-independent NequIP model. We acknowledge that for such small, highly charged systems, the 4G-HDNNP approach to learning charge equilibration parameters is more convenient. Nonetheless, we also achieved reduced force error with the fixed physical parameters compared to 4G-HDNNP. Although PQEq partial charges are not directly comparable to DFT Hirshfeld ones, we compare them to show their relevance. Varying degrees of agreement across different chemical systems are observed, as shown in Supplementary Table 2.

Our comprehensive evaluation demonstrates that explicit incorporation of physical long-range interactions significantly enhances the performance of MLIPs across diverse charge-state systems. Notably, our proposed framework outperforms 4G-HDNNP in most cases for energies and forces without fitting partial charges. This advantage likely stems from a more physically meaningful partition of the total energy.

Foundation model benchmark

We trained a foundation model for all the periodic table elements up to Pu using the MPtrj dataset¹⁶ following our framework (our model), as described in the Methods section. We also trained a model maintaining the same MLIPs framework and dataset, but without the polarizable long-range interactions (w/o-lr model). The former one demonstrated mean absolute errors (MAE) of 18 meV per atom for energies, 0.065 eV$\cdot$Å⁻¹ for forces, and 0.301 GPa for stresses across the test set, outperforming the w/o-lr model, as shown in Supplementary Table 4. These proved the benefit of incorporating long-range interactions in the enhancement of prediction accuracy.

Message-passing mechanisms will fail when graph connections break in the system. For instance, in dimers, when the interatomic distance exceeds the graph cut-off, the atoms no longer interact with each other. We analysed interaction energies for Na-Na and Na-Cl dimers across interatomic distances ranging from 1.8 Å to 12.0 Å. As shown in Fig. 2a, both models (our model and the w/o-lr model) trained on the MPtrj dataset accurately reproduce the equilibrium bond lengths of the two dimers. However, the w/o-lr model failed to differentiate interaction energies beyond the MLIPs cutoff distance, i.e., 5.0 Å, while our model successfully captured the DFT potential energy surface throughout the entire distance range, demonstrating its capability in describing extended ionic interactions. Moreover, the message-passing mechanism exhibits deficiencies when simulating layered materials. Taking lithium iron phosphate (LiFePO₄) as a case study, we observed that the w/o-lr model predicted an irreversible phase transition, as shown in Supplementary Fig. 3. In contrast, our model correctly reproduced the experimentally⁴⁴ and AIMD⁴⁵ thermally stable olivine structure. These results collectively demonstrate that the explicit long-range interactions are crucial for accurately describing both ionic systems and layered materials, particularly in predicting structural stability and thermal behavior.

**Fig. 2: Benchmarking of the foundation model.**

Polarization plays a fundamental role in determining molecular responses to external electric fields. Conventional QEq-based models have limitations in their response to an external electric field applied orthogonally to the molecular dipole. We validated this capability through a benchmark study examining a water molecule’s response to external electrostatic fields⁴⁶. The water molecule was fixed in the yz-plane, and we applied varying external electric fields along the x-axis to derive energy curves, with the results presented in Fig. 2b. The model with polarizable long-range interactions demonstrates a trend consistent with the reference values obtained from DFT. Due to the external electrostatic field direction consistently being orthogonal to the molecule, the electrostatic energy in the QEq-based model (like 4G-HDNNP) is 0 and is completely independent of field strength (i.e., ${{{\mathbf{\mu }}}} \, {{\cdot }} \, {{{\mathbf{\epsilon }}}}=0$, where ${{{\mathbf{\mu }}}}$, ${{{\mathbf{\epsilon }}}}$ are dipole moment and electric field, respectively). This is because the QEq model treats atoms in the molecule as point charges or Gaussian charges, without distinguishing between core and shell components. We also compared the response of O atom partial charges across DFT Hirshfeld calculations, our model, and the conventional QEq model, as shown in Supplementary Fig. 4. The agreement between the PQEq model and DFT is observed, while the QEq model completely lacks response. This comparative analysis establishes that the explicit polarizable interactions are crucial for modeling molecular systems subject to external electric fields.

Besides the failure of response in static conditions, the QEq method may exhibit non-physical behavior in dynamics⁴⁷. As illustrated in Supplementary Fig. 5, we conducted additional investigations of two water molecules under an external electric field of 0.25 V$\cdot$Å⁻¹. In charge distribution models, it is essential to allow systems to form an internal electric field opposing the external field. The QEq model assumes the system behaves as a perfect conductor without penalizing charge transfer as a function of distance. This limitation of the QEq method leads to non-physical charge transfer between molecules separated by large distances⁴⁷. In contrast, within the PQEq model, the individual water molecules can polarize, generating internal electric fields that counteract the external field, thereby capturing more realistic electrostatic responses in molecular systems. Under the QEq scheme, the water molecules accumulate non-zero net charges and migrate to the top and bottom of the simulation box. In contrast, our model maintains charge neutrality of the water molecules, which remain stationary despite the applied electric field, aligning with DFT results.

We conducted benchmarking using properties that were not labeled during the training process. We applied our foundation model to predict the mechanical properties of 10,154 materials from the Materials Project⁴⁸. Figure 2c–e illustrates the comparison of the bulk modulus $B$ determined by the foundation model and DFT. Our model demonstrated impressive performance, achieving an R² of 0.94 with the Voigt approach ${B}_{{{{\rm{V}}}}}$ and Hill average method ${B}_{{{{\rm{VRH}}}}}$, and 0.93 using the Reuss method ${B}_{{{{\rm{R}}}}}$. In comparison, as shown in Supplementary Fig. 2, the w/o-lr model achieved lower R² values of 0.89, 0.86, and 0.88 for ${B}_{{{{\rm{V}}}}}$, ${B}_{{{{\rm{R}}}}}$, and ${B}_{{{{\rm{VRH}}}}}$, respectively. These not only highlight the robust performance of the model but also establish a solid foundation for the high-throughput screening of materials with exceptional mechanical properties.

Transferability of the foundation model

In terms of foundation model transferability, evaluations were conducted across diverse systems. The model accurately reproduces the potential energy surfaces and successfully distinguishes between neutral and ionic states in OH-OH systems, capturing the long-range Coulombic repulsion of ionic state as shown in Supplementary Fig. 8. Further validation using a periodic water system demonstrates the model’s capability to predict polarization effects under varying electric fields, achieving comparable performance to specialized models as depicted in Supplementary Fig. 10. Additionally, as shown in Supplementary Fig. 11, when tested on 7211 molecules from the QM-7b dataset⁴⁹, the model shows remarkable accuracy in predicting molecular polarizability with a mean absolute error of 4.57 atomic units (a.u.), highlighting its robust transferability in capturing polarization.

Computational efficiency

Despite the incorporation of additional long-range interaction calculations, our foundation model maintains high computational efficiency. Performance benchmarks were conducted on a single NVIDIA H100 GPU to quantitatively assess the computational cost. As illustrated in Supplementary Fig. 6, for a periodic system comprising 2160 atoms, our model requires approximately 0.07 s per molecular dynamics time step, representing a significant improvement over conventional universal MLIPs, which require approximately 0.91 s. This computational advantage extends to larger systems, as demonstrated by simulations of a 24000-atom system, where our model maintains efficiency with only 1.49 s per time step. Such computational performance enables nanosecond-scale molecular dynamics simulations of systems containing tens of thousands of atoms, making it practical for more realistic modeling of materials.

Materials modeling applications

To demonstrate the capability of our model to reproduce results from AIMD simulations, we simulate the kinetic transport properties of a solid-state electrolyte. Specifically, we investigated lithium-ion diffusivity within the cubic phase of Li₇La₃Zr₂O₁₂ (c-LLZO)⁵⁰, a crystalline superionic conductor known for its remarkable stability as a lithium conductor. Owing to the efficiency of our foundation model, we are able to conduct MD simulations on the c-LLZO comprising 64 formula units, with a duration of 2 ns for each temperature range from 800 K to 1800K. Figure 3a presents the Arrhenius plots derived from our model, w/o-lr model, and AIMD⁵¹ simulations. The diffusion coefficients were determined from the slope of the logarithmic mean square displacements (MSD) versus logarithmic time within the Fickian regime⁵². MSD curves of our model are depicted in Fig. 3b. Compared to previous work using AIMD simulations, our model clearly reproduces the diffusion coefficients and activation energy. The performance surpasses that of the w/o-lr model, which lacks explicit long-range interactions. With the ability to reach larger model sizes and longer simulation times, our model achieves these results with significantly reduced uncertainties. This capability not only delivers comparable results to AIMD simulations but also enables larger-scale simulations that cannot be assessed by AIMD, enhancing the statistical significance of MD studies. Consequently, the framework developed in this work provides more reliable and comprehensive data for diffusion analysis in solid-state electrolytes.

Fig. 3: Ionic diffusivity of cubic phase Li7La3Zr2O12. — **Fig. 3: Ionic diffusivity of cubic phase Li₇La₃Zr₂O₁₂.**

To show the application of polarizable potentials, we performed MD simulations on a typical perovskite ferroelectric material, BaTiO₃. As temperature increases, BaTiO₃ undergoes a sequential phase transition process, transforming from rhombohedral to orthorhombic, then to tetragonal, and finally to a cubic structure⁵³. Accurately simulating phase changes in ferroelectric substances requires precise potential energy functions that can respond to the small atomic shifts and structural changes, as well as account for the free energy landscape under the conditions of finite-temperature thermodynamics⁵⁴. The phase sequence of BaTiO₃ has been extensively studied using effective Hamiltonians^55,56,57,58, CMD^59,60, reactive force fields (ReaxFF)⁶¹, specialized MLIPs models trained on the BaTiO₃ system^54,62, and experiments⁶³. To detect lattice distortions that distinguish different phases, a 10 × 10 ×10 supercell of BaTiO₃ was simulated. The use of larger simulation cells (10 × 10 × 10 in our work versus 4 × 4 × 4 in Gigli et al.⁵⁴) may provide some advantages for phase transition studies. Larger supercells can effectively reduce the impact of temperature fluctuations, enabling better temperature control for sampling the free energy landscape, thereby yielding results with greater statistical significance. Figure 4 illustrates the simulation results, which clearly identify four distinct phases and the three first-order phase transitions within the simulated temperature range. Below 145 K, the overall average polarization of the supercell aligns with the (111) direction, indicative of the rhombohedral phase. At 145 K, a phase transition to orthorhombic is suggested as the y-component of polarization approaches zero. With further increase to 205 K, the polarization predominantly orients along the x-direction, indicating the tetragonal phase. This phase persists until 285 K, where the cubic paraelectric phase is observed. The obtained phase transition temperatures align closely with those predicted by various models and experiments, as listed in Table 2. It is worth noting that the supercell size can affect phase transition temperatures, so that direct comparison between different simulations is not expected. Utilizing first principles methods or fitting potential energy surfaces to first principles data often underestimates these temperatures, which is attributed to the approximated exchange-correlation functional⁵⁹ employed in DFT. Although the predicted phase transition temperatures are lower than the experimentally observed values, the model effectively captures the sequence of phase transitions as measured in experiments, demonstrating the efficacy of the foundation model in modeling highly polarized phase transitions in ferroelectric materials.

Fig. 4: Phase transition of BaTiO3. — **Fig. 4: Phase transition of BaTiO₃.**

Table 2 The phase transition temperatures of BaTiO₃ obtained by effective Hamiltonian^56,57, second principles⁵⁸, reactive force field (ReaxFF)⁶¹, specialized machine learning potential (MLIP)⁵⁴ and experiments⁶³. The size of the supercell is also displayed. ${T}_{{{{\rm{c}}}},\,{{{\rm{R}}}}-{{{\rm{O}}}}}$, ${T}_{{{{\rm{c}}}},\,{{{\rm{O}}}}-{{{\rm{T}}}}}$, and ${T}_{{{{\rm{c}}}},\,{{{\rm{T}}}}-{{{\rm{C}}}}}$ are the phase transition temperatures of rhombohedral–orthorhombic, orthorhombic–tetragonal, and tetragonal–cubic, respectively

Full size table

The final application studied in this work involves all-solid-state batteries, which represent a breakthrough in the evolution of next-generation energy solutions, owing to their superior energy density and inherent safety features⁶⁴. Lithium thiophosphate, known for its exceptional ionic conductivity ( ~ 10⁻³ S cm⁻¹), is deemed the most promising candidate for solid electrolytes and has been widely studied through experiments^65,66,67 and computer modeling^68,69,70. Notably, the nanoporous β-Li₃PS₄ has been validated to exhibit outstanding cycling stability⁷¹, presumably attributed to the formation of Li₂S and Li₃P solid-electrolyte interphases (SEI) during the initial battery cycles, which serves to passivate further degradation of the electrolyte^69,72,73,74.

To investigate the formation of SEI, we utilized the foundation model to simulate the interfacial reactive MD of the solid electrolyte β-Li₃PS₄ in contact with the lithium metal anode, encompassing a system of 13,760 atoms. The initial β-Li₃PS₄ /Li interfacial structure, measuring 33.15 nm × 3.1 nm × 2.6 nm, was utilized in the MD simulations, as depicted in Fig. 5a. The highly reactive Li metal began to interact with the PS₄ tetrahedron at the interface, triggering the formation of the SEI layer as illustrated in Fig. 5b. With the growth of the SEI layer, electrons were transferred from the Li anode to the Li₃PS₄ electrolyte, which causes a gradual increase of the partial charges of Li as they migrate from the anode, across the SEI, and into the electrolyte. This results in a transition from metal lithium to ions. The dynamic behavior of partial charges at the anode-electrolyte interface is a critical aspect showcased in Fig. 5c. Initially, the partial charges of lithium changed almost linearly from the anode to the electrolyte. Over time, this distribution evolved and showed a distinct plateau. This phenomenon could be attributed to the ordered structuring of the SEI, signifying the nucleation of crystalline structures. Ultimately, a stable lithium partial charge plateau was formed between the anode and the electrolyte, spanning a range of 15 to 21.5 nm. This is further supported by the visualized structure depicted in Fig. 5b, where the plateau within the SEI aligns with the Li₂S crystal and amorphous regions. Chen et al.⁶⁸ employed AIMD simulations to investigate the radial distribution functions (RDF) of Li-P and Li-S bonds, which concluded that the decompositions of electrolytes in lithium thiophosphates are primarily due to the decomposition of PS₄ tetrahedron by the active lithium metal, leading to the formation of new Li-P and Li-S bonds. Our RDF analysis confirmed the formation of Li-S and Li-P bonds within the SEI and electrolyte as depicted in Fig. 5d. In agreement with the AIMD simulations⁶⁸, we observed the emergence of a new Li-P peak within the SEI at approximately 2.5 Å. Furthermore, within the SEI, the Li-S bond displayed RDF characteristics consistent with crystalline Li₂S⁷⁵. For the Li-P bonds, the peak shape closely resembles that of amorphous Li₃P AIMD simulations⁷⁶.

Fig. 5: Visualization of solid electrolyte interphase between Li anode and Li3PS4 electrolyte. — **Fig. 5: Visualization of solid electrolyte interphase between Li anode and Li₃PS₄ electrolyte.**

As illustrated in Fig. 6a, b, the ultimate decomposition products of P form only short-range ordered structures, in contrast to S, which forms long-range ordered structures. This observation agrees with the measurements where Li₂S crystals were detected but Li₃P crystals were absent in cryogenic transmission electron microscopy (cryo-TEM) experiments, as noted in the reference⁶⁶. To enhance the understanding of the formation mechanism, the atomic compositions of the SEI in both crystalline and amorphous Li₂S regions were examined. As depicted in Fig. 6c, d, within the first 2 ns, the emergence of amorphous Li₂S and Li₃P regions was attributed to the swift diffusion of P and S atoms. Conversely, the formation of crystalline Li₂S regions was predominantly influenced by the swift diffusion of Li atoms and the slower diffusion of P and S atoms. As the system further evolves to 4 ns, the diffusions slowdown, which also confirmed that the formation of crystalline Li₂S region hindered further atomic diffusion and thus slowed the growth of the SEI. As illustrated in Fig. 5b, during the initial phase of decomposition (0–100 ps), the swift diffusion of P, S, and Li led to the formation of an interface of about 3 nm thick. However, in a later stage between 100 and 2000 ps, the formation of Li₂S nuclei impeded the diffusion of P and S atoms, resulting in a period of sluggish SEI growth. Consequently, the interface experienced only a slight expansion in this period, increasing by only 1 nm. Furthermore, in the following 2 ns, there was essentially no SEI growth. The final interface structure ( ~ 8.5 nm) could be characterized by a 4.5 nm crystalline Li₂S region sandwiched between two 2-nm transitional layers. The cessation of growth at this interface suggests that the crystalline Li₂S and amorphous Li₃P within the crystal region contribute to the stabilization of both the lithium metal anode and the electrolyte, which agrees with the theoretical predictions made by DFT calculations⁷².

Fig. 6: The formation mechanism of Li/Li3PS4 solid electrolyte interphase layer. — **Fig. 6: The formation mechanism of Li/Li₃PS₄ solid electrolyte interphase layer.**

Additionally, we studied the interface reactions between Li₆PS₅Cl and lithium metal, as detailed in Supplementary Note 13. As shown in Supplementary Fig. 13, at a pressure of 1 atm and a temperature of 300 K, Li₆PS₅Cl formed an interface of about 5-nm thick after 2 ns. This indicates that the Li₆PS₅Cl electrolyte exhibits a higher stability against lithium metal at room temperature compared to Li₃PS₄. Upon elevating the temperature to 500 K, we detected Li₂S at the grain boundaries, which aligns with the cryo-TEM observations⁶⁶.

Finetuning for enhanced accuracy

While our foundation model demonstrates general performance, achieving ab initio accuracy for specific systems requires finetuning. Using Na_8/9Cl₈⁺ clusters and Au₂ dimers on MgO surfaces from Ko et al.²⁸ as test cases, we performed targeted finetuning using 20% subsets of configurations and mean squared force error as the sole loss function. After finetuning, our model achieved good agreement with DFT references, accurately distinguishing potential energy surfaces for positive charged clusters and surface conditions, as shown in Supplementary Fig. 12 and Supplementary Table 6. This success demonstrates the effectiveness of the two-step approach: foundation pretraining followed by efficient finetuning, enabling quantum-accurate predictions while maintaining the model’s fundamental physical insights and transferability.

Discussion

In this study, we introduced a framework that integrates equivariant machine learning interatomic potentials with explicit long-range polarizable interactions. QEq-based models inherently struggle with atomic polarization because they do not distinguish between core and shell components. Due to their inability to handle atomic-level responses to external fields, they may exhibit non-physical behaviors in some cases, as described in the reference⁴⁷. Our framework incorporates polarizable long-range interactions and accurately captures atomic polarization in the presence of external electric fields. Since our framework decomposes the total energy into electrostatic energy and remaining energies that only depend on atomic types and positions, any future machine learning techniques and/or high-order electrostatic energy expansion schemes can be readily integrated into it, empowering possible accuracy enhancement.

Our foundation model successfully reproduced DFT potential energy surfaces for Na-Cl/Na-Na dimers across extended interatomic distances, outperforming conventional message-passing MLIP beyond the cut-off regions. The model demonstrates exceptional versatility across multiple materials science challenges, including accurate prediction of thermal stability in complex layered materials, high-fidelity capture of molecular responses to external electric fields, and reliable prediction of bulk modulus. The model has been also successfully applied to study dynamic properties and reactive molecular dynamics, such as lithium-ion diffusion in solid-state electrolytes and temperature-dependent phase transitions in ferroelectric materials like BaTiO₃, while also elucidating interface formation mechanisms between lithium thiophosphate electrolytes and lithium metal anodes in solid-state batteries.

Beyond these specific applications, we evaluated the model’s transferability, a critical attribute for a foundation model. The model accurately reproduces the potential energy surfaces for OH-OH systems, correctly distinguishing between the neutral state and the long-range Coulombic repulsion of the ionic state. The model achieved a remarkable mean absolute error of just 4.57 a.u. for molecular polarizability, highlighting its robust and generalizable learned representation of polarization physics. Finally, we demonstrated our foundation model’s capacity to achieve ab initio accuracy for specific systems through efficient finetuning.

We acknowledge that our model aims to achieve a balance between computational efficiency and accuracy. Several advanced machine learning approaches for long-range interactions may achieve higher accuracy for specific, complex systems, including higher-order charge expansion methods and simultaneous optimization of QEq parameters^32,77 in each step. However, more accurate methods incur higher computational complexity. The method that dynamically solves charge equilibration parameters at each step requires additional cost for MD simulations. It is also observed that the charge equilibration parameters do not change significantly during MD simulations for that method⁷⁷, suggesting that expanding polarization charges to higher orders while using fixed, a priori parameters may achieve higher accuracy with tolerable computational overheads. Additionally, several approaches that do not require a charge equilibration process provide valuable insights. Latent Ewald Summation²⁴ captures long-range interactions by learning hidden variables from local descriptors and applying Ewald summation to these variables. LSR-MP⁷⁸ employs a fragmentation-based approach with hierarchical message passing between atoms and fragments to learn the long-range interactions. NeuralP³M⁷⁹ introduces mesh points alongside atoms to discretize space and capture long-range effects. SO3krates⁸⁰ utilizes spherical harmonic coordinates and equivariant attention for interactions at arbitrary length scales. LOREM⁸¹ models charges as equivariant tensors to capture orientation-dependent interactions beyond the standard cutoff radius. In contrast to these approaches, our foundation model explicitly simulates charge redistribution through a physics-driven framework that maintains a transparent connection to the underlying physical principles of charge polarization and electrostatic interactions. This design choice enhances interpretability while preserving computational efficiency. A promising direction for future work lies in exploring strategic integrations of our framework with these complementary methodologies to develop even more comprehensive and powerful foundation models.

Methods

Model training details

To train the foundation equivariance neural network potential, we used the MPtrj dataset¹⁶ sourced from Materials Projects⁴⁸ as the training dataset. All configurations were calculated using DFT with the PBE⁸²/PBE + U⁸³ exchange-correlation functional and pseudopotential basis. The PBE/PBE + U mixing compatibility correction⁸⁴ was applied to ensure energy consistency. Due to transferability issues with the Yb element⁸⁵, all data containing Yb in the MPtrj dataset were excluded. The higher-order energy terms and the electrostatic Coulomb interaction energies due to charge fluctuations should be deducted from the total potential energy. Appropriately setting the PQEq parameters is crucial for training a foundation framework. Detailed information on the fitting process and the PQEq parameters used in this work can be found in the Supplementary Note 10 and Supplementary Table 7.

For equivariant neural network training, the NequIP¹² model was utilized, incorporating six equivariant layers with node features set at 256 × 0e + 256 × 1e. Spherical harmonic basis functions were used to represent interatomic directions, denoted as 1 × 0e + 1 × 1o. Interatomic distances were described using eight Bessel basis functions, and a cut-off value of 5.0 Å was chosen for building the MLIP neighbor list. The dataset is divided into a training set, validation set, and test set in a ratio of 8:1:1.

The total loss function (${{{\mathcal{L}}}}$) of prediction values ${{{{\bf{x}}}}}^{{{{\rm{pred}}}}}$ and reference first principle values ${{{{\bf{x}}}}}^{{{{\rm{ref}}}}}$ can be expressed as,

$$L\left({{{{\bf{x}}}}}^{{{{\rm{ref}}}}},{{{{\bf{x}}}}}^{{{{\rm{pred}}}}}\right)={\sum }_{i}^{N}{\frac{1}{N}\left({x}_{i}^{{{{\rm{ref}}}}}-{x}_{i}^{{{{\rm{pred}}}}}\right)}^{2}$$

(2)

$${{{\mathcal{L}}}}={\omega }_{E}\cdot L\left({{{{\bf{E}}}}}^{{{{\rm{ref}}}}},{{{{\bf{E}}}}}^{{{{\rm{pred}}}}}\right)+{\omega }_{F}\cdot L\left({{{{\bf{F}}}}}^{{{{\rm{ref}}}}},{{{{\bf{F}}}}}^{{{{\rm{pred}}}}}\right)+{\omega }_{S}\cdot L\left({{{{\bf{S}}}}}^{{{{\rm{ref}}}}},{{{{\bf{S}}}}}^{{{{\rm{pred}}}}}\right)$$

(3)

where energies ${{{\bf{E}}}}$, forces ${{{\bf{F}}}}$, and stresses ${{{\bf{S}}}}$ weights were set as ${\omega }_{E}=1$, ${\omega }_{F}=1$, and ${\omega }_{S}=10$ with quantity units of eV, eV$\cdot$Å⁻¹, and eV$\cdot$Å⁻³. The batch size was selected as 128, and the Adam optimizer was employed with an initial learning rate of 10^–3. All the trainable parameters were initialized with the random seed 3407. The implementation of the entire neural networks and long-range interactions was based on JAX 0.4.20⁸⁶. The training was conducted on a single NVIDIA H100-PCIe GPU, utilizing CUDA version 12.3 and Driver version 545.23.06.

Molecular dynamics simulations

The Atomic Simulation Environment (ASE)⁸⁷ was utilized as the interface for geometry relaxations and MD simulations. Structure relaxations were conducted using the limited-memory BFGS⁸⁸ method. Both atomic coordinates and cell vectors were optimized concurrently until the forces fell below the convergence threshold of 0.05 eV/Å.

The c-LLZO with a space group of ${{{\rm{Ia}}}}\bar{3}{{{\rm{d}}}}$ originated from experimental structures detailed in the reference⁵⁰. The structure was expanded to a 2 × 2 × 2 supercell. Subsequently, 180 lithium atoms were randomly placed on the 24(d) sites, and an additional 268 lithium atoms on the 96(h) sites. After relaxation, the structure served as the starting point for MD simulations. The isothermal isobaric (NpT) ensemble simulations were performed with the temperature control via Nosé-Hoover thermostat^89,90 and pressure maintained at 1 atm using Parrinello-Rahman barostat^91,92 to determine the lattice constants at various temperatures. Then, the canonical (NVT) ensemble simulations with the Nosé-Hoover thermostat were utilized to ascertain the lithium diffusion properties within c-LLZO. A time step of 2 fs was used for all NpT and NVT simulations. In the case of NVT simulations, following a 100 ps equilibration period, 2-ns trajectories’ mean squared displacements of Li ions were employed to calculate the self-diffusion coefficients at various target temperatures. Additionally, we conducted an uncertainty analysis of the diffusion coefficients, applying the empirical error estimation method as detailed in the reference⁵¹.

To construct the phase diagram of the BaTiO₃ crystal structure, a supercell consisting of a 10 × 10 × 10 ${{{\rm{R}}}}3{{{\rm{m}}}}$ BaTiO₃ lattice with 5000 atoms was created. The crystal structure data for BaTiO₃ was sourced from the Materials Project⁴⁸. NpT simulations with the Nosé-Hoover thermostat and Parrinello-Rahman barostat were performed with a 1-fs timestep. Throughout the simulations, the pressure was consistently maintained at 1 atmosphere. To regulate temperature, the procedure began at 5 K and involved incrementing the temperature by 10 K every 20 ps. The average lattice constants and the local polarization of the unit cell, denoted as ${{{\bf{P}}}}$, were determined using data from the final 10 ps of trajectories at each temperature step. For the comprehensive calculation method of the local polarization, please refer to the Supplementary Note 12.

We utilized a 13760-atom structure consisting of a ca. 13.6-nm thick ${{{\rm{Im}}}}\bar{3}{{{\rm{m}}}}$ lithium metal anode with (100) facet, paired with a ca. 19.1 nm thick Pnma β-Li₃PS₄ electrolyte with (010) facet to illustrate the SEI formation across realistic spatial and temporal scales. The system was structured with periodic lateral dimensions measuring 3.1 nm by 2.6 nm. The unit cell of anode and electrolyte structures originated from the Materials Project⁴⁸. The entire cell was subject to periodic boundary conditions, with a 1 nm atomic layer fixed on each side along the x-direction to ensure a singular interface reaction. NpT MD simulations were conducted at a standard temperature of 300 K and a pressure of 1 atm by Nosé-Hoover thermostat and Parrinello-Rahman barostat with a timestep of 2 fs. Initial velocities for the particles were assigned randomly following the Maxwell-Boltzmann distribution, and the simulations were carried out for a duration of 4 ns.

Data availability

For datasets containing different charge states, the C₁₀H₂/C₁₀H₃⁺, Na_8/9Cl₈⁺, Ag₃^+/− and Au₂-MgO datasets are publicly available from the reference⁹³ at https://doi.org/10.24435/materialscloud:f3-yh, while BTA-Cu and BTA (H₂O)-Cu datasets are available at https://github.com/ahmad-research-group/nequip-charge. The MPtrj dataset is also publicly available from the reference⁹⁴ through https://doi.org/10.6084/m9.figshare.23713842. Detailed polarizable charge equilibration parameters are provided in the Supplementary Table 7. The trajectory data of molecular dynamics are available at https://github.com/reaxnet/reaxnet. Source data are provided with this paper.

Code availability

A JAX implementation code is available under GNU General Public License v3.0 via GitHub at https://github.com/reaxnet/reaxnet or Zenodo (https://doi.org/10.5281/zenodo.17123107⁹⁵).

References

Karplus, M. & Petsko, G. A. Molecular dynamics simulations in biology. Nature 347, 631–639 (1990).
Article ADS CAS PubMed Google Scholar
Senftle, T. P. et al. The ReaxFF reactive force-field: development, applications and future directions. npj Comput. Mater. 2, 15011 (2016).
Article ADS CAS Google Scholar
Yao, N., Chen, X., Fu, Z.-H. & Zhang, Q. Applying classical, ab initio, and machine-learning molecular dynamics simulations to the liquid electrolyte for rechargeable batteries. Chem. Rev. 122, 10970–11021 (2022).
Article CAS PubMed Google Scholar
Chen, P. J. et al. Molecular dynamics investigations of dienolate [4 + 2] reactions. J. Am. Chem. Soc. 146, 12758–12765 (2024).
Article ADS CAS PubMed PubMed Central Google Scholar
Iftimie, R., Minary, P. & Tuckerman, M. E. Ab initio molecular dynamics: concepts, recent developments, and future trends. Proc. Natl Acad. Sci. USA 102, 6654–6659 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Rappe, A. K., Casewit, C. J., Colwell, K. S., Goddard, W. A. & Skiff, W. M. UFF, a full periodic table force field for molecular mechanics and molecular dynamics simulations. J. Am. Chem. Soc. 114, 10024–10035 (1992).
Article ADS CAS Google Scholar
van Duin, A. C. T., Dasgupta, S., Lorant, F. & Goddard, W. A. ReaxFF: a reactive force field for hydrocarbons. J. Phys. Chem. A 105, 9396–9409 (2001).
Article Google Scholar
Schütt, K. et al. SchNet: a continuous-filter convolutional neural network for modeling quantum interactions. In Advances in Neural Information Processing Systems. (NeurIPS, 2017).
Schütt, K. T., Unke, O. T. & Gastegger, M. Equivariant message passing for the prediction of tensorial properties and molecular spectra. In International Conference on Machine Learning. (PMLR, 2021).
Behler, J. & Parrinello, M. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 98, 146401 (2007).
Article ADS PubMed Google Scholar
Zhang, L. et al. Deep potential molecular dynamics: a scalable model with the accuracy of quantum mechanics. Phys. Rev. Lett. 120, 143001 (2018).
Article ADS CAS PubMed Google Scholar
Batzner, S. et al. E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials. Nat. Commun. 13, 2453 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Gasteiger, J., Groß, J. & Günnemann, S. Directional message passing for molecular graphs. In International Conference on Learning Representations. (ICLR, 2020).
Batatia, I., Kovacs, D. P., Simm, G. N. C., Ortner, C. & Csanyi, G. MACE: Higher order equivariant message passing neural networks for fast and accurate force fields. In Advances in Neural Information Processing Systems. (NeurIPS, 2022).
Chen, C. & Ong, S. P. A universal graph deep learning interatomic potential for the periodic table. Nat. Comput. Sci. 2, 718–728 (2022).
Article PubMed Google Scholar
Deng, B. et al. CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling. Nat. Mach. Intell. 5, 1031–1041 (2023).
Article Google Scholar
Merchant, A. et al. Scaling deep learning for materials discovery. Nature 624, 80–85 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Batatia, I. et al. A foundation model for atomistic materials chemistry. Preprint at arXiv https://arxiv.org/abs/2401.00096 (2024).
Yang, H. et al. MatterSim: a deep learning atomistic model across elements, temperatures and pressures. Preprint at arXiv https://arxiv.org/abs/2405.04967 (2024).
Park, Y., Kim, J., Hwang, S. & Han, S. Scalable parallel algorithm for graph neural network interatomic potentials in molecular dynamics simulations. J. Chem. Theory Comput. 20, 4857–4868 (2024).
Article CAS PubMed Google Scholar
Neumann, M. et al. Orb: a fast, scalable neural network potential. Preprint at arXiv https://arxiv.org/abs/2410.22570 (2024).
Anstine, D. M. & Isayev, O. Machine learning interatomic potentials and long-range physics. J. Phys. Chem. A 127, 2417–2431 (2023).
Article CAS PubMed PubMed Central Google Scholar
Gilmer, J., Schoenholz, S. S., Riley, P. F., Vinyals, O. & Dahl, G. E. Neural message passing for quantum chemistry. In International Conference on Machine Learning. (PMLR, 2017).
Cheng, B. Latent Ewald summation for machine learning of long-range interactions. npj Comput. Mater. 11, 80 (2025).
Article ADS Google Scholar
Rappe, A. K. & Goddard, W. A. Charge equilibration for molecular dynamics simulations. J. Phys. Chem. 95, 3358 (1991).
Article ADS CAS Google Scholar
Ghasemi, S. A., Hofstetter, A., Saha, S. & Goedecker, S. Interatomic potentials for ionic systems with density functional accuracy based on charge densities obtained by a neural network. Phys. Rev. B 92, 045131 (2015).
Article ADS Google Scholar
Behler, J. Four generations of high-dimensional neural network potentials. Chem. Rev. 121, 10037–10072 (2021).
Article ADS CAS PubMed Google Scholar
Ko, T. W., Finkler, J. A., Goedecker, S. & Behler, J. A fourth-generation high-dimensional neural network potential with accurate electrostatics, including non-local charge transfer. Nat. Commun. 12, 398 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Fuchs, P., Sanocki, M. & Zavadlav, J. Learning non-local molecular interactions via equivariant local representations and charge equilibration. npj Comput. Mater. 11, 287 (2025).
Naserifar, S., Brooks, D. J., Goddard, W. A. & Cvicek, V. Polarizable charge equilibration model for predicting accurate electrostatic interactions in molecules and solids. J. Chem. Phys. 146, 124117 (2021).
Article ADS Google Scholar
Shaidu, Y., Pellegrini, F., Küçükbenli, E., Lot, R. & de Gironcoli, S. Incorporating long-range electrostatics in neural network potentials via variational charge equilibration from shortsighted ingredients. npj Comput. Mater. 10, 47 (2024).
Article ADS CAS Google Scholar
King, D. S., Kim, D., Zhong, P. & Cheng, B. Machine learning of charges and long-range interactions from energies and forces. Nat. Commun. 16, 8763 (2025).
Kwon, S., Naserifar, S., Lee, H. M. & Goddard, W. A. Polarizable charge equilibration model for transition-metal elements. J. Phys. Chem. A 122, 9350–9358 (2018).
Article CAS PubMed Google Scholar
Oppenheim, J. J., Naserifar, S. & Goddard, W. A. Extension of the polarizable charge equilibration model to higher oxidation states with applications to Ge, As, Se, Br, Sn, Sb, Te, I, Pb, Bi, Po, and At elements. J. Phys. Chem. A 122, 639–645 (2018).
Article CAS PubMed Google Scholar
Yang, M. Y., O’Mari, O., Goddard, W. A. & Vullev, V. I. How permanent are the permanent macrodipoles of anthranilamide bioinspired molecular electrets? J. Am. Chem. Soc. 146, 5162–5172 (2024).
Article ADS CAS PubMed PubMed Central Google Scholar
Naserifar, S., Chen, Y., Kwon, S., Xiao, H. & Goddard III, W. A. Artificial intelligence and QM/MM with a polarizable reactive force field for next-generation electrocatalysts. Matter 4, 195–216 (2021).
Article CAS Google Scholar
Unke, O. T. et al. SpookyNet: Learning force fields with electronic degrees of freedom and nonlocal effects. Nat. Commun. 12, 7273 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Maruf, M. U., Kim, S. & Ahmad, Z. Equivariant machine learning interatomic potentials with global charge redistribution. Preprint at arXiv https://arxiv.org/abs/2503.17949 (2025).
Kohn, W. & Sham, L. J. Self-consistent equations including exchange and correlation effects. Phys. Rev. 140, A1133 (1965).
Article ADS MathSciNet Google Scholar
Grimme, S., Antony, J., Ehrlich, S. & Krieg, H. A consistent and accurate ab initio parametrization of density functional dispersion correction (DFT-D) for the 94 elements H-Pu. J. Chem. Phys. 132, 154104 (2010).
Article ADS PubMed Google Scholar
Grimme, S., Ehrlich, S. & Goerigk, L. Effect of the damping function in dispersion corrected density functional theory. J. Comput. Chem. 32, 1456–1465 (2011).
Article ADS CAS PubMed Google Scholar
Lide, D. R. CRC Handbook of Chemistry and Physics. 102nd edn, 1624, (CRC Press, 2004).
Hirshfeld, F. L. Bonded-atom fragments for describing molecular charge densities. Theor. Chim. Acta 44, 129–138 (1977).
Article CAS Google Scholar
Delacourt, C., Poizot, P., Tarascon, J. M. & Masquelier, C. The existence of a temperature-driven solid solution in Li_xFePO₄ for 0 ≤ x ≤ 1. Nat. Mater. 4, 254–260 (2005).
Article ADS CAS Google Scholar
Yang, J. & Tse, J. S. Li ion diffusion mechanisms in LiFePO₄: an ab initio molecular dynamics study. J. Phys. Chem. A 115, 13045–13049 (2011).
Article CAS PubMed Google Scholar
Zhang, Y. & Jiang, B. Universal machine learning for the response of atomistic systems to external fields. Nat. Commun. 14, 6424 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Koski, J. P. et al. Water in an external electric field: comparing charge distribution methods using ReaxFF simulations. J. Chem. Theory Comput. 18, 580–594 (2022).
Article CAS PubMed Google Scholar
Jain, A. et al. The Materials Project: a materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013).
Article ADS Google Scholar
Yang, Y. et al. Quantum mechanical static dipole polarizabilities in the QM7b and AlphaML showcase databases. Sci. Data 6, 152 (2019).
Article PubMed PubMed Central Google Scholar
Awaka, J. et al. Crystal structure of fast lithium-ion-conducting cubic Li₇La₃Zr₂O₁₂. Chem. Lett. 40, 60–62 (2010).
Article Google Scholar
He, X., Zhu, Y., Epstein, A. & Mo, Y. Statistical variances of diffusional properties from ab initio molecular dynamics simulations. npj Comput. Mater. 4, 18 (2018).
Article ADS Google Scholar
Das, T., Merinov, B. V., Yang, M. Y. & Goddard, W. A. Structural, dynamic, and diffusion properties of a Li₆(PS)₄Cl superionic conductor from molecular dynamics simulations; prediction of a dramatically improved conductor. J. Mater. Chem. A 10, 16319–16327 (2022).
Article CAS Google Scholar
Merz, W. J. The electric and optical behavior of BaTiO₃ single-domain crystals. Phys. Rev. 76, 1221–1225 (1949).
Article ADS CAS Google Scholar
Gigli, L. et al. Thermodynamics and dielectric response of BaTiO₃ by data-driven modeling. npj Comput. Mater. 8, 209 (2022).
Article ADS CAS Google Scholar
Zhong, W., Vanderbilt, D. & Rabe, K. M. Phase transitions in BaTiO₃ from first principles. Phys. Rev. Lett. 73, 1861–1864 (1994).
Article ADS CAS PubMed Google Scholar
Mayer, F. et al. Improved description of the potential energy surface in BaTiO₃ by anharmonic phonon coupling. Phys. Rev. B 106, 064108 (2022).
Article ADS CAS Google Scholar
Zhong, W. & Vanderbilt, D. Effect of quantum fluctuations on structural phase transitions in SrTiO₃ and BaTiO₃. Phys. Rev. B 53, 5047–5050 (1996).
Article ADS CAS Google Scholar
Zhang, J. et al. Structural phase transitions and dielectric properties of BaTiO₃ from a second-principles method. Phys. Rev. B 108, 134117 (2023).
Article ADS CAS Google Scholar
Qi, Y., Liu, S., Grinberg, I. & Rappe, A. M. Atomistic description for temperature-driven phase transitions in BaTiO₃. Phys. Rev. B 94, 134308 (2016).
Article ADS Google Scholar
Tinte, S., Stachiotti, M. G., Sepliarsky, M., Migoni, R. L. & Rodriguez, C. O. Atomistic modelling of BaTiO₃ based on first-principles calculations. J. Phys. Condens. Matter 11, 9679–9690 (1999).
Article ADS CAS Google Scholar
Akbarian, D. et al. Understanding the influence of defects and surface chemistry on ferroelectric switching: a ReaxFF investigation of BaTiO₃. Phys. Chem. Chem. Phys. 21, 18240–18249 (2019).
Article CAS PubMed Google Scholar
Ouyang, X. et al. Quantum-accurate modeling of ferroelectric phase transition in perovskites from message-passing neural networks. J. Phys. Chem. C. 127, 20890–20902 (2023).
Article CAS Google Scholar
Samara, G. A. Pressure and temperature dependences of the dielectric properties of the perovskites BaTiO3 and SrTiO3. Phys. Rev. 151, 378–386 (1966).
Article ADS CAS Google Scholar
Kamaya, N. et al. A lithium superionic conductor. Nat. Mater. 10, 682–686 (2011).
Article ADS CAS PubMed Google Scholar
Mizuno, F., Hayashi, A., Tadanaga, K. & Tatsumisago, M. New, highly ion-conductive crystals precipitated from Li₂S–P₂S₅ glasses. Adv. Mater. 17, 918–921 (2005).
Article CAS Google Scholar
Luo, S. et al. Nanostructure of the interphase layer between a single li dendrite and sulfide electrolyte in all-solid-state li batteries. ACS Energy Lett. 7, 3064–3071 (2022).
Article CAS Google Scholar
Wood, K. N. et al. Operando X-ray photoelectron spectroscopy of solid electrolyte interphase formation and evolution in Li₂S-P₂S₅ solid-state electrolytes. Nat. Commun. 9, 10 (2018).
Article Google Scholar
Cheng, T., Merinov, B. V., Morozov, S. & Goddard, W. A. Quantum mechanics reactive dynamics study of solid Li-electrode/Li₆PS₅Cl-electrolyte interface. ACS Energy Lett. 2, 1454–1459 (2017).
Article CAS Google Scholar
Lepley, N. D., Holzwarth, N. A. W. & Du, Y. A. Structures, Li⁺ mobilities, and interfacial properties of solid electrolytes Li₃PS₄ and Li₃PO₄ from first principles. Phys. Rev. B 88, 104103 (2013).
Article ADS Google Scholar
Ren, F. et al. Visualizing the SEI formation between lithium metal and solid-state electrolyte. Energy Environ. Sci. 17, 2743–2752 (2024).
Article CAS Google Scholar
Liu, Z. et al. Anomalous high ionic conductivity of nanoporous β-Li₃PS₄. J. Am. Chem. Soc. 135, 975–978 (2013).
Article ADS CAS PubMed Google Scholar
Marana, N. L. et al. Stability and formation of the Li₃PS₄/Li, Li₃PS₄/Li₂S, and Li₂S/Li interfaces: a theoretical study. Langmuir 39, 18797–18806 (2023).
Article CAS PubMed PubMed Central Google Scholar
Richards, W. D., Miara, L. J., Wang, Y., Kim, J. C. & Ceder, G. Interface stability in solid-state batteries. Chem. Mater. 28, 266–273 (2016).
Article CAS Google Scholar
Zhu, Y., He, X. & Mo, Y. Origin of outstanding stability in the lithium solid electrolyte materials: insights from thermodynamic analyses based on first-principles calculations. ACS Appl. Mater. Interfaces 7, 23685–23693 (2015).
Article CAS PubMed Google Scholar
Liu, Z. Revealing reaction mechanisms of nanoconfined Li₂S: implications for lithium–sulfur batteries. Phys. Chem. Chem. Phys. 20, 11713–11721 (2018).
Article CAS PubMed Google Scholar
Jung, S. C. & Han, Y.-K. Thermodynamic and kinetic origins of lithiation-induced amorphous-to-crystalline phase transition of phosphorus. J. Phys. Chem. C. 119, 12130–12137 (2015).
Article CAS Google Scholar
Rinaldi, M., Bochkarev, A., Lysogorskiy, Y. & Drautz, R. Charge-constrained atomic cluster expansion. Phys. Rev. Materials 9, 033802 (2025).
Li, Y. et al. Long-short-range message-passing: a physics-informed framework to capture non-local interaction for scalable molecular dynamics simulation. In International Conference on Learning Representations. (ICLR, 2023).
Wang, Y. et al. NeuralP³M: A long-range interaction modeling enhancer for geometric GNNs. In Advances in Neural Information Processing Systems. (NeurIPS, 2024).
Frank, T., Unke, O. & Müller, K.-R. So3krates: equivariant attention for interactions on arbitrary length-scales in molecular systems. In Advances in Neural Information Processing Systems. (NeurIPS, 2022).
Rumiantsev, E., Langer, M. F., Sodjargal, T.-E., Ceriotti, M. & Loche, P. Learning long-range representations with equivariant messages. Preprint at arXiv https://arxiv.org/abs/2507.19382 (2025).
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865 (1996).
Article ADS CAS PubMed Google Scholar
Anisimov, V. I., Zaanen, J. & Andersen, O. K. Band theory and Mott insulators: Hubbard U instead of Stoner I. Phys. Rev. B 44, 943–954 (1991).
Article ADS CAS Google Scholar
Wang, A. et al. A framework for quantifying uncertainty in DFT energy corrections. Sci. Rep. 11, 15496 (2021).
Article CAS PubMed PubMed Central Google Scholar
Ong, S. P. et al. Python Materials Genomics (pymatgen): a robust, open-source Python library for materials analysis. Comput. Mater. Sci. 68, 314–319 (2013).
Article CAS Google Scholar
Frostig, R., Johnson, M. J. & Leary, C. Compiling machine learning programs via high-level tracing. in Syst. Mach. Learn. (2018).
Larsen, A. H. et al. The atomic simulation environment—a Python library for working with atoms. J. Phys. Condens. Matter 29, 273002 (2017).
Article Google Scholar
Liu, D. C. & Nocedal, J. On the limited memory BFGS method for large-scale optimization. Math. Program. 45, 503–528 (1989).
Article MathSciNet Google Scholar
Nosé, S. A unified formulation of the constant temperature molecular dynamics methods. J. Chem. Phys. 81, 511–519 (1984).
Article ADS Google Scholar
Hoover, W. G. Canonical dynamics: equilibrium phase–space distributions. Phys. Rev. A Gen. Phys. 31, 1695–1697 (1985).
Article ADS CAS PubMed Google Scholar
Parrinello, M. & Rahman, A. Crystal structure and pair potentials: a molecular-dynamics study. Phys. Rev. Lett. 45, 1196–1199 (1980).
Article ADS CAS Google Scholar
Parrinello, M. & Rahman, A. Polymorphic transitions in single crystals: a new molecular dynamics method. J. Appl. Phys. 52, 7182–7190 (1981).
Article ADS CAS Google Scholar
Ko, T.W., Finkler, J. A., Goedecker, S. & Behler, J. A fourth-generation high-dimensional neural network 722 potential with accurate electrostatics including non-local charge transfer. Materials Cloud Archive 723 2020.X https://doi.org/10.24435/materialscloud:f3-yh (2020).
Deng, B. Materials project trajectory (MPtrj) dataset. Figshare https://doi.org/10.6084/m9.figshare.23713842 (2023).
Gao, R. et al. ReaxNet v1.0. Zenodo https://doi.org/10.5281/zenodo.17123107 (2025).

Download references

Acknowledgments

This work was supported by Hong Kong Quantum AI Lab Limited, Air @ InnoHK of Hong Kong Government (G. C. and Z. H.), Seed Fund for Basic Research for New Staff 2022/23 (Grant No. 2201101550 (Z. H.)), the GuangDong Basic and Applied Basic Research Foundation (Grant No. 2024A1515013283 (Z. H.)), the Guangdong Shenzhen Joint Key Fund (Grant No. 2019B1515120045 (C. Y.)), and the National Natural Science Foundation of China (Grant No. 22073007 (C. Y.)).

Author information

Authors and Affiliations

Department of Chemistry, The University of Hong Kong, Pokfulam, Hong Kong SAR, China
Rongzhi Gao, GuanHua Chen & Ziyang Hu
Hong Kong Quantum AI Lab Limited, Pak Shek Kok, Hong Kong SAR, China
ChiYung Yam, Jianjun Mao, Shuguang Chen, GuanHua Chen & Ziyang Hu
Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China, Shenzhen, China
ChiYung Yam
MattVerse Limited, Pak Shek Kok, Hong Kong SAR, China
Shuguang Chen

Authors

Rongzhi Gao
View author publications
Search author on:PubMed Google Scholar
ChiYung Yam
View author publications
Search author on:PubMed Google Scholar
Jianjun Mao
View author publications
Search author on:PubMed Google Scholar
Shuguang Chen
View author publications
Search author on:PubMed Google Scholar
GuanHua Chen
View author publications
Search author on:PubMed Google Scholar
Ziyang Hu
View author publications
Search author on:PubMed Google Scholar

Contributions

R. G.: Conceptualization, investigation, code development, analysis, visualization, and writing. C. Y.: Revision, analysis, and funding acquisition. J. M.: Analysis, review, and discussion. S. C.: Analysis, review, and discussion. G. C.: Review, funding acquisition, and supervision. Z. H.: Conceptualization, analysis, revision, funding acquisition, and supervision.

Corresponding authors

Correspondence to GuanHua Chen or Ziyang Hu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Philip Loche and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Transparent Peer Review file (download PDF )

Source data

Source data (download XLSX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gao, R., Yam, C., Mao, J. et al. A foundation machine learning potential with polarizable long-range interactions for materials modelling. Nat Commun 16, 10484 (2025). https://doi.org/10.1038/s41467-025-65496-3

Download citation

Received: 29 December 2024
Accepted: 15 October 2025
Published: 25 November 2025
Version of record: 25 November 2025
DOI: https://doi.org/10.1038/s41467-025-65496-3