Abstract
Long-range interactions are essential determinants of chemical system behavior across diverse environments. We present a foundation framework that integrates explicit polarizable long-range physics with an equivariant graph neural network potential. It employs a physically motivated polarizable charge equilibration scheme that directly optimizes electrostatic interaction energies rather than partial charges. The foundation model, trained across the periodic table up to Pu, demonstrates strong performance across key materials modeling challenges. It effectively captures long-range interactions that are challenging for traditional message-passing mechanisms and accurately reproduces polarization effects under external electric fields. We have applied the model to mechanical properties, ionic diffusivity in solid-state electrolytes, ferroelectric phase transitions, and reactive dynamics at electrode-electrolyte interfaces, highlighting the model’s capacity to balance accuracy and computational efficiency. Furthermore, we show that as a foundation model, it can be efficiently finetuned to achieve high-level accuracy for specific challenging systems.
Similar content being viewed by others
Introduction
Molecular dynamics (MD) simulations are indispensable in describing phenomena at the atomic level in disciplines such as chemistry, materials science, and biology1,2,3,4. While ab initio MD (AIMD)5 simulation approaches offer unparalleled accuracy, their computational demands constrain their application to small timescales and length scales. On the other hand, classical MD (CMD)6,7 simulations offer computational efficiency, but their accuracy is contingent upon empirical parameters. Machine learning interatomic potentials (MLIPs)8,9,10,11 provide a solution that balances accuracy and computational efficiency. In particular, MLIPs based on equivariant graph neural networks, such as NequIP12, DimeNet13, and MACE14, achieve excellent performance by introducing equivariant and invariant symmetries. Moreover, universal foundation models trained on the periodic table, such as M3GNet15, CHGNet16, GNoME17, MACE-MP-018, MatterSim19, SevenNet20, and ORB21 have emerged, showing remarkable prospects for materials discovery.
While existing local MLIPs with a cut-off of around 5 Å perform well in simulating interactions within localized chemical environments, they may fail to capture long-range phenomena. This hinders their ability to understand and elucidate the behaviors of complex materials22. One approach is to implicitly incorporate the effects of long-range interactions through message-passing mechanisms23, which can extend the cut-off through layer-wise propagation. However, if certain parts of the system are disconnected on the graph, such as two molecules separated by a distance beyond the cut-off value, the message-passing scheme becomes ineffective24. A promising direction involves investigating the potential advantages of explicitly including long-range interactions in the model formulation. These behaviors encompass electrostatic interactions, involving forces between charged particles, and dispersion terms, which arise from dynamic fluctuations in electron distribution within molecules or atoms.
Various explicit models have been proposed to address the challenges associated with long-range interactions by including electrostatic interactions. In these methods, charges are usually obtained through charge equilibration (QEq)25 principles, including the Charge Equilibration via Neural Network Technique (CENT) method26, Fourth-generation High-Dimensional Neural Network Potential (4G-HDNNP)27,28, and Charge Equilibration Layer for Long-range Interactions (CELLI)29 method. Generally, electrostatic parameters were trained to minimize the difference between QEq charges and reference partial charges derived from quantum mechanics (QM) calculations. Notably, partial charges obtained from QM calculations are either derived from the direct partitioning of the molecular wave function into atomic contributions or calculated based on the analysis of physical observables. However, due to the incompleteness of basis sets and variances in the partitioning methods, properties calculated based on partial charges may not always be reliable16,30. The ambiguity in DFT-assigned charges suggests that directly learning of them may be inessential for constructing accurate interatomic potentials26,31,32. On the contrary, the polarizable charge equilibration (PQEq)30,33,34,35,36 method proposed by Naserifar et al. enhances the QEq method by using QM electrostatic interaction energies instead of partial charges as targets for evaluating interatomic potentials. Additionally, PQEq has proven to be effective in capturing polarization effects by explicitly introducing core-shell displacement. Another way to solve the partition-dependent issue is to take the dipole moment into the loss function, as done by SpookyNet37. However, it still uses point-like charges, making it behaves like QEq-based methods when dealing with polarization effects.
In this work, we introduce a framework that integrates the equivariant message-passing neural network potentials with the explicit polarizable long-range potential. We initially conducted benchmark tests on datasets containing systems with various charge states proposed by references28,38, showcasing the efficacy of our framework in handling both long-range interactions and varying charge environments. Building on these results, we extended the framework to develop a foundation model encompassing the periodic table up to Pu, while maintaining computational efficiency. We have applied this foundation model across different areas of applications, including the predictions of mechanical properties of materials, phase transitions in ferroelectric materials, and reactive molecular dynamics of solid-state batteries interface, thus leading to significant advancements in the fields of materials science and chemical simulations. The model also demonstrates transferability, accurately predicting interactions between clusters in various charge states and molecular polarizability. Furthermore, we show that the foundation model can be efficiently finetuned to achieve ab initio accuracy for specific challenging systems.
Results
Theoretical framework
Our framework, which integrates neural network potential and polarizable long-range interactions, is depicted in Fig. 1. For a given chemical system and boundary conditions, our goal is to construct a mapping from atomic coordinates \({{{{\bf{r}}}}}_{i}\) and atomic types \({Z}_{i}\) to the total potential energy \({E}_{{{{\rm{pot}}}}}\). The potential energy is expressed as the sum of the second-order expansion30 with respect to charge fluctuations and the density functional theory39 D3 (DFT-D3) van der Waals dispersion energies correction term40,41 \({E}_{{{{\rm{D}}}}3}\),
a The framework takes atomic numbers (\({Z}_{i}\)) and coordinates (\({{{{\bf{r}}}}}_{i}\)) as inputs and integrates a neural network block and an explicit polarizable long-range interactions block. For the neural network block, atomic numbers are converted into feature vectors via one-hot encoding, while the coordinates are transformed into a representation of the local environment using radial basis functions and spherical harmonics to capture the distance (\({|{{{\bf{R}}}}}_{{ij}}|\)) and directional (\({{{{\bf{R}}}}}_{{ij}}/|{{{{\bf{R}}}}}_{{ij}}|\)) information. These embeddings are processed through an equivariant graph neural network to output the scalar machine learning potential energy \({E}_{0}\). Explicit, physically-grounded calculations are performed for long-range effects. An electrostatic energy component (\({E}_{{{{\rm{PQEq}}}}}\)) is calculated using the polarizable charge equilibration (PQEq) method, and a dispersion energy component (\({E}_{{{{\rm{D}}}}3}\)) is calculated using the DFT-D3 method. The total energy is the sum of these components. Gradients of the energy yield forces and stress on the system. The long-range block also provides partial charges as an auxiliary output. b A flowchart detailing the iterative self-consistent procedure for the PQEq method. Starting with atomic core positions (\({{{{\bf{r}}}}}_{{ic}}\)), atomic numbers (\({Z}_{i}\)), and predetermined atomic parameters (electronegativities \({\chi }_{i}^{0}\), chemical hardness \({\eta }_{i}^{0}\), and spring constant \({K}_{s}^{i}\)), the method uses a core-shell Gaussian model to build and solve a system of linear equations. This loop is repeated until the partial charges (\({q}_{i}\)) and shell positions (\({{{{\bf{r}}}}}_{{is}}\)) converge, yielding the final electrostatic energy (\({E}_{{{{\rm{PQEq}}}}}\)) and the converged partial charges. c Partition of core-shell Gaussian charge model used in long-range electrostatics. The partial charge (\({q}_{i}\)) of each atom is represented as the sum of the core (\({q}_{{ic}}\)) and shell (\({q}_{{is}}\)) charge. The harmonic spring constant, \({K}_{s}^{i}\), couples the core and shell.
The zeroth-order atomic energy \({E}_{0}^{i}\) corresponds to the final layer scalar output of the equivariant graph neural networks, while the higher-order self- and interatomic Coulomb interactions represent the polarizable long-range electrostatic interactions \({E}_{{{{\rm{PQEq}}}}}\). To account for charge transfer and polarization effects, the partial atomic charge \({q}_{i}\) of atom i is the sum of the nuclear core charge \({q}_{{ic}}\) and shell charge \({q}_{{is}}\), both of which assume a Gaussian charge distribution form. The first-order coefficients \({\chi }^{0}\) are electronegativities, commonly defined as half of the sum of ionization potential (IP) and electron affinities (EA). The second-order coefficients \({\eta }^{0}\) signifies idempotential or chemical hardness, defined as \({{{\rm{IP}}}}-{{{\rm{EA}}}}\). The spring constant \({K}_{s}^{i}\) denotes the isotropic harmonic connectivity between the shell position \({{{{\bf{r}}}}}_{{is}}\) and core position \({{{{\bf{r}}}}}_{{ic}}\) (equal to \({{{{\bf{r}}}}}_{i}\)) of atom i. The Gaussian electrostatic energy is given by \(C\left({{{{\bf{r}}}}}_{{ik},{jl}}\right){q}_{{ik}}{q}_{{jl}}\), where i and j are the atomic indices, and k and l represent the core (c) or shell (s), respectively. The displacement vector is given by \({{{{\bf{r}}}}}_{{ik},{jl}}={{{{\bf{r}}}}}_{{ik}}-{{{{\bf{r}}}}}_{{jl}}\). The detailed derivation of the response to the external electric field can be found in the Supplementary Note 1. In previous works, PQEq parameters for 102 elements have been derived from experimental data or high-level QM calculations25,42, and have accurately reproduced QM electrostatic interaction energies30. In the equivariant neural networks12, the initial features are generated using a trainable one-hot embedding that operates on the atomic types. The interatomic distance of atom i and atom j, denoted as \(|{{{{\bf{R}}}}}_{{ij}}|\), is expanded by radial basis functions. Concurrently, the directional component of the interatomic vectors, expressed as \({{{{\bf{R}}}}}_{{ij}}/|{{{{\bf{R}}}}}_{{ij}}|\), is expanded by spherical harmonics functions. These features are utilized to construct the atomic graph. Then, through the equivariant message passing23 schemes, the atomic features are updated. A multilayer perceptron layer is then connected to derive the scalar output. It should be noted that the neural network potential component is specifically designed to capture energy contributions distinct from electrostatic interactions, thereby ensuring no overlap in energy accounting between different components. Subject to the conservation of the net charge and the equality of chemical potentials for all atoms, the linear equations can be used to update partial charges (\({q}_{i}\)) and shell positions (\({{{{\bf{r}}}}}_{{is}}\)). The partial charges can be obtained self-consistently in each MD step. The forces on atoms and virial stress on the cell can be calculated via automatic differentiation after partial charges are determined, as detailed in Supplementary Note 2.
Validation on diverse charge-state datasets
To evaluate the capability of our framework in capturing different charge states and long-range interactions, we validated our framework against the dataset developed by Ko et al.28 and Maruf et al.38, which encompass various charge states and charge transfer systems. They contain six distinct subsets: Ag cluster with positive and negative total charge, (Ag3+/−), Na-Cl ionic cluster with one neutral Na removed (Na8/9Cl8+), hydrogenated carbon chains in both neutral and cationic states (C10H2/C10H3+), a periodic system consisting of Au clusters adsorbed on an MgO-(001) surface, and benzotriazole interactions with Cu-(111) surfaces in dry (BTA-Cu) and aqueous (BTA (H2O)-Cu) environments. We demonstrate the advantage of integrating equivariant message-passing networks with explicitly polarizable long-range interaction models by benchmarking against both 4G-HDNNP, a local MLIP with explicit long-range interactions, and NequIP, which implicitly captures long-range effects through its message-passing mechanism. 4G-HDNNP differentiates charged systems by explicitly training DFT partial charges and incorporating them as descriptors into neural networks for short-range interactions. We employed the NequIP models trained in the reference38 (Maruf’s NequIP) as our baseline equivariant model. Our framework explicitly incorporates long-range interactions while adopts Maruf’s NequIP as the E0 component without touching its settings, thereby ensuring a fair comparative analysis. The results are presented in Table 1, with detailed neural network architectural specifications provided in the Supplementary Table 3.
For the carbon chains, the introduction of additional H+ causes some C atoms to exhibit opposite charge states43, as shown in Supplementary Fig. 1a. Although we did not fit partial charges, our model qualitatively captures the distribution of DFT partial charges, as shown also in Supplementary Fig. 1a. In the linear C10H2 molecule, our model demonstrates symmetric charge distribution around the center which shows agreement with DFT. After explicitly incorporating physical long-range interactions, our framework achieved additional improvements in error metrics compared to NequIP and 4G-HDNNP. For positively charged NaCl clusters, the predicted energy and forces by our model are still more accurate compared to 4G-HDNNP, as well as the potential energy surface shown in Supplementary Fig. 1b. For Au2 adsorption on MgO surfaces and benzotriazole interactions with Cu-(111) surfaces in dry and aqueous environments, our model outperforms NequIP significantly. Additionally, our model correctly predicted the preferential adsorption geometries for both doped and undoped MgO, as listed in Supplementary Table 1. As for the charged Ag clusters, due to their small system size and lacking long-range effects, the entire structure falls within the typical cutoff radius used by the MLIP. The energy dependence on the overall charge state of clusters leads to degeneracy between atomic structures and potential energy surfaces, resulting in poor performance of the charge-independent NequIP model. We acknowledge that for such small, highly charged systems, the 4G-HDNNP approach to learning charge equilibration parameters is more convenient. Nonetheless, we also achieved reduced force error with the fixed physical parameters compared to 4G-HDNNP. Although PQEq partial charges are not directly comparable to DFT Hirshfeld ones, we compare them to show their relevance. Varying degrees of agreement across different chemical systems are observed, as shown in Supplementary Table 2.
Our comprehensive evaluation demonstrates that explicit incorporation of physical long-range interactions significantly enhances the performance of MLIPs across diverse charge-state systems. Notably, our proposed framework outperforms 4G-HDNNP in most cases for energies and forces without fitting partial charges. This advantage likely stems from a more physically meaningful partition of the total energy.
Foundation model benchmark
We trained a foundation model for all the periodic table elements up to Pu using the MPtrj dataset16 following our framework (our model), as described in the Methods section. We also trained a model maintaining the same MLIPs framework and dataset, but without the polarizable long-range interactions (w/o-lr model). The former one demonstrated mean absolute errors (MAE) of 18 meV per atom for energies, 0.065 eV\(\cdot\)Å−1 for forces, and 0.301 GPa for stresses across the test set, outperforming the w/o-lr model, as shown in Supplementary Table 4. These proved the benefit of incorporating long-range interactions in the enhancement of prediction accuracy.
Message-passing mechanisms will fail when graph connections break in the system. For instance, in dimers, when the interatomic distance exceeds the graph cut-off, the atoms no longer interact with each other. We analysed interaction energies for Na-Na and Na-Cl dimers across interatomic distances ranging from 1.8 Å to 12.0 Å. As shown in Fig. 2a, both models (our model and the w/o-lr model) trained on the MPtrj dataset accurately reproduce the equilibrium bond lengths of the two dimers. However, the w/o-lr model failed to differentiate interaction energies beyond the MLIPs cutoff distance, i.e., 5.0 Å, while our model successfully captured the DFT potential energy surface throughout the entire distance range, demonstrating its capability in describing extended ionic interactions. Moreover, the message-passing mechanism exhibits deficiencies when simulating layered materials. Taking lithium iron phosphate (LiFePO4) as a case study, we observed that the w/o-lr model predicted an irreversible phase transition, as shown in Supplementary Fig. 3. In contrast, our model correctly reproduced the experimentally44 and AIMD45 thermally stable olivine structure. These results collectively demonstrate that the explicit long-range interactions are crucial for accurately describing both ionic systems and layered materials, particularly in predicting structural stability and thermal behavior.
a The interaction energies for Na-Na (green) and Na-Cl (purple) dimers predicted by our model, the model without polarizable long-range interactions (w/o-lr model), and reference calculations using density functional theory (DFT)39. b Evaluation of polarizable interactions in water molecules. The water molecule model is oriented in the yz-plane, with oxygen and hydrogen atoms shown in red and white, respectively. The energy response curves as a function of external electric field (\({{{\mathbf{\epsilon }}}}\)) strength applied along the x-axis. Results are compared among our model (red), the Charge Equilibration (QEq)-based model25 (blue), and the reference DFT calculations (gray) serving as the reference values. c–e Comparison of bulk modulus from DFT calculations (reference values) and our model (prediction values): (c) Voigt approach (BV), (d) Reuss method (BR), and (e) Hill average (BVRH). The R-squared values (R2) and mean absolute errors (MAE) of the bulk modulus are also shown.
Polarization plays a fundamental role in determining molecular responses to external electric fields. Conventional QEq-based models have limitations in their response to an external electric field applied orthogonally to the molecular dipole. We validated this capability through a benchmark study examining a water molecule’s response to external electrostatic fields46. The water molecule was fixed in the yz-plane, and we applied varying external electric fields along the x-axis to derive energy curves, with the results presented in Fig. 2b. The model with polarizable long-range interactions demonstrates a trend consistent with the reference values obtained from DFT. Due to the external electrostatic field direction consistently being orthogonal to the molecule, the electrostatic energy in the QEq-based model (like 4G-HDNNP) is 0 and is completely independent of field strength (i.e., \({{{\mathbf{\mu }}}} \, {{\cdot }} \, {{{\mathbf{\epsilon }}}}=0\), where \({{{\mathbf{\mu }}}}\), \({{{\mathbf{\epsilon }}}}\) are dipole moment and electric field, respectively). This is because the QEq model treats atoms in the molecule as point charges or Gaussian charges, without distinguishing between core and shell components. We also compared the response of O atom partial charges across DFT Hirshfeld calculations, our model, and the conventional QEq model, as shown in Supplementary Fig. 4. The agreement between the PQEq model and DFT is observed, while the QEq model completely lacks response. This comparative analysis establishes that the explicit polarizable interactions are crucial for modeling molecular systems subject to external electric fields.
Besides the failure of response in static conditions, the QEq method may exhibit non-physical behavior in dynamics47. As illustrated in Supplementary Fig. 5, we conducted additional investigations of two water molecules under an external electric field of 0.25 V\(\cdot\)Å−1. In charge distribution models, it is essential to allow systems to form an internal electric field opposing the external field. The QEq model assumes the system behaves as a perfect conductor without penalizing charge transfer as a function of distance. This limitation of the QEq method leads to non-physical charge transfer between molecules separated by large distances47. In contrast, within the PQEq model, the individual water molecules can polarize, generating internal electric fields that counteract the external field, thereby capturing more realistic electrostatic responses in molecular systems. Under the QEq scheme, the water molecules accumulate non-zero net charges and migrate to the top and bottom of the simulation box. In contrast, our model maintains charge neutrality of the water molecules, which remain stationary despite the applied electric field, aligning with DFT results.
We conducted benchmarking using properties that were not labeled during the training process. We applied our foundation model to predict the mechanical properties of 10,154 materials from the Materials Project48. Figure 2c–e illustrates the comparison of the bulk modulus \(B\) determined by the foundation model and DFT. Our model demonstrated impressive performance, achieving an R2 of 0.94 with the Voigt approach \({B}_{{{{\rm{V}}}}}\) and Hill average method \({B}_{{{{\rm{VRH}}}}}\), and 0.93 using the Reuss method \({B}_{{{{\rm{R}}}}}\). In comparison, as shown in Supplementary Fig. 2, the w/o-lr model achieved lower R² values of 0.89, 0.86, and 0.88 for \({B}_{{{{\rm{V}}}}}\), \({B}_{{{{\rm{R}}}}}\), and \({B}_{{{{\rm{VRH}}}}}\), respectively. These not only highlight the robust performance of the model but also establish a solid foundation for the high-throughput screening of materials with exceptional mechanical properties.
Transferability of the foundation model
In terms of foundation model transferability, evaluations were conducted across diverse systems. The model accurately reproduces the potential energy surfaces and successfully distinguishes between neutral and ionic states in OH-OH systems, capturing the long-range Coulombic repulsion of ionic state as shown in Supplementary Fig. 8. Further validation using a periodic water system demonstrates the model’s capability to predict polarization effects under varying electric fields, achieving comparable performance to specialized models as depicted in Supplementary Fig. 10. Additionally, as shown in Supplementary Fig. 11, when tested on 7211 molecules from the QM-7b dataset49, the model shows remarkable accuracy in predicting molecular polarizability with a mean absolute error of 4.57 atomic units (a.u.), highlighting its robust transferability in capturing polarization.
Computational efficiency
Despite the incorporation of additional long-range interaction calculations, our foundation model maintains high computational efficiency. Performance benchmarks were conducted on a single NVIDIA H100 GPU to quantitatively assess the computational cost. As illustrated in Supplementary Fig. 6, for a periodic system comprising 2160 atoms, our model requires approximately 0.07 s per molecular dynamics time step, representing a significant improvement over conventional universal MLIPs, which require approximately 0.91 s. This computational advantage extends to larger systems, as demonstrated by simulations of a 24000-atom system, where our model maintains efficiency with only 1.49 s per time step. Such computational performance enables nanosecond-scale molecular dynamics simulations of systems containing tens of thousands of atoms, making it practical for more realistic modeling of materials.
Materials modeling applications
To demonstrate the capability of our model to reproduce results from AIMD simulations, we simulate the kinetic transport properties of a solid-state electrolyte. Specifically, we investigated lithium-ion diffusivity within the cubic phase of Li7La3Zr2O12 (c-LLZO)50, a crystalline superionic conductor known for its remarkable stability as a lithium conductor. Owing to the efficiency of our foundation model, we are able to conduct MD simulations on the c-LLZO comprising 64 formula units, with a duration of 2 ns for each temperature range from 800 K to 1800K. Figure 3a presents the Arrhenius plots derived from our model, w/o-lr model, and AIMD51 simulations. The diffusion coefficients were determined from the slope of the logarithmic mean square displacements (MSD) versus logarithmic time within the Fickian regime52. MSD curves of our model are depicted in Fig. 3b. Compared to previous work using AIMD simulations, our model clearly reproduces the diffusion coefficients and activation energy. The performance surpasses that of the w/o-lr model, which lacks explicit long-range interactions. With the ability to reach larger model sizes and longer simulation times, our model achieves these results with significantly reduced uncertainties. This capability not only delivers comparable results to AIMD simulations but also enables larger-scale simulations that cannot be assessed by AIMD, enhancing the statistical significance of MD studies. Consequently, the framework developed in this work provides more reliable and comprehensive data for diffusion analysis in solid-state electrolytes.
a Crystal structure of cubic phase \({{{\rm{Ia}}}}\bar{3}{{{\rm{d}}}}\) Li7La3Zr2O12 (c-LLZO) and Arrhenius plots depicting the lithium-ion diffusion coefficients across varying temperatures (T). The dark blue polyhedron signifies La located at the 24(c) site, and the light brown polyhedron indicates Zr at the 16(a) site. Li fraction occupies the 24(d) and 96(h) sites. Predicted diffusion coefficients (D) of ab initio molecular dynamics (AIMD)51, our model, and the model without polarizable long-range interactions (w/o-lr model) are presented to calculate activation energies (Ea). The error bars represent the standard deviation of the diffusivity, calculated based on the total number of effective ion hops observed in the MD simulation, following the methodology proposed by the reference51. b 2-ns mean square displacements (MSD) verse time (\(\tau\)) using our model of lithium-ion in c-LLZO with different temperatures ranging from 800 K to 1800 K in an increment of 200 K. The linear dashed gray lines, with a slope of 1, are also plotted.
To show the application of polarizable potentials, we performed MD simulations on a typical perovskite ferroelectric material, BaTiO3. As temperature increases, BaTiO3 undergoes a sequential phase transition process, transforming from rhombohedral to orthorhombic, then to tetragonal, and finally to a cubic structure53. Accurately simulating phase changes in ferroelectric substances requires precise potential energy functions that can respond to the small atomic shifts and structural changes, as well as account for the free energy landscape under the conditions of finite-temperature thermodynamics54. The phase sequence of BaTiO3 has been extensively studied using effective Hamiltonians55,56,57,58, CMD59,60, reactive force fields (ReaxFF)61, specialized MLIPs models trained on the BaTiO3 system54,62, and experiments63. To detect lattice distortions that distinguish different phases, a 10 × 10 ×10 supercell of BaTiO3 was simulated. The use of larger simulation cells (10 × 10 × 10 in our work versus 4 × 4 × 4 in Gigli et al.54) may provide some advantages for phase transition studies. Larger supercells can effectively reduce the impact of temperature fluctuations, enabling better temperature control for sampling the free energy landscape, thereby yielding results with greater statistical significance. Figure 4 illustrates the simulation results, which clearly identify four distinct phases and the three first-order phase transitions within the simulated temperature range. Below 145 K, the overall average polarization of the supercell aligns with the (111) direction, indicative of the rhombohedral phase. At 145 K, a phase transition to orthorhombic is suggested as the y-component of polarization approaches zero. With further increase to 205 K, the polarization predominantly orients along the x-direction, indicating the tetragonal phase. This phase persists until 285 K, where the cubic paraelectric phase is observed. The obtained phase transition temperatures align closely with those predicted by various models and experiments, as listed in Table 2. It is worth noting that the supercell size can affect phase transition temperatures, so that direct comparison between different simulations is not expected. Utilizing first principles methods or fitting potential energy surfaces to first principles data often underestimates these temperatures, which is attributed to the approximated exchange-correlation functional59 employed in DFT. Although the predicted phase transition temperatures are lower than the experimentally observed values, the model effectively captures the sequence of phase transitions as measured in experiments, demonstrating the efficacy of the foundation model in modeling highly polarized phase transitions in ferroelectric materials.
The temperature dependence of (a) lattice constants (a, b, and c in gray, red, and blue, respectively), and (b) local polarizations of unit cells in each direction (Px, Py, and Pz in gray, red, and blue, respectively) exhibit notable changes during the phase transitions observed from molecular dynamics simulations on 10 × 10 × 10 supercell of BaTiO3.
The final application studied in this work involves all-solid-state batteries, which represent a breakthrough in the evolution of next-generation energy solutions, owing to their superior energy density and inherent safety features64. Lithium thiophosphate, known for its exceptional ionic conductivity ( ~ 10−3 S cm−1), is deemed the most promising candidate for solid electrolytes and has been widely studied through experiments65,66,67 and computer modeling68,69,70. Notably, the nanoporous β-Li3PS4 has been validated to exhibit outstanding cycling stability71, presumably attributed to the formation of Li2S and Li3P solid-electrolyte interphases (SEI) during the initial battery cycles, which serves to passivate further degradation of the electrolyte69,72,73,74.
To investigate the formation of SEI, we utilized the foundation model to simulate the interfacial reactive MD of the solid electrolyte β-Li3PS4 in contact with the lithium metal anode, encompassing a system of 13,760 atoms. The initial β-Li3PS4 /Li interfacial structure, measuring 33.15 nm × 3.1 nm × 2.6 nm, was utilized in the MD simulations, as depicted in Fig. 5a. The highly reactive Li metal began to interact with the PS4 tetrahedron at the interface, triggering the formation of the SEI layer as illustrated in Fig. 5b. With the growth of the SEI layer, electrons were transferred from the Li anode to the Li3PS4 electrolyte, which causes a gradual increase of the partial charges of Li as they migrate from the anode, across the SEI, and into the electrolyte. This results in a transition from metal lithium to ions. The dynamic behavior of partial charges at the anode-electrolyte interface is a critical aspect showcased in Fig. 5c. Initially, the partial charges of lithium changed almost linearly from the anode to the electrolyte. Over time, this distribution evolved and showed a distinct plateau. This phenomenon could be attributed to the ordered structuring of the SEI, signifying the nucleation of crystalline structures. Ultimately, a stable lithium partial charge plateau was formed between the anode and the electrolyte, spanning a range of 15 to 21.5 nm. This is further supported by the visualized structure depicted in Fig. 5b, where the plateau within the SEI aligns with the Li2S crystal and amorphous regions. Chen et al.68 employed AIMD simulations to investigate the radial distribution functions (RDF) of Li-P and Li-S bonds, which concluded that the decompositions of electrolytes in lithium thiophosphates are primarily due to the decomposition of PS4 tetrahedron by the active lithium metal, leading to the formation of new Li-P and Li-S bonds. Our RDF analysis confirmed the formation of Li-S and Li-P bonds within the SEI and electrolyte as depicted in Fig. 5d. In agreement with the AIMD simulations68, we observed the emergence of a new Li-P peak within the SEI at approximately 2.5 Å. Furthermore, within the SEI, the Li-S bond displayed RDF characteristics consistent with crystalline Li₂S75. For the Li-P bonds, the peak shape closely resembles that of amorphous Li3P AIMD simulations76.
a The initial structure of β-Li3PS4 (010)/Li (100) interface. The elements are color-coded: Li in green, S in purple, and P in pink. A segment of the primary reaction zone, ranging from 8 to 26 nanometers along the x direction (Rx), is displayed. b Snapshots at different times during the MD simulations. The partial charges on lithium ions (qLi) are represented with color coding to enhance the visibility of structural transformations during the formation of the solid electrolyte interphase (SEI). The SEI layer, approximately 8.5 nm in thickness, forms after 4 ns molecular dynamics (MD) simulations, comprising an amorphous β-Li3PS4/Li2S interface ( ~ 2 nm), a crystalline Li2S layer ( ~ 4.5 nm), and an amorphous Li2S (Li3P)/Li interface layer ( ~ 2 nm) in sequence. c The distributions of the Li partial atomic charges along the x-direction at initial, 20, 100, 2000, and 4000 ps state. d The radial distribution function (g(r)) plots of Li-S and Li-P within the electrolyte and SEI layer.
As illustrated in Fig. 6a, b, the ultimate decomposition products of P form only short-range ordered structures, in contrast to S, which forms long-range ordered structures. This observation agrees with the measurements where Li2S crystals were detected but Li3P crystals were absent in cryogenic transmission electron microscopy (cryo-TEM) experiments, as noted in the reference66. To enhance the understanding of the formation mechanism, the atomic compositions of the SEI in both crystalline and amorphous Li2S regions were examined. As depicted in Fig. 6c, d, within the first 2 ns, the emergence of amorphous Li2S and Li3P regions was attributed to the swift diffusion of P and S atoms. Conversely, the formation of crystalline Li2S regions was predominantly influenced by the swift diffusion of Li atoms and the slower diffusion of P and S atoms. As the system further evolves to 4 ns, the diffusions slowdown, which also confirmed that the formation of crystalline Li2S region hindered further atomic diffusion and thus slowed the growth of the SEI. As illustrated in Fig. 5b, during the initial phase of decomposition (0–100 ps), the swift diffusion of P, S, and Li led to the formation of an interface of about 3 nm thick. However, in a later stage between 100 and 2000 ps, the formation of Li2S nuclei impeded the diffusion of P and S atoms, resulting in a period of sluggish SEI growth. Consequently, the interface experienced only a slight expansion in this period, increasing by only 1 nm. Furthermore, in the following 2 ns, there was essentially no SEI growth. The final interface structure ( ~ 8.5 nm) could be characterized by a 4.5 nm crystalline Li2S region sandwiched between two 2-nm transitional layers. The cessation of growth at this interface suggests that the crystalline Li2S and amorphous Li3P within the crystal region contribute to the stabilization of both the lithium metal anode and the electrolyte, which agrees with the theoretical predictions made by DFT calculations72.
The snapshots capture the dynamic decomposition behavior of PS4, showcasing: (a) the sulfur, and (b) the phosphorus component along the x direction (Rx). To elucidate formation mechanism of the solid electrolyte interphase (SEI) layer, atomic component analyses of the (c) crystalline, and (d) amorphous Li2S regions are compared among 4-ns (upper), 2-ns (middle), and the initial state (lower). The elements are color-coded: Li in green, S in purple, and P in pink. The x-component root mean squared displacements (RMSDx) of Li, P, and S between the initial and the 2-ns states and between 4-ns and 2-ns states are also shown.
Additionally, we studied the interface reactions between Li6PS5Cl and lithium metal, as detailed in Supplementary Note 13. As shown in Supplementary Fig. 13, at a pressure of 1 atm and a temperature of 300 K, Li6PS5Cl formed an interface of about 5-nm thick after 2 ns. This indicates that the Li6PS5Cl electrolyte exhibits a higher stability against lithium metal at room temperature compared to Li3PS4. Upon elevating the temperature to 500 K, we detected Li2S at the grain boundaries, which aligns with the cryo-TEM observations66.
Finetuning for enhanced accuracy
While our foundation model demonstrates general performance, achieving ab initio accuracy for specific systems requires finetuning. Using Na8/9Cl8+ clusters and Au2 dimers on MgO surfaces from Ko et al.28 as test cases, we performed targeted finetuning using 20% subsets of configurations and mean squared force error as the sole loss function. After finetuning, our model achieved good agreement with DFT references, accurately distinguishing potential energy surfaces for positive charged clusters and surface conditions, as shown in Supplementary Fig. 12 and Supplementary Table 6. This success demonstrates the effectiveness of the two-step approach: foundation pretraining followed by efficient finetuning, enabling quantum-accurate predictions while maintaining the model’s fundamental physical insights and transferability.
Discussion
In this study, we introduced a framework that integrates equivariant machine learning interatomic potentials with explicit long-range polarizable interactions. QEq-based models inherently struggle with atomic polarization because they do not distinguish between core and shell components. Due to their inability to handle atomic-level responses to external fields, they may exhibit non-physical behaviors in some cases, as described in the reference47. Our framework incorporates polarizable long-range interactions and accurately captures atomic polarization in the presence of external electric fields. Since our framework decomposes the total energy into electrostatic energy and remaining energies that only depend on atomic types and positions, any future machine learning techniques and/or high-order electrostatic energy expansion schemes can be readily integrated into it, empowering possible accuracy enhancement.
Our foundation model successfully reproduced DFT potential energy surfaces for Na-Cl/Na-Na dimers across extended interatomic distances, outperforming conventional message-passing MLIP beyond the cut-off regions. The model demonstrates exceptional versatility across multiple materials science challenges, including accurate prediction of thermal stability in complex layered materials, high-fidelity capture of molecular responses to external electric fields, and reliable prediction of bulk modulus. The model has been also successfully applied to study dynamic properties and reactive molecular dynamics, such as lithium-ion diffusion in solid-state electrolytes and temperature-dependent phase transitions in ferroelectric materials like BaTiO3, while also elucidating interface formation mechanisms between lithium thiophosphate electrolytes and lithium metal anodes in solid-state batteries.
Beyond these specific applications, we evaluated the model’s transferability, a critical attribute for a foundation model. The model accurately reproduces the potential energy surfaces for OH-OH systems, correctly distinguishing between the neutral state and the long-range Coulombic repulsion of the ionic state. The model achieved a remarkable mean absolute error of just 4.57 a.u. for molecular polarizability, highlighting its robust and generalizable learned representation of polarization physics. Finally, we demonstrated our foundation model’s capacity to achieve ab initio accuracy for specific systems through efficient finetuning.
We acknowledge that our model aims to achieve a balance between computational efficiency and accuracy. Several advanced machine learning approaches for long-range interactions may achieve higher accuracy for specific, complex systems, including higher-order charge expansion methods and simultaneous optimization of QEq parameters32,77 in each step. However, more accurate methods incur higher computational complexity. The method that dynamically solves charge equilibration parameters at each step requires additional cost for MD simulations. It is also observed that the charge equilibration parameters do not change significantly during MD simulations for that method77, suggesting that expanding polarization charges to higher orders while using fixed, a priori parameters may achieve higher accuracy with tolerable computational overheads. Additionally, several approaches that do not require a charge equilibration process provide valuable insights. Latent Ewald Summation24 captures long-range interactions by learning hidden variables from local descriptors and applying Ewald summation to these variables. LSR-MP78 employs a fragmentation-based approach with hierarchical message passing between atoms and fragments to learn the long-range interactions. NeuralP3M79 introduces mesh points alongside atoms to discretize space and capture long-range effects. SO3krates80 utilizes spherical harmonic coordinates and equivariant attention for interactions at arbitrary length scales. LOREM81 models charges as equivariant tensors to capture orientation-dependent interactions beyond the standard cutoff radius. In contrast to these approaches, our foundation model explicitly simulates charge redistribution through a physics-driven framework that maintains a transparent connection to the underlying physical principles of charge polarization and electrostatic interactions. This design choice enhances interpretability while preserving computational efficiency. A promising direction for future work lies in exploring strategic integrations of our framework with these complementary methodologies to develop even more comprehensive and powerful foundation models.
Methods
Model training details
To train the foundation equivariance neural network potential, we used the MPtrj dataset16 sourced from Materials Projects48 as the training dataset. All configurations were calculated using DFT with the PBE82/PBE + U83 exchange-correlation functional and pseudopotential basis. The PBE/PBE + U mixing compatibility correction84 was applied to ensure energy consistency. Due to transferability issues with the Yb element85, all data containing Yb in the MPtrj dataset were excluded. The higher-order energy terms and the electrostatic Coulomb interaction energies due to charge fluctuations should be deducted from the total potential energy. Appropriately setting the PQEq parameters is crucial for training a foundation framework. Detailed information on the fitting process and the PQEq parameters used in this work can be found in the Supplementary Note 10 and Supplementary Table 7.
For equivariant neural network training, the NequIP12 model was utilized, incorporating six equivariant layers with node features set at 256 × 0e + 256 × 1e. Spherical harmonic basis functions were used to represent interatomic directions, denoted as 1 × 0e + 1 × 1o. Interatomic distances were described using eight Bessel basis functions, and a cut-off value of 5.0 Å was chosen for building the MLIP neighbor list. The dataset is divided into a training set, validation set, and test set in a ratio of 8:1:1.
The total loss function (\({{{\mathcal{L}}}}\)) of prediction values \({{{{\bf{x}}}}}^{{{{\rm{pred}}}}}\) and reference first principle values \({{{{\bf{x}}}}}^{{{{\rm{ref}}}}}\) can be expressed as,
where energies \({{{\bf{E}}}}\), forces \({{{\bf{F}}}}\), and stresses \({{{\bf{S}}}}\) weights were set as \({\omega }_{E}=1\), \({\omega }_{F}=1\), and \({\omega }_{S}=10\) with quantity units of eV, eV\(\cdot\)Å−1, and eV\(\cdot\)Å−3. The batch size was selected as 128, and the Adam optimizer was employed with an initial learning rate of 10–3. All the trainable parameters were initialized with the random seed 3407. The implementation of the entire neural networks and long-range interactions was based on JAX 0.4.2086. The training was conducted on a single NVIDIA H100-PCIe GPU, utilizing CUDA version 12.3 and Driver version 545.23.06.
Molecular dynamics simulations
The Atomic Simulation Environment (ASE)87 was utilized as the interface for geometry relaxations and MD simulations. Structure relaxations were conducted using the limited-memory BFGS88 method. Both atomic coordinates and cell vectors were optimized concurrently until the forces fell below the convergence threshold of 0.05 eV/Å.
The c-LLZO with a space group of \({{{\rm{Ia}}}}\bar{3}{{{\rm{d}}}}\) originated from experimental structures detailed in the reference50. The structure was expanded to a 2 × 2 × 2 supercell. Subsequently, 180 lithium atoms were randomly placed on the 24(d) sites, and an additional 268 lithium atoms on the 96(h) sites. After relaxation, the structure served as the starting point for MD simulations. The isothermal isobaric (NpT) ensemble simulations were performed with the temperature control via Nosé-Hoover thermostat89,90 and pressure maintained at 1 atm using Parrinello-Rahman barostat91,92 to determine the lattice constants at various temperatures. Then, the canonical (NVT) ensemble simulations with the Nosé-Hoover thermostat were utilized to ascertain the lithium diffusion properties within c-LLZO. A time step of 2 fs was used for all NpT and NVT simulations. In the case of NVT simulations, following a 100 ps equilibration period, 2-ns trajectories’ mean squared displacements of Li ions were employed to calculate the self-diffusion coefficients at various target temperatures. Additionally, we conducted an uncertainty analysis of the diffusion coefficients, applying the empirical error estimation method as detailed in the reference51.
To construct the phase diagram of the BaTiO3 crystal structure, a supercell consisting of a 10 × 10 × 10 \({{{\rm{R}}}}3{{{\rm{m}}}}\) BaTiO3 lattice with 5000 atoms was created. The crystal structure data for BaTiO3 was sourced from the Materials Project48. NpT simulations with the Nosé-Hoover thermostat and Parrinello-Rahman barostat were performed with a 1-fs timestep. Throughout the simulations, the pressure was consistently maintained at 1 atmosphere. To regulate temperature, the procedure began at 5 K and involved incrementing the temperature by 10 K every 20 ps. The average lattice constants and the local polarization of the unit cell, denoted as \({{{\bf{P}}}}\), were determined using data from the final 10 ps of trajectories at each temperature step. For the comprehensive calculation method of the local polarization, please refer to the Supplementary Note 12.
We utilized a 13760-atom structure consisting of a ca. 13.6-nm thick \({{{\rm{Im}}}}\bar{3}{{{\rm{m}}}}\) lithium metal anode with (100) facet, paired with a ca. 19.1 nm thick Pnma β-Li3PS4 electrolyte with (010) facet to illustrate the SEI formation across realistic spatial and temporal scales. The system was structured with periodic lateral dimensions measuring 3.1 nm by 2.6 nm. The unit cell of anode and electrolyte structures originated from the Materials Project48. The entire cell was subject to periodic boundary conditions, with a 1 nm atomic layer fixed on each side along the x-direction to ensure a singular interface reaction. NpT MD simulations were conducted at a standard temperature of 300 K and a pressure of 1 atm by Nosé-Hoover thermostat and Parrinello-Rahman barostat with a timestep of 2 fs. Initial velocities for the particles were assigned randomly following the Maxwell-Boltzmann distribution, and the simulations were carried out for a duration of 4 ns.
Data availability
For datasets containing different charge states, the C10H2/C10H3+, Na8/9Cl8+, Ag3+/− and Au2-MgO datasets are publicly available from the reference93 at https://doi.org/10.24435/materialscloud:f3-yh, while BTA-Cu and BTA (H2O)-Cu datasets are available at https://github.com/ahmad-research-group/nequip-charge. The MPtrj dataset is also publicly available from the reference94 through https://doi.org/10.6084/m9.figshare.23713842. Detailed polarizable charge equilibration parameters are provided in the Supplementary Table 7. The trajectory data of molecular dynamics are available at https://github.com/reaxnet/reaxnet. Source data are provided with this paper.
Code availability
A JAX implementation code is available under GNU General Public License v3.0 via GitHub at https://github.com/reaxnet/reaxnet or Zenodo (https://doi.org/10.5281/zenodo.1712310795).
References
Karplus, M. & Petsko, G. A. Molecular dynamics simulations in biology. Nature 347, 631–639 (1990).
Senftle, T. P. et al. The ReaxFF reactive force-field: development, applications and future directions. npj Comput. Mater. 2, 15011 (2016).
Yao, N., Chen, X., Fu, Z.-H. & Zhang, Q. Applying classical, ab initio, and machine-learning molecular dynamics simulations to the liquid electrolyte for rechargeable batteries. Chem. Rev. 122, 10970–11021 (2022).
Chen, P. J. et al. Molecular dynamics investigations of dienolate [4 + 2] reactions. J. Am. Chem. Soc. 146, 12758–12765 (2024).
Iftimie, R., Minary, P. & Tuckerman, M. E. Ab initio molecular dynamics: concepts, recent developments, and future trends. Proc. Natl Acad. Sci. USA 102, 6654–6659 (2005).
Rappe, A. K., Casewit, C. J., Colwell, K. S., Goddard, W. A. & Skiff, W. M. UFF, a full periodic table force field for molecular mechanics and molecular dynamics simulations. J. Am. Chem. Soc. 114, 10024–10035 (1992).
van Duin, A. C. T., Dasgupta, S., Lorant, F. & Goddard, W. A. ReaxFF: a reactive force field for hydrocarbons. J. Phys. Chem. A 105, 9396–9409 (2001).
Schütt, K. et al. SchNet: a continuous-filter convolutional neural network for modeling quantum interactions. In Advances in Neural Information Processing Systems. (NeurIPS, 2017).
Schütt, K. T., Unke, O. T. & Gastegger, M. Equivariant message passing for the prediction of tensorial properties and molecular spectra. In International Conference on Machine Learning. (PMLR, 2021).
Behler, J. & Parrinello, M. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 98, 146401 (2007).
Zhang, L. et al. Deep potential molecular dynamics: a scalable model with the accuracy of quantum mechanics. Phys. Rev. Lett. 120, 143001 (2018).
Batzner, S. et al. E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials. Nat. Commun. 13, 2453 (2022).
Gasteiger, J., Groß, J. & Günnemann, S. Directional message passing for molecular graphs. In International Conference on Learning Representations. (ICLR, 2020).
Batatia, I., Kovacs, D. P., Simm, G. N. C., Ortner, C. & Csanyi, G. MACE: Higher order equivariant message passing neural networks for fast and accurate force fields. In Advances in Neural Information Processing Systems. (NeurIPS, 2022).
Chen, C. & Ong, S. P. A universal graph deep learning interatomic potential for the periodic table. Nat. Comput. Sci. 2, 718–728 (2022).
Deng, B. et al. CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling. Nat. Mach. Intell. 5, 1031–1041 (2023).
Merchant, A. et al. Scaling deep learning for materials discovery. Nature 624, 80–85 (2023).
Batatia, I. et al. A foundation model for atomistic materials chemistry. Preprint at arXiv https://arxiv.org/abs/2401.00096 (2024).
Yang, H. et al. MatterSim: a deep learning atomistic model across elements, temperatures and pressures. Preprint at arXiv https://arxiv.org/abs/2405.04967 (2024).
Park, Y., Kim, J., Hwang, S. & Han, S. Scalable parallel algorithm for graph neural network interatomic potentials in molecular dynamics simulations. J. Chem. Theory Comput. 20, 4857–4868 (2024).
Neumann, M. et al. Orb: a fast, scalable neural network potential. Preprint at arXiv https://arxiv.org/abs/2410.22570 (2024).
Anstine, D. M. & Isayev, O. Machine learning interatomic potentials and long-range physics. J. Phys. Chem. A 127, 2417–2431 (2023).
Gilmer, J., Schoenholz, S. S., Riley, P. F., Vinyals, O. & Dahl, G. E. Neural message passing for quantum chemistry. In International Conference on Machine Learning. (PMLR, 2017).
Cheng, B. Latent Ewald summation for machine learning of long-range interactions. npj Comput. Mater. 11, 80 (2025).
Rappe, A. K. & Goddard, W. A. Charge equilibration for molecular dynamics simulations. J. Phys. Chem. 95, 3358 (1991).
Ghasemi, S. A., Hofstetter, A., Saha, S. & Goedecker, S. Interatomic potentials for ionic systems with density functional accuracy based on charge densities obtained by a neural network. Phys. Rev. B 92, 045131 (2015).
Behler, J. Four generations of high-dimensional neural network potentials. Chem. Rev. 121, 10037–10072 (2021).
Ko, T. W., Finkler, J. A., Goedecker, S. & Behler, J. A fourth-generation high-dimensional neural network potential with accurate electrostatics, including non-local charge transfer. Nat. Commun. 12, 398 (2021).
Fuchs, P., Sanocki, M. & Zavadlav, J. Learning non-local molecular interactions via equivariant local representations and charge equilibration. npj Comput. Mater. 11, 287 (2025).
Naserifar, S., Brooks, D. J., Goddard, W. A. & Cvicek, V. Polarizable charge equilibration model for predicting accurate electrostatic interactions in molecules and solids. J. Chem. Phys. 146, 124117 (2021).
Shaidu, Y., Pellegrini, F., Küçükbenli, E., Lot, R. & de Gironcoli, S. Incorporating long-range electrostatics in neural network potentials via variational charge equilibration from shortsighted ingredients. npj Comput. Mater. 10, 47 (2024).
King, D. S., Kim, D., Zhong, P. & Cheng, B. Machine learning of charges and long-range interactions from energies and forces. Nat. Commun. 16, 8763 (2025).
Kwon, S., Naserifar, S., Lee, H. M. & Goddard, W. A. Polarizable charge equilibration model for transition-metal elements. J. Phys. Chem. A 122, 9350–9358 (2018).
Oppenheim, J. J., Naserifar, S. & Goddard, W. A. Extension of the polarizable charge equilibration model to higher oxidation states with applications to Ge, As, Se, Br, Sn, Sb, Te, I, Pb, Bi, Po, and At elements. J. Phys. Chem. A 122, 639–645 (2018).
Yang, M. Y., O’Mari, O., Goddard, W. A. & Vullev, V. I. How permanent are the permanent macrodipoles of anthranilamide bioinspired molecular electrets? J. Am. Chem. Soc. 146, 5162–5172 (2024).
Naserifar, S., Chen, Y., Kwon, S., Xiao, H. & Goddard III, W. A. Artificial intelligence and QM/MM with a polarizable reactive force field for next-generation electrocatalysts. Matter 4, 195–216 (2021).
Unke, O. T. et al. SpookyNet: Learning force fields with electronic degrees of freedom and nonlocal effects. Nat. Commun. 12, 7273 (2021).
Maruf, M. U., Kim, S. & Ahmad, Z. Equivariant machine learning interatomic potentials with global charge redistribution. Preprint at arXiv https://arxiv.org/abs/2503.17949 (2025).
Kohn, W. & Sham, L. J. Self-consistent equations including exchange and correlation effects. Phys. Rev. 140, A1133 (1965).
Grimme, S., Antony, J., Ehrlich, S. & Krieg, H. A consistent and accurate ab initio parametrization of density functional dispersion correction (DFT-D) for the 94 elements H-Pu. J. Chem. Phys. 132, 154104 (2010).
Grimme, S., Ehrlich, S. & Goerigk, L. Effect of the damping function in dispersion corrected density functional theory. J. Comput. Chem. 32, 1456–1465 (2011).
Lide, D. R. CRC Handbook of Chemistry and Physics. 102nd edn, 1624, (CRC Press, 2004).
Hirshfeld, F. L. Bonded-atom fragments for describing molecular charge densities. Theor. Chim. Acta 44, 129–138 (1977).
Delacourt, C., Poizot, P., Tarascon, J. M. & Masquelier, C. The existence of a temperature-driven solid solution in LixFePO4 for 0 ≤ x ≤ 1. Nat. Mater. 4, 254–260 (2005).
Yang, J. & Tse, J. S. Li ion diffusion mechanisms in LiFePO4: an ab initio molecular dynamics study. J. Phys. Chem. A 115, 13045–13049 (2011).
Zhang, Y. & Jiang, B. Universal machine learning for the response of atomistic systems to external fields. Nat. Commun. 14, 6424 (2023).
Koski, J. P. et al. Water in an external electric field: comparing charge distribution methods using ReaxFF simulations. J. Chem. Theory Comput. 18, 580–594 (2022).
Jain, A. et al. The Materials Project: a materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013).
Yang, Y. et al. Quantum mechanical static dipole polarizabilities in the QM7b and AlphaML showcase databases. Sci. Data 6, 152 (2019).
Awaka, J. et al. Crystal structure of fast lithium-ion-conducting cubic Li7La3Zr2O12. Chem. Lett. 40, 60–62 (2010).
He, X., Zhu, Y., Epstein, A. & Mo, Y. Statistical variances of diffusional properties from ab initio molecular dynamics simulations. npj Comput. Mater. 4, 18 (2018).
Das, T., Merinov, B. V., Yang, M. Y. & Goddard, W. A. Structural, dynamic, and diffusion properties of a Li6(PS)4Cl superionic conductor from molecular dynamics simulations; prediction of a dramatically improved conductor. J. Mater. Chem. A 10, 16319–16327 (2022).
Merz, W. J. The electric and optical behavior of BaTiO3 single-domain crystals. Phys. Rev. 76, 1221–1225 (1949).
Gigli, L. et al. Thermodynamics and dielectric response of BaTiO3 by data-driven modeling. npj Comput. Mater. 8, 209 (2022).
Zhong, W., Vanderbilt, D. & Rabe, K. M. Phase transitions in BaTiO3 from first principles. Phys. Rev. Lett. 73, 1861–1864 (1994).
Mayer, F. et al. Improved description of the potential energy surface in BaTiO3 by anharmonic phonon coupling. Phys. Rev. B 106, 064108 (2022).
Zhong, W. & Vanderbilt, D. Effect of quantum fluctuations on structural phase transitions in SrTiO3 and BaTiO3. Phys. Rev. B 53, 5047–5050 (1996).
Zhang, J. et al. Structural phase transitions and dielectric properties of BaTiO3 from a second-principles method. Phys. Rev. B 108, 134117 (2023).
Qi, Y., Liu, S., Grinberg, I. & Rappe, A. M. Atomistic description for temperature-driven phase transitions in BaTiO3. Phys. Rev. B 94, 134308 (2016).
Tinte, S., Stachiotti, M. G., Sepliarsky, M., Migoni, R. L. & Rodriguez, C. O. Atomistic modelling of BaTiO3 based on first-principles calculations. J. Phys. Condens. Matter 11, 9679–9690 (1999).
Akbarian, D. et al. Understanding the influence of defects and surface chemistry on ferroelectric switching: a ReaxFF investigation of BaTiO3. Phys. Chem. Chem. Phys. 21, 18240–18249 (2019).
Ouyang, X. et al. Quantum-accurate modeling of ferroelectric phase transition in perovskites from message-passing neural networks. J. Phys. Chem. C. 127, 20890–20902 (2023).
Samara, G. A. Pressure and temperature dependences of the dielectric properties of the perovskites BaTiO3 and SrTiO3. Phys. Rev. 151, 378–386 (1966).
Kamaya, N. et al. A lithium superionic conductor. Nat. Mater. 10, 682–686 (2011).
Mizuno, F., Hayashi, A., Tadanaga, K. & Tatsumisago, M. New, highly ion-conductive crystals precipitated from Li2S–P2S5 glasses. Adv. Mater. 17, 918–921 (2005).
Luo, S. et al. Nanostructure of the interphase layer between a single li dendrite and sulfide electrolyte in all-solid-state li batteries. ACS Energy Lett. 7, 3064–3071 (2022).
Wood, K. N. et al. Operando X-ray photoelectron spectroscopy of solid electrolyte interphase formation and evolution in Li2S-P2S5 solid-state electrolytes. Nat. Commun. 9, 10 (2018).
Cheng, T., Merinov, B. V., Morozov, S. & Goddard, W. A. Quantum mechanics reactive dynamics study of solid Li-electrode/Li6PS5Cl-electrolyte interface. ACS Energy Lett. 2, 1454–1459 (2017).
Lepley, N. D., Holzwarth, N. A. W. & Du, Y. A. Structures, Li+ mobilities, and interfacial properties of solid electrolytes Li3PS4 and Li3PO4 from first principles. Phys. Rev. B 88, 104103 (2013).
Ren, F. et al. Visualizing the SEI formation between lithium metal and solid-state electrolyte. Energy Environ. Sci. 17, 2743–2752 (2024).
Liu, Z. et al. Anomalous high ionic conductivity of nanoporous β-Li3PS4. J. Am. Chem. Soc. 135, 975–978 (2013).
Marana, N. L. et al. Stability and formation of the Li3PS4/Li, Li3PS4/Li2S, and Li2S/Li interfaces: a theoretical study. Langmuir 39, 18797–18806 (2023).
Richards, W. D., Miara, L. J., Wang, Y., Kim, J. C. & Ceder, G. Interface stability in solid-state batteries. Chem. Mater. 28, 266–273 (2016).
Zhu, Y., He, X. & Mo, Y. Origin of outstanding stability in the lithium solid electrolyte materials: insights from thermodynamic analyses based on first-principles calculations. ACS Appl. Mater. Interfaces 7, 23685–23693 (2015).
Liu, Z. Revealing reaction mechanisms of nanoconfined Li2S: implications for lithium–sulfur batteries. Phys. Chem. Chem. Phys. 20, 11713–11721 (2018).
Jung, S. C. & Han, Y.-K. Thermodynamic and kinetic origins of lithiation-induced amorphous-to-crystalline phase transition of phosphorus. J. Phys. Chem. C. 119, 12130–12137 (2015).
Rinaldi, M., Bochkarev, A., Lysogorskiy, Y. & Drautz, R. Charge-constrained atomic cluster expansion. Phys. Rev. Materials 9, 033802 (2025).
Li, Y. et al. Long-short-range message-passing: a physics-informed framework to capture non-local interaction for scalable molecular dynamics simulation. In International Conference on Learning Representations. (ICLR, 2023).
Wang, Y. et al. NeuralP3M: A long-range interaction modeling enhancer for geometric GNNs. In Advances in Neural Information Processing Systems. (NeurIPS, 2024).
Frank, T., Unke, O. & Müller, K.-R. So3krates: equivariant attention for interactions on arbitrary length-scales in molecular systems. In Advances in Neural Information Processing Systems. (NeurIPS, 2022).
Rumiantsev, E., Langer, M. F., Sodjargal, T.-E., Ceriotti, M. & Loche, P. Learning long-range representations with equivariant messages. Preprint at arXiv https://arxiv.org/abs/2507.19382 (2025).
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865 (1996).
Anisimov, V. I., Zaanen, J. & Andersen, O. K. Band theory and Mott insulators: Hubbard U instead of Stoner I. Phys. Rev. B 44, 943–954 (1991).
Wang, A. et al. A framework for quantifying uncertainty in DFT energy corrections. Sci. Rep. 11, 15496 (2021).
Ong, S. P. et al. Python Materials Genomics (pymatgen): a robust, open-source Python library for materials analysis. Comput. Mater. Sci. 68, 314–319 (2013).
Frostig, R., Johnson, M. J. & Leary, C. Compiling machine learning programs via high-level tracing. in Syst. Mach. Learn. (2018).
Larsen, A. H. et al. The atomic simulation environment—a Python library for working with atoms. J. Phys. Condens. Matter 29, 273002 (2017).
Liu, D. C. & Nocedal, J. On the limited memory BFGS method for large-scale optimization. Math. Program. 45, 503–528 (1989).
Nosé, S. A unified formulation of the constant temperature molecular dynamics methods. J. Chem. Phys. 81, 511–519 (1984).
Hoover, W. G. Canonical dynamics: equilibrium phase–space distributions. Phys. Rev. A Gen. Phys. 31, 1695–1697 (1985).
Parrinello, M. & Rahman, A. Crystal structure and pair potentials: a molecular-dynamics study. Phys. Rev. Lett. 45, 1196–1199 (1980).
Parrinello, M. & Rahman, A. Polymorphic transitions in single crystals: a new molecular dynamics method. J. Appl. Phys. 52, 7182–7190 (1981).
Ko, T.W., Finkler, J. A., Goedecker, S. & Behler, J. A fourth-generation high-dimensional neural network 722 potential with accurate electrostatics including non-local charge transfer. Materials Cloud Archive 723 2020.X https://doi.org/10.24435/materialscloud:f3-yh (2020).
Deng, B. Materials project trajectory (MPtrj) dataset. Figshare https://doi.org/10.6084/m9.figshare.23713842 (2023).
Gao, R. et al. ReaxNet v1.0. Zenodo https://doi.org/10.5281/zenodo.17123107 (2025).
Acknowledgments
This work was supported by Hong Kong Quantum AI Lab Limited, Air @ InnoHK of Hong Kong Government (G. C. and Z. H.), Seed Fund for Basic Research for New Staff 2022/23 (Grant No. 2201101550 (Z. H.)), the GuangDong Basic and Applied Basic Research Foundation (Grant No. 2024A1515013283 (Z. H.)), the Guangdong Shenzhen Joint Key Fund (Grant No. 2019B1515120045 (C. Y.)), and the National Natural Science Foundation of China (Grant No. 22073007 (C. Y.)).
Author information
Authors and Affiliations
Contributions
R. G.: Conceptualization, investigation, code development, analysis, visualization, and writing. C. Y.: Revision, analysis, and funding acquisition. J. M.: Analysis, review, and discussion. S. C.: Analysis, review, and discussion. G. C.: Review, funding acquisition, and supervision. Z. H.: Conceptualization, analysis, revision, funding acquisition, and supervision.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Philip Loche and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Gao, R., Yam, C., Mao, J. et al. A foundation machine learning potential with polarizable long-range interactions for materials modelling. Nat Commun 16, 10484 (2025). https://doi.org/10.1038/s41467-025-65496-3
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41467-025-65496-3








