A pre-trained deep potential model for sulfide solid electrolytes with broad coverage and high accuracy

Wang, Ruoyu; Guo, Mingyu; Gao, Yuxiang; Wang, Xiaoxu; Zhang, Yuzhi; Deng, Bin; Shi, Mengchao; Zhang, Linfeng; Zhong, Zhicheng

doi:10.1038/s41524-025-01764-6

Download PDF

Article
Open access
Published: 20 August 2025

A pre-trained deep potential model for sulfide solid electrolytes with broad coverage and high accuracy

Ruoyu Wang^1,2,3^na1,
Mingyu Guo^4,5^na1,
Yuxiang Gao^1,2,
Xiaoxu Wang^4,6,
Yuzhi Zhang^4,6,
Bin Deng⁴,
Mengchao Shi^3,4,
Linfeng Zhang^4,6 &
…
Zhicheng Zhong^1,2,3

npj Computational Materials volume 11, Article number: 266 (2025) Cite this article

1953 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Solid electrolytes with fast ion transport are crucial for solid state lithium metal batteries. Chemical doping has been the most effective strategy for improving ion condictiviy, and atomistic simulation with machine-learning potentials helps optimize doping by predicting ion conductivity for various composition. Yet most existing machine-learning models are trained on narrow chemistry, requiring retraining for each new system, which wastes transferable knowledge and incurs significant cost. Here, we propose a pre-trained deep potential model purpose-built for sulfide solid electrolytes with attention mechanism, known as DPA-SSE. The training set includes 15 elements and consists of both equilibrium and extensive out-of-equilibrium configurations. DPA-SSE achieves a high energy resolution of less than 2 meV/atom for dynamical trajectories up to 1150 K, and reproduces experimental ion conductivity with remarkable accuracy. DPA-SSE generalizes well to complex electrolytes with mixes of cation and anion atoms, and enables highly efficient dynamical simulation via model distillation. DPA-SSE also serves as a platform for continuous learning and can be fine-tuned with minimal downstream data. These results demonstrate the possibility of a new pathway for the AI-driven development of solid electrolytes with exceptional performance.

Computational screening of sodium solid electrolytes through unsupervised learning

Article Open access 20 September 2024

Disorder-induced enhancement of lithium-ion transport in solid-state electrolytes

Article Open access 26 January 2025

High-throughput investigation of stability and Li diffusion of doped solid electrolytes via neural network potential without configurational knowledge

Article Open access 21 May 2024

Introduction

Solid-state batteries are poised to revolutionize the energy storage industry due to their potential for higher energy density, excellent safety and improved sustainability, which are critical for addressing the limitations of current lithium-ion batteries^1,2. Solid-state electrolytes is the key to the solid state batteries, and hence has attracted tremendous research interest^{3,4,5,6,7,8,9,10}. However, solid electrolytes still face critical challenges, particularly the low ion conductivity compared with liquid electrolytes. In this regard, sulfide electrolytes (e.g., LGPS-like and argyrodite) with fast ion transport are promising candidates for commercial application^11,12. Usually, the ion conductivity of base electrolyte can be further improved by chemical doping (i.e., element substitution), which has been one of the most effective strategies for electrolyte optimization towards high-energy storage^3,13. With proper doping, an optimal lithium ion concentration with maximum ion conductivity can be achieved when there are both abundant diffusing ions and vacant hopping sites. High entropy materials with extensive doping also introduce local structure distortion, which disturbs the ion packing and removes the degeneracy of hopping barriers^3,14. For instance, heavily doped sulfide electrolytes, such as the Li_9.54Si_1.74P_1.44S_11.7Cl_0.3¹¹, the Li_9.54 [Si_0.6Ge_0.4]_1.74P_1.44S_11.1Br_0.3O_0.6⁷, etc., achieve much higher ion conductivity than their base compounds. Therefore, material optimization with doping engineering is an important strategy to achieve the high-performance solid electrolytes.

Traditionally, wide chemical space has to be explored experimentally to find the optimized composition, which is both costly and time-consuming. Atomistic simulation of electrolyte materials with atomic details can significantly accelerate this process by predicting ion conductivity of arbitrary compositions¹⁵. Atomistic simulation involves force field, which can be constructed either by fitting a physically informed model to experiments, or by first-principle quantum theory calculation. Empirical potentials, while being fast and efficient, are limited by their accuracy and generalizability across diverse material configurations and wide temperature ranges. Ab initio calculation achieves high accuracy, but the computational cost is exceedingly high. The simulation is usually limited to a few hundred atoms and several nanoseconds, insufficient for reliable simulation of solid electrolytes¹⁵.

Machine learning force fields (MLFF), such as DeePMD^16,17, etc., enable ab initio accuracy at the computational cost of empirical force fields^18,19,20. Trained over small scale ab initio calculations, the computation cost of MLFF scales linearly with atom count, supporting large-scale simulation. Previous works have shown MLFF can predict ion conductivity in solid electrolytes, a task previously intractable for empirical or ab initio methods^21,22. However, conventional MLFFs lack generalizability across diverse chemical and configurational space, requiring costly retraining for each system, especially for complex doped electrolytes²³. Universal force fields pre-trained across the periodic table, such as the DPA-2²⁴, MACE²⁵, M3GNet²⁶, CHGNet²⁷, et al.^23,28,29, are highly generalizable. However, these universal models are primarily trained on datasets consisting of equilibrium or near-equilibrium configurations. As a result, current universal force fields usually struggle to accurately predict out-of-equilibrium states critical to ionic transport, such as Li-hopping transition states, and lack the quantitative accuracy required for electrolytes simulation. To address this, it is imperative to develop a domain-specific model scheme which is highly accurate while retaining good generalizability within solid electrolyte chemistry.

Here, we introduce DPA-SSE, a domain-specific pre-trained deep potential model with attention mechanism that is specifically designed for sulfide electrolytes simulation. The DPA-SSE model includes 41 unique systems, among which are 26 sulfide compounds, thereby providing a solid platform for the study of sulfide electrolytes and beyond. Trained in extensive out-of-equilibrium configurations, the model accurately predicts the energy and atomic forces of sulfide electrolytes along dynamic trajectories, reproducing ion transport to ab initio accuracy. The DPA-SSE exhibits good generalizability. It accurately predicts solid solutions of arbitrary concentration, an important merit for sulfide electrolyte optimization. The pre-trained model can also be fine-tuned with minimal training data for downstream tasks. The fine-tuned model achieves similar performance with ten times fewer training data than the models trained from scratch. Moreover, a distillation scheme that generates simpler models to alleviate the high cost associated with pre-trained universal models in dynamic simulation is proposed. Combining high precision and good generalizability, the DPA-SSE could aid in the development of better sulfide electrolytes.

Results

Model overview

Figure1 provides a graphical representation of the training sets for the DPA-SSE model, as well as an overview of the model workflow. The training sets include three major types of sulfides electrolytes, namely, the LGPS-like compounds (Li₁₀XY₂S₁₂; X = Ge, Sn, Si; Y = P, Ga, Sb, As), argyrodites (Li₆PS₅X; X = Cl, Br, I) and Li₇P₃S₁₁. For solid batteries, the ability to operate at elevated temperatures is highly desirable, and MD simulations provide critical information regarding the thermal stability of solid electrolytes. To this end, the decomposition products of sulfide SEs are also required. By thermodynamic analysis, LGPS-like compounds decompose into Li₂S, Y₂S₅ and XS₂, while X = Ge, Sn, Si and Y = P, Ga, Sb, As; likewise, argyrodites decomposes to Li₂S, P₂S₅ and LiX (X = Cl, Br, I). These compounds, as well as the lithium metal, are included in the training sets. In general, the training sets cover 15 elements and 41 different systems, including 26 chalcogenides, as presented in Fig. 1. DPA-SSE utilizes the DPA-2²⁴ framework, a sophisticated architecture with an attention mechanism designed for broad coverage across the periodic table and versatile applications. With the pre-trained DPA-SSE, one can perform atomic simulation of sulfide electrolytes, obtaining key properties of ion transport, thermodynamic stability, etc. If unfamiliar systems are encountered, DPA-SSE can be fine-tuned for downstream tasks with minimal additional data set by leveraging the transferable knowledge embedded within the pre-trained model. Usually, atomic simulation only involves a portion of the composition space covered by DPA-SSE. In these cases, lighter, faster DeePMD models can be generated from the pre-trained or fine-tuned DPA-SSE model for more efficient molecular dynamics simulation.

**Fig. 1: Overview of the DPA-SSE model and its training dataset.**

Model testing and performance

DPA-SSE can accurately predict the energies and forces of sulfide electrolytes of various configurations. In Fig. 2a, b, the predicted energy and force are plotted against the DFT calculation. The energy and force mean absolute error (MAE) converge to 1.58 meV/atom and 30.28 meV/Å on the test sets, respectively. To further validate DPA-SSE for the simulation of sulfide electrolytes, Fig. 2c, d test the energy and force MAE on six typical sulfide electrolytes, namely Li₁₀XP₂S₁₂ (X = Ge, Sn, Si) and Li₆PS₅X (X = Cl, Br, I). The test data is constructed by performing long DeePMD simulations heating from 150 to 1150 K and labeling the trajectories with DFT calculation. For all test systems, the energy and force MAE are within 2 meV/atom and 30 meV/Å, respectively. This high accuracy ensures reliable simulation of sulfide electrolytes across a wide temperature range. For a comprehensive evaluation, benchmark tests on these sulfide electrolyte systems are also conducted for universal force fields which include MACE, M3GNet, as well as DPA-2-MP, a pre-trained DPA-2 model branch for the MPtrj dataset. Note that only conservative force fields are listed here. Conservative means that atomic forces are derivatives of the potential energy, which ensures energy conservation in the MD simulation and is fundamental in calculating dynamical properties such as ion diffusion^30,31. As shown in Supplementary Fig. S1, the universal force fields perform reasonably well on low-temperature, near-equilibrium trajectories but have significant errors on the heating trajectories with many out-of-equilibrium configurations. A more detailed comparison of energy prediction by DPA-SSE and other universal force fields is presented in Supplementary Fig. S2. The result suggests that the energy of out-of-equilibrium configurations, such as those in the lithium-hopping event, is underestimated by universal force fields, and significantly higher ion transport may be obtained during dynamic simulations. This result shows the importance of the domain-specific DPA-SSE dataset, which includes a considerable amount of out-of-equilibrium configurations for both electrolyte materials and their subsystems. To further illustrate this point, we fine-tuned the universal force fields (MACE, M3GNet and DPA-2-MP) with DPA-SSE dataset, and test the performance of those fine-tuned models. It is worth noting that the universal force fields being tested are pre-trained with DFT dataset labeled using PBE functional, whereas the DPA-SSE dataset uses PBESol functional. Usually, the difference between PBE and PBESol data amounts to a moderate energy shift, which can be handled by updating the fitting network parameters as well as the element reference energy during fine-tuning. As shown in Fig. 2c, d, the accuracy of the fine-tuned models is significantly improved and is comparable to DPA-SSE for the dynamical trajectories of test systems. The fine-tuned M3GNet exhibits larger errors than fine-tuned MACE and DPA-2-MP, particularly in force predictions, likely due to its significantly smaller model size and less comprehensive pre-training dataset^25,26. This highlights the critical role of extensive pre-training data and high model expressiveness for effective foundation models.

**Fig. 2: Performance on test datasets.**

In addition to the benchmark tests for static energy and force prediction, DPA-SSE is also tested for lattice relaxation, as shown in Supplementary Fig. S4. The lattice constants of the relaxed structures fit well with the DFT calculations. DPA-SSE is also capable of predicting the thermal stability of sulfide electrolyte materials. In Supplementary Fig. S5, the formation energy of pseudo-binary system xLi₂S(1-x)P₂S₅ as a function of composition is presented. The three unique compositions, Li₇P₃S₁₁ (x = 0.7), Li₃PS₄ (x = 0.75) and Li₇PS₆ (x = 0.875) are all on the formation energy hull in this pseudo-binary phase diagram, which agree with DFT calculations.

Model generalizability

As mentioned in previous sections, chemical doping is a key strategy for electrolyte optimization, and atomistic simulations capable of predicting doped electrolytes may provide crucial guidance. Being generalizable is essential here, as no training set can cover the vast configuration space of solid solutions and doped compounds. To verify the generalizability of DPA-SSE, Li₁₀(Ge, Sn, Si)P₂S₁₂ (L(Ge, Sn, Si)PS) and Li₆PS₅(Cl, Br, I) (LPS(Cl, Br, I)) solid solutions with random mixes of cation and anion atoms at various concentrations are constructed. For each configuration, an room temperature AIMD trajectory was produced and added to the test set. Figure 3a, b presents the energy and force prediction in the test sets. The energy and force MAE for solid solutions are 1.56 meV/atom and 29.15 meV/Å, respectively. The insets of Fig. 3a, b also show the distribution of energy and force MAE. The accuracy for solid solutions is on par with that for standard test sets as shown in Fig. 2, suggesting good generalizability for solid solution. To reveal the origin of predictability for solid solutions, the t-distributed stochastic neighbor embedding plot (t-SNE) for the data sets is presented in Fig. 3c. The t-SNE plot projects high-dimensional data distribution onto a 2D plane, visualizing the similarity of two data points by their distance. In Fig. 3c, points representing the solid solutions sit within the regions spanned by the training sets. Therefore, the generalizability can be attributed to the efficient sampling strategy for the training set and the advanced model architecture, which learned the essential features of solid solutions from constituent compounds.

**Fig. 3: Generalizability to solid solutions.**

Model fine-tuning

As well as accurate “zero-shot" prediction for materials with complex composition, a transferable model easily adapts to downstream tasks by model fine-tuning, which requires much less training data than training from scratch. Here, we showcase the fine-tuning of DPA-SSE. 800 snapshots of a Li₂B₂S₅ (mp-29410, denoted as LBS) compound, which is not included in the training set (Supplementary Fig. S6), are collected from the DP-GEN workflow as downstream data set³². The test set is constructed by 40 frames randomly chosen from the downstream data set. Figure 4a, b presents the energy and force prediction on LBS before and after model fine-tuning. The “zero-shot" energy and force prediction MAE are 162.65 meV/atom and 369.16 meV/Å, respectively. An obvious bias can be observed for the zero-shot energy prediction. After fine-tuning with 20 randomly selected samples, the energy bias is eliminated, and the energy mean absolute error (MAE) is reduced to 3.44 meV/atom. The force prediction also improves significantly’ achieving a MAE of 89.07 meV/Å. Figure 4c, d compares the energy and force convergence of randomly initialized and pre-trained models based on the size of training sets. Normalizing the training epochs by the number of data frames, we find that after fine-tuning with 60 frames, the pre-trained model outperforms the model trained from scratch using the complete training set of 760 frames. In this example, fine-tuning significantly reduces the amount of training data required, resulting in substantial savings on data generation. For comparison, the learning rate of the DPA-2-MP universal force field fine-tuned with DPA-SSE data is also presented in Fig. 4c, d. The DPA-2-MP-ft model, which shares the same model architecture with DPA-SSE, exhibits data efficiency similar to that of the specialized DPA-SSE model. In both cases, fine-tuning with a pre-trained model greatly accelerates the training process for unfamiliar compounds by utilizing the transferable knowledge obtained during pre-training.

Nudged elastic band calculation and dynamic simulations

Large-scale dynamic simulation of solid electrolytes for a sufficiently long time can predict ion transport properties. For precise ion transport calculations, it is essential that the underlying force fields can accurately evaluate out-of-equilibrium configurations during Li⁺ ion hopping. Figure 5a, b presents the hopping barrier for the correlated Li⁺ motion along c-axis and within the ab-plane of LGPS electrolyte^33,34. The diffusion path is optimized using nudged elastic band (NEB) method with the pre-trained DPA-SSE model. Along the NEB path, the Li⁺ hopping energy is calculated as 0.28 eV along the c-axis and 0.36 eV within the ab-plane, which agrees well with the DFT result using PBESol functional. To demonstrate the accuracy of DPA-SSE, Fig. 5c compares the calculated and experimental lithium ion diffusion coefficient D of LGPS from room temperature up to 1100 K. Clearly, the diffusion coefficients calculated by the DPA-SSE simulation are very close to those of the AIMD simulation. Similar results are also obtained for other LGPS-like electrolytes, as shown in Supplementary Fig. S8. Most noticeably, DPA-SSE enables an accurate simulation of sulfide electrolytes at room temperature, which agrees reasonably well with available experimental results. This is previously intractable for AIMD simulation due to the long simulation time required for convergence and the large simulation cell to mitigate the size effect³⁵. For comparison, universal force fields (M3GNet, MACE, DPA-2-MP) fine-tuned with DPA-SSE dataset as well as their base models are also tested. It is noted that the DFT label of the three universal force fields uses PBE functional, instead of the PBESol used by DPA-SSE. To clarify potential ambiguity in simulation results due to the functional choice, lithium-hopping paths labeled with PBE calculations are presented in Fig. 5a, b. The difference in relative energy between frames in the lithium-hopping paths is very small for the two types of functional. A previous study by Huang et al. also suggests the PBE and PBESol functional only has a minor effect on the ionic transport simulation of sulfide electrolytes²². The universal force fields tend to underestimate the Li⁺ hopping barrier, which results in significantly higher Li⁺ diffusion coefficients, especially at ambient temperature³⁶. After fine-tuning, these universal force fields predict hopping barriers with much improved accuracy and achieve quantitative results of Li⁺ diffusion coefficients similar to the DPA-SSE model. This observation may suggest fine-tuning as a viable future direction for building domain-specific models.

**Fig. 5: Nudged elastic band (NEB) calculation of concerted Li transport in Li₁₀GeP₂S₁₂ (LGPS) electrolytes.**

The ion transport of sulfide electrolytes can be modified by doping. Figure 6a plots the room temperature ion conductivity of Li₁₀(Si_xGe_1−x)P₂S₁₂ solid solutions using DPA-SSE model. The result agrees reasonably well with experimental measurements. Even without external doping, there might be an intrinsic disorder in the electrolyte materials that plays a key role in ion transport. The Li⁺ ion transport in the argyrodite electrolyte LPSX (X = Cl, Br, I) has been attributed to the S²⁻/X⁻ anion site disorder³⁷. Powder X-ray diffraction analysis reveals that a portion of the halide atoms at the 4c sites randomly exchange with an equal number of S atoms at the 4a sites, as shown in the inset of Fig. 6b³⁸. Molecular dynamic simulations with DPA-SSE capture the correlation between anion site disorder and ion transport²¹. Figure 6b shows the time evolution of the mean square displacement (MSD) at room temperature of lithium ions in LPSCl with and without anion disorder. The lithium MSD converges after a few hundred picoseconds in the ordered phase; in the disordered phase, the lithium MSD increases with simulation steps, suggesting long-range ion transport. Hence, anion site disorder is essential for fast ion transport in argyrodites. Figure 6d also plots the potential energy of ordered and disordered LPSCl along the simulation trajectory. The disorder phase actually has lower potential energy, and is thus more favorable than the ordered phase. This result is in agreement with experimental synthesis of LPSCl electrolytes. Unlike LPSCl, LPSI in ordered phase exhibits lower potential energy as shown in Supplementary Fig. S8, which helps explain the significantly lower ion conductivity. The anion disorder critical to inonic transport in LPSCl electrolyte can be enhanced by substituting more S at the 4c sites with excess Cl atoms, which also introduces Li vacancies to keep stoichiometry³⁹. In Supplementary Fig. S10a, ionic conductivity of Li_6−xPS_5−xCl_1+x as a function of excess Cl doping is calculated with DPA-SSE. The result reveals the correlated effect of Li vacancy and stronger S/Cl mixing at 4c site that significantly improves ionic conductivity, which supports the experimental measurement^39,40. An alternative strategy would be to replace the excess Cl with Br, and the ionic conductivity of Li_6−xPS_5−xClBr_x calculated with DPA-SSE is presented in Supplementary Fig. S10b, along with experimental results⁴¹. The Br-doped LPSCl exhibits higher ionic conductivity than Cl doping at higher concentration, possibly due to the enhanced configurational entropy. In both examples, DPA-SSE accurately simulates the positive effect of excess halogen doping in LPSCl electrolyte.

**Fig. 6: Molecular dynamic simulation of lithium ion transport in sulfide electrolytes.**

Considering the anion site disorder in argyrodites, Fig. 6c lists room temperature ion conductivity of five sulfide electrolytes calculated with DPA-SSE, showing excellent agreement with experiments. Assuming Arrhenius relations, activation energy E_a of lithium ion diffusion can be extracted from the temperature dependence of diffusion coefficient D. The activation energy for Li⁺ hopping is plotted in Fig. 6d. Evidently, the hopping barrier ΔE estimated by DPA-SSE is very close to the experimental measurement, consistent with the NEB calculations in Fig. 5.

Model distillation

Despite the high accuracy and good generalizability, DPA-SSE is still rather expensive for large-scale dynamic simulations. Usually, the simulation systems only represent a small portion of the chemical and configuration space covered by the pre-trained DPA-SSE model. In these cases, the complex model structure associated with the generalization becomes redundant and can be trade-off for better efficiency. To address this, a knowledge distillation scheme was proposed for DPA-SSE, as demonstrated in Supplementary Fig. S12a. Firstly, thousands of randomly perturbed structures are generated. To ensure the robustness of the distilled model for dynamic simulations, we run a few short MD simulations with the DPA-SSE model from the perturbed structures for picoseconds. The resulting trajectory was further perturbed to generate more out-of-equilibrium configurations. These configurations are labeled by the DPA-SSE model that effectively acts as a “teacher”, and those with excessive atomic forces (in this case more than 10 eV/Å) are filtered out. Then a standard DeePMD model with local descriptor, the “student” model, is trained for a million steps on the collected training set. Supplementary Fig. S10b, c compares the performance of the “teacher” and “student” models on the test set constructed from a 900 K AIMD trajectory. The energy and force MAE of the distilled model are 2.10 meV/atom and 48.44 meV/Å, respectively. For comparison, the energy and force MAE of the DPA-SSE “teacher” are 1.59 meV/atom and 30.0 meV/Å, respectively. Note that in addition to error inherent in the “teacher” model, the training process of the “student” model also induces errors. Thus, the accuracy of the distilled model would inevitably be affected compared with the original DPA-SSE. However, the extent of the induced error can be controlled within the acceptable range of less than 1 meV/atom and a few dozen meV/Åby including a sufficient number of data frames in the training dataset. Provided with a reliable and accurate “teacher” model, a robust distilled model can be obtained for only slight loss in accuracy, which can be easily out-weighted by the vastly improved simulation efficiency. The inset of Supplementary Fig. S12b compares the running efficiency of the pre-trained and distilled models. The benchmark test involves running a 900 K NVT simulation of an LGPS supercell consisting of 1350 atoms, on the same GPU processing unit. The distilled model is faster than the pre-trained model by at least two orders of magnitude. Figure 7 compares the efficiency of DPA-SSE, distilled DPA-SSE and other universal force fields. The efficiency of the distilled model can be further improved by hardware-acceleration, e.g., the non von-Neumann molecular dynamic (NVNMD) proposed by Mo et al.⁴². The NVNMD is revised from the standard DeePMD model and addresses the “memory bottleneck” issue by running on purpose-built hardwares of non von-Neumann architecture.

**Fig. 7: Model efficiency for molecular dynamic simulation.**

Discussion

Ion conductivity is one of the most essential parameters of solid electrolytes. Atomistic simulation can be a valuable asset in the search for better electrolyte composition by screening candidate materials based on predicted ion conductivity. This may significantly accelerate the development process of solid electrolytes. To this end, the underlying force field of atomistic simulation must fulfill two requirements: First, it must be highly accurate even along out-of-equilibrium dynamic trajectories; secondly, it must be generalizable to doped electrolyte materials of complex composition. Traditional machine learning potentials or universal force fields trained on near-equilibrium data sets either lack generalizability or do not have sufficient accuracy. As shown in our comparative tests, the universal force fields overestimate the room temperature ion conductivity of sulfide electrolytes by more than an order of magnitude, thus falling short of the required accuracy.

In this work, we introduced the domain-specific pre-trained DPA-SSE model built for accurate simulation of sulfide electrolytes. Leveraging the extensive out-of-equilibrium training set and advanced model architecture, DPA-SSE model achieves high accuracy and good generalizability among sulfide electrolytes. We demonstrate the capability of DPA-SSE by the Li⁺ hopping barrier calculation using NEB method and accurately reproduce the experimental ion conductivity of sulfide electrolytes even with complex composition and intrinsic disorder such as the L(Si, Ge, Sn)PS and LPS(Cl, Br, I) solid solutions. In addition to direct application, DPA-SSE also serves as a basis platform for more specialized tasks. The model can be fine-tuned for downstream tasks, achieving the same performance as the model trained from scratch but with ten times less training data, as demonstrated in the example of an LBS system. When the chemical and composition space of the target electrolyte can be narrowed, the pretrained or fine-tuned DPA-SSE model can generate a faster and lighter DeePMD model through distillation. The distilled model is much more efficient in dynamic simulation as it truncates model parameters irrelevant to the target system.

Despite these advancements, further improvements can be achieved through several efforts. First, DPA-SSE encompasses only sulfide electrolytes, and the coverage and generalizability of the model may be further enhanced by incorporating other systems. Second, an automated workflow for model fine-tuning and distillation may greatly enhance the accessibility of DPA-SSE for field experts and industry users. This workflow should automatically validate the quality of fine-tuned models and the robustness of distilled models. In addition to predicting bulk electrolyte properties, interface models can also be generated on the basis of DPA-SSE. In solid batteries, the composition and structure of the interface between the electrolyte and electrode materials often differ significantly from bulk materials, which presents major challenges in ion conduction and chemical stability⁴³. Machine learning force fields may play a pivotal role in understanding interfaces in atomistic detail, but training an interface force field is very difficult due to the complexity of the interface and the requirement for extensive DFT calculations. With the fine-tune workflow, interface models built upon the pre-trained DPA-SSE may greatly improve training efficiency and accelerate the interface design process. In conclusion, DPA-SSE enables accurate large-scale atomistic simulation of sulfide electrolytes within a wide chemical and configuration space. Our model not only accelerates the optimization of sulfide electrolytes but also serves as a platform model toward widespread applications of AI-driven techniques in solid electrolyte development.

Methods

Model design

The DPA-SSE model utilizes the DPA-2²⁴ framework, a sophisticated model designed with broad coverage across the periodic table in mind. By leveraging the innovative architecture design, DPA-2 exhibits remarkable transferability with sufficient training sets. In DPA-2, a local descriptor D_i is constructed from the local structural and chemical environment for each atom, represented by a single atom channel f_i, a rotation invariant atom pair channel g_ij and a rotation equivariant channel h_ij. The single- and pair-atom representation subsequently passes through multiple transformer layers to output the local descriptor D_i which is invariant under lattice translation, rotation and atom permutation. For each type of downstream task, a neural network would fit the local descriptor to the outputs. In the case of atomistic simulation, D_i is fitted to a local energy E_i, whose summation over atomic indices gives the potential energy, E(R) = ∑_iE_i(R), where R represents atomic configuration. The atomic forces are derived from the partial differentiation of potential energy with respect to atomic position, which naturally ensures force conservation.

Fine-tune and distillation

The pre-trained DPA-SSE model can be fine-tuned for downstream tasks, wherein the atomic descriptors D_i are initialized by the pre-trained model. Thanks to the transferable knowledge, the fine-tuned model converges much faster than models trained from scratch. Due to the large number of model parameters, direct application of the fine-tuned DPA-SSE to dynamic simulation for long time steps can be inefficient, which is a common issue for universal models. Thus, a distillation strategy is devised. In this scheme, DPA-SSE model, the “teacher” model, essentially plays the role of an energy and force calculator, and a “student” model, usually a standard, lightweight DeePMD model using the local descriptor without the attention mechanism^16,24,44, is trained with a large data set labeled by the DPA-SSE model. For the distilled LGPS model reported in this work, the detailed procedure of distillation is as follows: The perturbed LGPS structures are first generated using the dpdata module, and then labeled by the pre-trained DPA-SSE model. Five perturbed structures are randomly selected as the initial structure for a short DPA-SSE molecular dynamics simulation of 2000 steps at 900 K using the NPT thermostat at ambient pressure. The time step of the molecular dynamic simulation is 1 fs. One hundred frames are extracted from each trajectory, and each frame is further perturbed to generate more configurations. In total, the training set consists of 8579 frames of LGPS system. The test set is constructed from an 900K AIMD trajectory of NVT thermostat starting from the relaxed LGPS structure. A simple schematics of the distillation procedure is presented at Supplementary Fig. S12a.

Data generation

The training sets are produced by the standard DP-GEN workflow⁴⁵. Initial configurations are constructed by sampling a short AIMD trajectory starting from a randomly perturbed structure. In subsequent training iterations, the training sets are collected through NPT simulations conducted over a temperature range of 0–1200 K and a pressure range of 0–2 GPa for extensive sampling of out-of-equilibrium configurations. Such a wide temperature and pressure range ensures extensive sampling of configurations relevant to MD simulation at heating temperature. The final training sets consist of 54,771 snapshots of various configurations with DFT energy, force and virial as labels. The labeling of training sets are carried out on VASP packages⁴⁶ utilizing projector-augmented plane-wave (PAW) pseudopotentials with PBESol exchange-correlation functional⁴⁷. The cutoff energy was set to 600 eV for plane-waves, and the convergence condition was set to 0.01 eV/Å for atomistic coordination relaxation. The Brillouin zone samplings were controlled by KSPACING in order to ensure the same k-point grid for all systems, and it is set to be 0.3.

Model training

The collected training sets are utilized to train the DPA-SSE model as mentioned before. The descriptor has two major components, a representation initializer (repinit) layer and 12 representation transformer (repformer) layers. The repinit layer consists of three hidden layers with 25, 50 and 100 neurons, respectively. The repformer layers serve to learn complex information within the latent space of data sets, enabling increasing model performance with the number of data frames. The learned descriptors are fitted to downstream tasks by a fitting network of three hidden layers each with 240 neurons. The transformer module generally implies a lower learning rate for training stability⁴⁸. A low initial learning rate of 0.001 is chosen, which decays to 3.51 × 10⁻⁸ after 12,000,000 learning steps at an interval of 60,000 steps. We utilize the DPA-2 framework for single-task training. Specifically, we set the cutoff radius for the repinit layer to 9.0 Å. The input dimension for single-atom representations is fixed at 8. For the embedding connection, we employ a multilayer perceptron (MLP) consisting of three layers with 25, 50, and 100 neurons, respectively. Both types of layers are configured with four attention heads. The fitting network is composed of a three-layer MLP, each layer containing 240 neurons. Our training strategy starts with a learning rate of 2 × 10⁻⁴, undergoing exponential decay every 10,000 steps, eventually reducing to 3 × 10⁻⁸ at the 2,000,000 step. The prefactors for energy, force, and stress are adjusted alongside the learning rate. Specifically, the energy (virial) prefactor is scaled from 0.02 (0.2) to 1, while the force prefactor is adjusted from 1000 to 1. The more detailed training parameters can be found in our previous paper²⁴, and we have made all the training sets and models available through the AIS-Square platform at here. We have created a notebook to help readers learn how to use open models for property calculation and model evaluation (https://bohrium.dp.tech/notebooks/71679486918).

The universal force fields, namely DPA-2-MP, MACE and M3GNet, are fine-tuned with the DPA-SSE dataset for 4 epochs in all cases, with 5% of frames randomly selected for validation. The DPA-2-MP (DPA-2.3.1-v3.0.0rc0) is pre-trained on a diverse dataset including MPtraj and millions of other data frames using a multi-task strategy, and it shares the same network structure with DPA-SSE. The MACE model being tested is the small version of MACE-MP-0a pre-trained on the MPtraj dataset²⁵. The M3GNet model is fine-tuned using the matgl module (M3GNet-MP-2021.2.8-PES) instead of the original implementation⁴⁹. For the learning rate test at Fig. 4, DPA-SSE, DPA-2-MP-ft and the randomly initialized DPA-2 model are all trained for 133 epochs, starting with a learning rate of 0.001, which finally decays to 3.51 × 10⁻⁸.

Ionic conductivity

The mean square displacement (MSD) of lithium ions after time period t can be calculated as \({\rm{MSD}}(t)=\frac{1}{N}\mathop{\sum }\nolimits_{i}^{N}| {r}_{i}(t)-{r}_{i}(0){| }^{2}\), where N is the number of lithium ions. Assuming Brownian motion, the diffusion coefficient D of lithium ions in 3-dimensional space can be deduced as \(D=\mathop{\lim }\nolimits_{t\to \infty }\frac{{\rm{MSD}}(t)}{6t}\). The Nernst–Einstein relation relates ionic conductivity σ at temperature T to the lithium diffusion coefficient if the correlation between ions is negligible, \(\sigma (T)=\frac{{(ze)}^{2}}{V{k}_{B}T}D\), where V the volume of simulation cell, ze the electronic charge for each lithium ion, and k_B is the Boltzmann coefficient⁵⁰. Ion conduction is a thermally activated process, and the activation energy E_a can be extrapolated by fitting the modified Arrhenius relation to the calculated ion conductivity at various temperatures, \(\sigma (T)={\sigma }_{0}{T}^{m}\exp (\frac{-{E}_{a}}{{k}_{B}T})\), with m typically equal to −1. The pre-factor σ₀ is related to ion hopping entropy, distance, as well as attempt frequency in the simple case of uncorrelated ion hopping⁴. In practice, σ₀T^m can be approximated as constant over temperatures.

Data availability

DPA-SSE model and training data are made public at AIS Square.

Code availability

A notebook that helps readers learn how to use DPA-SSE for property calculation and model evaluation is also available at Bohrium platform.

References

Janek, J. & Zeier, W. G. A solid future for battery development. Nat. Energy 1, 16141 (2016).
Article Google Scholar
Janek, J. & Zeier, W. G. Challenges in speeding up solid-state battery development. Nat. Energy 8, 230–240 (2023).
Article Google Scholar
Zhao, Q., Stalin, S., Zhao, C.-Z. & Archer, L. A. Designing solid-state electrolytes for safe, energy-dense batteries. Nat. Rev. Mater. 5, 229–252 (2020).
Article CAS Google Scholar
Famprikis, T., Canepa, P., Dawson, J. A., Islam, M. S. & Masquelier, C. Fundamentals of inorganic solid-state electrolytes for batteries. Nat. Mater. 18, 1278–1291 (2019).
Article CAS PubMed Google Scholar
Li, S. et al. Progress and perspective of ceramic/polymer composite solid electrolytes for lithium batteries. Adv. Sci. 7, 1903088 (2020).
Article CAS Google Scholar
Jun, K. et al. Lithium superionic conductors with corner-sharing frameworks. Nat. Mater. 21, 924–931 (2022).
Article CAS PubMed Google Scholar
Li, Y. et al. A lithium superionic conductor for millimeter-thick battery electrode. Science 381, 50–53 (2023).
Article CAS PubMed Google Scholar
Park, K.-H. et al. High-voltage superionic halide solid electrolytes for all-solid-state Li-ion batteries. ACS Energy Lett. 5, 533–539 (2020).
Article CAS Google Scholar
Wang, C., Liang, J., Kim, J. T. & Sun, X. Prospects of halide-based all-solid-state batteries: from material design to practical application. Sci. Adv. 8, eadc9516 (2022).
Article CAS PubMed PubMed Central Google Scholar
Tan, D. H. S. et al. Carbon-free high-loading silicon anodes enabled by sulfide solid electrolytes. Science 373, 1494–1499 (2021).
Article CAS PubMed Google Scholar
Kato, Y. et al. High-power all-solid-state batteries using sulfide superionic conductors. Nat. Energy 1, 16030 (2016).
Article CAS Google Scholar
Wenzel, S. et al. Interphase formation and degradation of charge transfer kinetics between a lithium metal anode and highly crystalline Li7P3S11 solid electrolyte. Solid State Ion. 286, 24–33 (2016).
Article CAS Google Scholar
He, B. et al. Halogen chemistry of solid electrolytes in all-solid-state batteries. Nat. Rev. Chem. https://www.nature.com/articles/s41570-023-00541-7 (2023).
Zeng, Y. et al. High-entropy mechanism to boost ionic conductivity. Science 378, 1320–1324 (2022).
Article CAS PubMed Google Scholar
Van der Ven, A., Deng, Z., Banerjee, S. & Ong, S. P. Rechargeable alkali-ion battery materials: theory and computation. Chem. Rev. 120, 6977–7019 (2020).
Article PubMed Google Scholar
Zhang, L., Han, J., Wang, H., Car, R. & E, W. Deep potential molecular dynamics: a scalable model with the accuracy of quantum mechanics. Phys. Rev. Lett. 120, 143001 (2018).
Article CAS PubMed Google Scholar
Han, J., Zhang, L., Car, R. & Weinan, E. Deep potential: a general representation of a many-body potential energy surface. Commun. Comput. Phys. 23, http://arxiv.org/abs/1707.01478 (2018).
Behler, J. & Parrinello, M. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 98, 146401 (2007).
Article PubMed Google Scholar
Bartók, A. P., Payne, M. C., Kondor, R. & Csányi, G. Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. Phys. Rev. Lett. 104, 136403 (2010).
Article PubMed Google Scholar
Noé, F., Tkatchenko, A., Müller, K.-R. & Clementi, C. Machine learning for molecular simulation. Annu. Rev. Phys. Chem. 71, 361–390 (2020).
Article PubMed Google Scholar
Lee, J. et al. Disorder-dependent Li diffusion in Li₆PS₅Cl investigated by machine learning potential. ACS Appl. Mater. Interfaces 16, 46642–46453 (2024).
Article Google Scholar
Huang, J. et al. Deep potential generation scheme and simulation protocol for the Li10GeP2S12-type superionic conductors. J. Chem. Phys. 154, 094703 (2021).
Article CAS PubMed Google Scholar
Merchant, A. et al. Scaling deep learning for materials discovery. Nature https://www.nature.com/articles/s41586-023-06735-9 (2023).
Zhang, D. et al. DPA-2: a large atomic model as a multi-task learner. npj Comput. Mater. 10, 293 (2024).
Article CAS Google Scholar
Batatia, I. et al. A foundation model for atomistic materials chemistry. Preprint at http://arxiv.org/abs/2401.00096 (2024).
Chen, C. & Ong, S. P. A universal graph deep learning interatomic potential for the periodic table. Nat. Comput. Sci. 2, 718–728 (2022).
Article PubMed Google Scholar
Deng, B. et al. CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling. Nat. Mach. Intell. 5, 1031–1041 (2023).
Article Google Scholar
Xie, F., Lu, T., Meng, S. & Liu, M. GPTFF: a high-accuracy out-of-the-box universal AI force field for arbitrary inorganic materials. Sci. Bull. 69, 3525–3532 (2024).
Article Google Scholar
Yang, H. et al. MatterSim: a deep learning atomistic model across elements, temperatures and pressures. Preprint at http://arxiv.org/abs/2405.04967 (2024).
Neumann, M. et al. Orb: a fast, scalable neural network potential. Preprint at http://arxiv.org/abs/2410.22570 (2024).
Liao, Y.-L., Wood, B., Das, A. & Smidt, T. EquiformerV2: improved equivariant transformer for scaling to higher-degree representations. Preprint at http://arxiv.org/abs/2306.12059 (2024).
Jansen, C., Küper, J. & Krebs, B. Na2B2S5 and Li2B2S5: two novel perthioborates with planar 1, 2, 4-trithia-3, 5-diborolane rings. Z. Anorganische Allg. Chem. 621, 1322–1329 (1995).
Article CAS Google Scholar
Mo, Y., Ong, S. P. & Ceder, G. First principles study of the Li ₁₀ GeP ₂ S ₁₂ lithium super ionic conductor material. Chem. Mater. 24, 15–17 (2012).
Article CAS Google Scholar
He, X., Zhu, Y. & Mo, Y. Origin of fast ion diffusion in super-ionic conductors. Nat. Commun. 8, 15893 (2017).
Article CAS PubMed PubMed Central Google Scholar
Mortazavi, B. et al. Machine-learning interatomic potentials enable first-principles multiscale modeling of lattice thermal conductivity in graphene/borophene heterostructures. Mater. Horiz. 7, 2359–2367 (2020).
Article Google Scholar
Chen, B. et al. An insight into intrinsic interfacial properties between Li metals and Li ₁₀ GeP ₂ S ₁₂ solid electrolytes. Phys. Chem. Chem. Phys. 19, 31436–31442 (2017).
Article CAS PubMed Google Scholar
Kraft, M. A. et al. Influence of lattice polarizability on the ionic conductivity in the lithium superionic argyrodites Li ₆ PS ₅ X (X = Cl, Br, I). J. Am. Chem. Soc. 139, 10909–10918 (2017).
Article CAS PubMed Google Scholar
Morgan, B. J. Mechanistic origin of superionic lithium diffusion in anion-disordered Li ₆ PS ₅X argyrodites. Chem. Mater. 33, 2004–2018 (2021).
Article CAS PubMed PubMed Central Google Scholar
Feng, X. et al. Enhanced ion conduction by enforcing structural disorder in Li-deficient argyrodites Li6-xPS5-xCl1+x. Energy Storage Mater. 30, 67–73 (2020).
Article Google Scholar
Liu, Y. et al. Inhibiting Dendrites by uniformizing microstructure of superionic lithium argyrodites for all-solid-state lithium metal batteries. Adv. Energy Mater. 14, 2400783 (2024).
Article CAS Google Scholar
Patel, S. V. et al. Tunable lithium-ion transport in mixed-halide argyrodites Li_6−x PS_5−x ClBr_x: an unusual compositional space. Chem. Mater. 33, 1435–1443 (2021).
Article CAS Google Scholar
Mo, P. et al. Accurate and efficient molecular dynamics based on machine learning and non von Neumann architecture. npj Comput. Mater. 8, 107 (2022).
Article Google Scholar
Xiao, Y. et al. Understanding interface stability in solid-state batteries. Nat. Rev. Mater. 5, 105–126 (2019).
Article Google Scholar
Zhang, D. et al. Pretraining of attention-based deep learning potential model for molecular simulation. npj. Comput. Mater. 10, 94 (2024).
Article Google Scholar
Zhang, L., Lin, D.-Y., Wang, H., Car, R. & E, W. Active learning of uniformly accurate interatomic potentials for materials simulation. Phys. Rev. Mater. 3, 023804 (2019).
Article CAS Google Scholar
Kresse, G. & Furthmüller, J. Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set. Comput. Mater. Sci. 6, 15–50 (1996).
Article CAS Google Scholar
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865–3868 (1996).
Article CAS PubMed Google Scholar
Liu, L., Liu, X., Gao, J., Chen, W. & Han, J. Understanding the difficulty of training transformers. Preprint at http://arxiv.org/abs/2004.08249. (2023).
Ko, T. W. et al. Materials Graph Library (MatGL), an open-source graph deep learning library for materials science and chemistry. npj. Comput. Mater. 11, 253 (2025).
Article Google Scholar
France-Lanord, A. & Grossman, J. C. Correlations from Ion Pairing and the Nernst-Einstein Equation. Phys. Rev. Lett. 122, 136001 (2019).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported in part by the National Science and Technology Major Project (Grants No. 2023ZD0120702), Key Research Program of Frontier Sciences of CAS (Grant No. ZDBS-LY-SLH008) and National Nature Science Foundation of China (Grants No. 12304049). We thank Bowen Deng and Dr. Peichen Zhong, the authors of CHGNet, for inspiring discussions. We also appreciate valuable advice by Dr. Qisheng Wu. The computational resource was supported by the Bohrium Cloud Platform at DP technology.

Author information

These authors contributed equally: Ruoyu Wang, Mingyu Guo.

Authors and Affiliations

School of Artificial Intelligence and Data Science, University of Science and Technology of China, Hefei, China
Ruoyu Wang, Yuxiang Gao & Zhicheng Zhong
Suzhou Institute for Advanced Research, University of Science and Technology of China, Suzhou, China
Ruoyu Wang, Yuxiang Gao & Zhicheng Zhong
Suzhou Lab, Suzhou, China
Ruoyu Wang, Mengchao Shi & Zhicheng Zhong
DP Technology, Beijing, China
Mingyu Guo, Xiaoxu Wang, Yuzhi Zhang, Bin Deng, Mengchao Shi & Linfeng Zhang
School of Chemistry, Sun Yat-sen University, Guangzhou, China
Mingyu Guo
AI for Science Institute, Beijing, China
Xiaoxu Wang, Yuzhi Zhang & Linfeng Zhang

Authors

Ruoyu Wang
View author publications
Search author on:PubMed Google Scholar
Mingyu Guo
View author publications
Search author on:PubMed Google Scholar
Yuxiang Gao
View author publications
Search author on:PubMed Google Scholar
Xiaoxu Wang
View author publications
Search author on:PubMed Google Scholar
Yuzhi Zhang
View author publications
Search author on:PubMed Google Scholar
Bin Deng
View author publications
Search author on:PubMed Google Scholar
Mengchao Shi
View author publications
Search author on:PubMed Google Scholar
Linfeng Zhang
View author publications
Search author on:PubMed Google Scholar
Zhicheng Zhong
View author publications
Search author on:PubMed Google Scholar

Contributions

M.S., L.Z. and Z.Z. supervised the project. M.S., R.W. and Z.Z. conceived the project and idea. R.W., M.G., Y.G. and M.S. carried out data generation, model training and simulation. B.D. and X.W. conceived the application cases for DPA-SSE. Y.Z. updated the active learning code to support the project. R.W. analyzed the data and drafted the initial manuscript. R.W. wrote the final manuscript with input from all authors. All authors discussed the results and commented on the manuscript.

Corresponding authors

Correspondence to Mengchao Shi, Linfeng Zhang or Zhicheng Zhong.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplemental Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, R., Guo, M., Gao, Y. et al. A pre-trained deep potential model for sulfide solid electrolytes with broad coverage and high accuracy. npj Comput Mater 11, 266 (2025). https://doi.org/10.1038/s41524-025-01764-6

Download citation

Received: 24 July 2025
Accepted: 04 August 2025
Published: 20 August 2025
DOI: https://doi.org/10.1038/s41524-025-01764-6