Differentiable modeling and optimization of non-aqueous Li-based battery electrolyte solutions using geometric deep learning

Zhu, Shang; Ramsundar, Bharath; Annevelink, Emil; Lin, Hongyi; Dave, Adarsh; Guan, Pin-Wen; Gering, Kevin; Viswanathan, Venkatasubramanian

doi:10.1038/s41467-024-51653-7

Download PDF

Article
Open access
Published: 05 October 2024

Differentiable modeling and optimization of non-aqueous Li-based battery electrolyte solutions using geometric deep learning

Nature Communications volume 15, Article number: 8649 (2024) Cite this article

8970 Accesses
15 Citations
5 Altmetric
Metrics details

Subjects

Abstract

Electrolytes play a critical role in designing next-generation battery systems, by allowing efficient ion transfer, preventing charge transfer, and stabilizing electrode-electrolyte interfaces. In this work, we develop a differentiable geometric deep learning (GDL) model for chemical mixtures, DiffMix, which is applied in guiding robotic experimentation and optimization towards fast-charging battery electrolytes. In particular, we extend mixture thermodynamic and transport laws by creating GDL-learnable physical coefficients. We evaluate our model with mixture thermodynamics and ion transport properties, where we show improved prediction accuracy and model robustness of DiffMix than its purely data-driven variants. Furthermore, with a robotic experimentation setup, Clio, we improve ionic conductivity of electrolytes by over 18.8% within 10 experimental steps, via differentiable optimization built on DiffMix gradients. By combining GDL, mixture physics laws, and robotic experimentation, DiffMix expands the predictive modeling methods for chemical mixtures and enables efficient optimization in large chemical spaces.

Chemical foundation model-guided design of high ionic conductivity electrolyte formulations

Article Open access 30 August 2025

De novo design of polymer electrolytes using GPT-based and diffusion-based generative models

Article Open access 19 December 2024

Autonomous optimization of non-aqueous Li-ion battery electrolytes via robotic experimentation and machine learning coupling

Article Open access 27 September 2022

Introduction

Chemical mixtures are widely used in chemical processes and devices such as energy storage and conversion^1,2,3,4, chemical reactions and catalysis^5,6,7, and environmental engineering^8,9,10. Often, the mixture chemistry and compositions are carefully designed to achieve higher device performances. In particular, battery electrolytes, as mixtures of salts and solvents, have been optimized to facilitate ion transport, prevent electron transfer, and stabilize electrode-electrolyte interfaces for an energy-dense and durable battery system^11,12,13,14.

The design and optimization of electrolyte mixtures remain challenging due to the complexity of mixture chemistry and compositions, as well as the high experimentation cost^15,16. Physics-based modeling offers a solution by probing the underlying mixture physics and rationalizing the design principles for high-performing mixtures. Among physics-based mixture modeling techniques, molecular simulation is a powerful tool to study the interactions and dynamic evolution inside a complex mixture system, but it can be limited to time and length scales due to its high computational costs^17,18. Alternatively, chemical physicists proposed empirical function relationships to describe mixture physics. For example, Redlich-Kister (R-K) polynomials¹⁹ were designed for modeling mixture thermodynamics, and Arrhenius equation²⁰ was proposed to describe the temperature dependence of chemical reactions and other dynamic behaviors. Although they may provide decent model accuracy and indicate intrinsic physical behaviors such as reaction energy barriers, these empirical relationships are lacking in predictive power when new chemical species are provided. Emerging data-driven methods^{21,22,23,24,25,26,27} can potentially bridge the gap in the predictive modeling of electrolyte mixtures^28,29,30,31. Notably, with a linear regression method, Kim et al.³² discovered a strong correlation between the oxygen content in battery electrolytes and lithium-metal-cell Coulombic efficiencies. Bradford et al.³³ developed a graph machine learning model of solid polymer electrolytes (SPEs) and predicted ionic conductivities of thousands of new SPEs. Furthermore, the differentiability of modern deep learning models provides a new opportunity for unifying physics-based and data-driven models^{34,35,36,37,38,39}. Especially, Guan proposed a general differentiable framework merging thermodynamic modeling and deep learning for multi-component mixtures, where all the thermodynamic observables including thermochemical quantities and phase equilibria can be auto-differentiated, thus allowing models learned by gradient-based optimization³⁷. It was subsequently extended to a more comprehensive framework of differentiable materials modeling and design, including the full processing-structure-properties-performance relationships⁴⁰.

In this work, we leverage the geometric deep learning (GDL) method for battery electrolyte modeling and optimization, where, in GDL, necessary geometric priors are applied as constraints on the model space to improve model efficiency^41,42. In particular, we develop a differentiable GDL model of chemical mixtures, DiffMix, which is applied in guiding the robotic experimentation towards fast-charging battery electrolytes. The GDL component is designed to transform the molecular species, compositions, and environment conditions, to physical coefficients in predefined mixture physics laws, where the Redlich-Kister (R-K) mixing theory and Vogel-Fulcher-Tammann (VFT) model are selected for mixture thermodynamic and transport properties, respectively. Specifically, we leverage the graph neural networks⁴³ for molecular-level representation learning and design a customized neural network to learn mixture representations while preserving the permutation invariance of mixture components in DiffMix. We test the predictive power of DiffMix on a non-electrolyte binary mixture dataset of excess molar enthalpies and excess molar volumes, and thereafter on a large-scale simulation dataset of electrolyte ionic conductivities. We compare our model with its purely data-driven methods and show superior performances on prediction accuracy and robustness. Further, with our previously built robotic experimentation setup, Clio¹⁶, we demonstrate a differentiable optimization on battery electrolyte mixtures, based on the gradient information from DiffMix auto-differentiation⁴⁴. We successfully improve the ionic conductivity values by over 18.8% within 10 experimental steps in the evaluated chemical space, enabling the fast-charging design of battery systems. Our method extends the modeling techniques of battery electrolyte mixtures by unifying physics models and geometric deep learning, and realizing the differentiable optimization of battery electrolyte properties.

DiffMix: combining physics and geometric deep learning for modeling chemical mixtures

Our model, DiffMix, combines physics and geometric deep learning in order to build a differentiable and predictive model for chemical mixtures, as shown in Fig. 1a. Taking the input of chemical graphs, compositions, and environment condition vector (g, x, E), DiffMix processes with two components, geometric deep learning G_θ(g, x, E) and physics laws, f( ⋅ , x, E), and then output mixture property P_m = f(G_θ, x, E), in an end-to-end differentiable framework.

**Fig. 1: Differentiable Modeling and Optimization of Chemical Mixtures with DiffMix.**

Physics models for thermodynamics of mixing and ion transport

The selection of physics models, f( ⋅ , x, E), depends on the mixture properties of interest. Here, we take the mixing thermodynamics of binary non-electrolyte mixtures and ion transport of multicomponent electrolyte mixtures as examples, which can be further generalized to other forms^45,46.

To describe the thermodynamics of mixing of non-electrolyte mixtures, a polynomial expansion can be used for representing the excess function of mixing ΔP_m, i.e., the difference between mixing thermodynamic quantity P_m and the linear combination of each component Σ_ix_iPⁱ, where Pⁱ is the property of the species i. It has been successfully applied in differentiable thermodynamic modeling³⁷, with the Redlich-Kister (R-K) polynomial¹⁹ being a popular choice:

$$\Delta {P}_{{{{\rm{m}}}}}={\Sigma }_{i < j}\left[{x}_{{{{\rm{i}}}}}{x}_{{{{\rm{j}}}}}{\Sigma }_{k=0}^{N}{C}_{{{{\rm{RK}}}},{{{\rm{ij}}}}}^{{{{\rm{k}}}}}{({x}_{{{{\rm{i}}}}}-{x}_{{{{\rm{j}}}}})}^{k}\right]$$

(1)

where x_i and x_j are mole fractions of species i and $j,{C}_{{{{\rm{RK}}}},{{{\rm{ij}}}}}^{{{{\rm{k}}}}}$ is the R-K polynomial coefficients between the two species and with order number k. Equation (1) preserves the permutation invariance of chemical species i and j, when the odd orders of polynomials follow the parity rule of permutation. The mixture thermodynamic property P_m can be further obtained by:

$${P}_{{{{\rm{m}}}}}={\Sigma }_{i < j}\left[{x}_{{{{\rm{i}}}}}{x}_{{{{\rm{j}}}}}{\Sigma }_{k=0}^{N}{C}_{{{{\rm{RK}}}},{{{\rm{ij}}}}}^{{{{\rm{k}}}}}{({x}_{{{{\rm{i}}}}}-{x}_{{{{\rm{j}}}}})}^{k}\right]+{\Sigma }_{i}{x}_{{{{\rm{i}}}}}{P}^{{{{\rm{i}}}}}$$

(2)

Equation (2) preserves permutation invariance over mixture components and can be applied to a wide range of mixture thermodynamic properties.

On the ion transport properties, we focus on the ionic conductivities of battery electrolytes. A higher ionic conductivity will reduce the ion transfer resistance between electrodes and lessen the formation of electrolyte concentration polarization, therefore enabling fast-charging battery applications⁴⁷. Here, we select the Vogel-Fulcher-Tammann (VFT) model to capture the temperature dependence⁴⁸ as:

$${P}_{{{{\rm{m}}}}}={C}_{1}{e}^{-\frac{{C}_{2}}{T-{C}_{3}}}$$

(3)

where T is the temperature and {C_i} is a set of physical coefficients.

Geometric deep learning (GDL) to learn mixture representations

Equation (2) and (3) describe thermodynamic and ion transport laws that conventionally rely on empirically fitting experimental data to obtain physical coefficients, {C_RK, C_i}. However, the function relationship between mixture input (g, x, E) and physical coefficients {C_RK, C_i} remains unknown. GDL component is therefore introduced to replace physical coefficients with learnable GDL functions, {C_RK, C_i} = G_θ(g, x, E). In this way, the mixture physics model now becomes predictive and fully differentiable from chemical structures to properties. The first step in the GDL component is a graph convolution⁴³ transformation over each component graph g_i to obtain the graph-level feature vector ${{{{\bf{g}}}}}_{{{{\rm{i}}}}}^{{\prime} }$, for component i, as shown in Fig. 1a. In the second step, ${{{{\bf{g}}}}}_{{{{\rm{i}}}}}^{{\prime} }$ is attached with compositions and environment conditions and processed by MixtureNet to learn the mixture-level representations. The detailed model architecture of MixtureNet is shown in Fig. 1b. Each attached mixture component vector, v_i=$[{{{{\bf{g}}}}}_{{{{\rm{i}}}}}^{{\prime} },{x}_{{{{\rm{i}}}}},{{{\bf{E}}}}]$, passes through two weight-sharing fully connected neural networks (FCNN), SubNet and PairNet, to learn the per-substance and pairwise-interaction embeddings, {s_i} = {f ^sub(v_i)} and {p_ij} = {f ^pair(v_i, v_j)}, respectively, where f ^sub( ⋅ ) and f ^pair( ⋅ ) represent the neural network transformations. Depending on the mixture physics laws, {s_i} and {p_ij} are combined in a certain form to produce the physical coefficients. For VFT model in Equation (3) and battery electrolyte mixtures, mixture feature vector m is created via a pooling operator ⊕ , m = {s_i} ⊕ {p_ij} = [Σ_ix_i ⋅ s_i, Σ_ijx_ix_j ⋅ p_ij], which provides a permutation-invariant map from {s_i} and {p_ij} to m by concatenating the weighted sums of substance and pair embeddings. The physical coefficients in the VFT model, {C_i}, is a function of m via another FCNN, MixNet, {C_i} = { f ^mix(m)}. When no physics laws are applied, MixNet simply outputs mixture property. A special case exists in the mixing law of thermodynamics and R-K polynomial-based model in Equation (2), due to the intrinsic per-substance dependence of Pⁱ and pairwise interaction dependence of $\{{C}_{{{{\rm{RK}}}},{{{\rm{ij}}}}}^{{{{\rm{k}}}}}\}$. They can be modeled directly from SubNet and PairNet, and our implementation follows: {P_i} = { f ^s2p(s_i)} and $\{| {C}_{{{{\rm{RK}}}},{{{\rm{ij}}}}}^{{{{\rm{k}}}}}| \}=\{( \, \, {f}^{{{{\rm{s}}}}2{{{\rm{RK}}}}}([{{{{\bf{s}}}}}_{{{{\rm{i}}}}},{{{{\bf{s}}}}}_{{{{\rm{j}}}}}])+{f}^{{{{\rm{s}}}}2{{{\rm{RK}}}}}([{{{{\bf{s}}}}}_{{{{\rm{j}}}}},{{{{\bf{s}}}}}_{{{{\rm{i}}}}}]))/2\}$, where f ^s2p( ⋅ ) and f ^s2RK( ⋅ ) are two separate FCNNs to learn the pure component property given substance embeddings and RK coefficients given pairs of substance embeddings, respectively. An alternative implementation is to replace concatenated substance embeddings [s_i, s_j] with p_ij. We further introduce the mean pooling of permuted sequences via dividing the f^s2RK( ⋅ ) summation by two so that $| {C}_{{{{\rm{RK}}}},{{{\rm{ij}}}}}^{{{{\rm{k}}}}}|=| {C}_{{{{\rm{RK}}}},{{{\rm{ji}}}}}^{{{{\rm{k}}}}}|$, where the absolute value symbol is added since the RK coefficients follow permutation parity, ${C}_{{{{\rm{RK}}}},{{{\rm{ij}}}}}^{{{{\rm{k}}}}}={C}_{{{{\rm{RK}}}},{{{\rm{ji}}}}}^{{{{\rm{k}}}}}\cdot {(-1)}^{{{{\rm{k}}}}}$. We set ${C}_{{{{\rm{RK}}}},{{{\rm{ij}}}}}^{{{{\rm{k}}}}}=| {C}_{{{{\rm{RK}}}},{{{\rm{ij}}}}}^{{{{\rm{k}}}}}|$ when chemical i’s molecular weight is smaller than j’s. No additional pooling operations are required. More details about the model implementation can be found in the Methods section. Note that the geometric deep learning concept refers to the approaches that incorporate appropriate geometric priors, i.e., information on the structure space and symmetry properties of input signals⁴¹, and in this work we preserve permutation invariance over components with the mixture pooling operator ( ⊕ ) in the VFT model and the intrinsic permutation invariance introduced in the R-K model. The preserved permutation invariance will improve the model robustness by outputting identical mixture properties when mixture component sequences are permuted. To confirm that, we reverse the co-solvent sequences and recalculate the loss on the permuted testing set, as reported in the result section.

In the graph convolution transformation, we perform an ablation study on the impact of 3-dimensional (3D) molecular structural information, by introducing DimeNet++^49,50 that operates on 3D molecular conformers generated and optimized by RDKit⁵¹, compared with the original version of graph convolutions on 2-dimensional (2D) molecular graph objects. More details are described in the Methods part. Furthermore, to benchmark the effectiveness of combining physics laws with data-driven models, we design a purely data-driven baseline, GNN-only, a graph-neural-network model without physics incorporated, which is created by removing the mixture physics model in DiffMix. In the VFT-type of the GDL component, instead of outputting {C_i} as the physics law coefficients, the GNN-only model ignores the physics laws and directly produces the mixture properties. All models are evaluated on two thermodynamic datasets of binary non-electrolyte mixtures and one transport property dataset of battery electrolyte mixtures. The thermodynamic data include literature-curated excess molar enthalpies (631 data points) and excess molar volumes (1,069 data points). For electrolytes, the ionic conductivity dataset is prepared that contains 24,822 mixtures of single-salt-ternary-solvent electrolyte solutions, generated by the Advanced Electrolyte Model (AEM)^52,53. More data generation details can be found in the Methods part. In Supplementary Information (SI), we also test a data-driven variant with non-graph Morgan fingerprints for molecule representation⁵⁴ and without permutation invariance of mixture component sequences.

DiffMix-guided robotic experimentation and optimization for battery electrolytes

Differentiability enables gradient-based optimization for materials modeling and design^37,40. With auto-differentiation⁴⁴ on a trained DiffMix model, we can conveniently obtain gradient information of mixture property output over input compositions, $\frac{d({P}_{{{{\rm{m}}}}})}{d({x}_{{{{\rm{i}}}}})}$, and thereafter navigate the mixture chemical space in order to optimize the mixture property objective. In Fig. 1c, we illustrate the battery electrolyte optimization on a ternary co-solvent composition space to maximize the ionic conductivity via a gradient-ascent algorithm, and guide our previously developed robotic experimentation setup, Clio¹⁶, to improve the electrolyte ion transport properties for fast-charging batteries. Note that other factors, such as electrochemical stability and interfacial reactivity, may also play an important role in fast-charging battery electrolyte design and need to be discussed in a future study.

Results

Differentiable modeling on thermodynamic and transport properties of chemical mixtures

We start our result analysis on excess molar enthalpies (${H}_{{{{\rm{m}}}}}^{{{{\rm{E}}}}}$) and excess molar volumes (${V}_{{{{\rm{m}}}}}^{{{{\rm{E}}}}}$) of binary non-electrolyte mixtures. The model performances of DiffMix and GNN-only model are summarized in Table 1. First, we do not observe a significant performance boost when replacing the 2D Graph Convolutions with 3D-information-incorporated DimeNet++ models, which may be attributed to the fact that the datasets we include in our work are real-world mixtures, where the individual molecule may have spatially varying 3D coordinates depending on the local environments, so one fixed configuration generated by RDKit may not be representative enough. Second, we find that DiffMix and DiffMix-3D models, built on the known physics prior, outperform the GNN-only and GNN-only-3D models. With DiffMix model, we achieve mean-absolute-errors (MAEs) of 0.033 ± 0.009 (cm³/mol) and 5.10 ± 0.32 (J/mol) for excess molar volumes and excess molar enthalpies, respectively.

Table 1 Model Performance on Thermodynamic Properties and Ionic Conductivities^a

Full size table

Further, we investigate the predictive power of DiffMix and DiffMix-3D on ionic conductivities (κ) of multi-component electrolyte solutions. With the 24,822 ionic conductivity data points, we train both models and compare them with the GNN-only and the GNN-only-3D baseline models. The prediction accuracy on the testing sets is shown in Table 1. Compared with thermodynamic results, the accuracy improvement by adding physics priors in DiffMix is not as significant here, which holds for both 2D and 3D model variants. This may be attributed to the limited physical capacity of the VFT model in Equation (3), but further investigation is required, such as testing alternative physics laws for ionic conductivities. Here, with DiffMix model, we achieve the MAE of 0.044 ± 0.005 (mS/cm) considering the maximum ionic conductivity above 12 (mS/cm) in the training set.

It is also worth noting that DiffMix-3D models are trained at least two times more slowly than DiffMix models across the tasks, while we do not observe a distinct performance improvement when incorporating 3D information in the graph convolution input. Therefore, we focus our further result analysis on the machine learning models without 3D information. First, we confirm the permutation invariance of DiffMix and GNN-only models, considering the identical loss values before and after permuting the component sequences for all tasks, as shown in Table 1. According to Table S 1, the baseline model without permutation invariance reports permuted testing loss values that are significantly higher than the vanilla testing loss in all tasks, indicating the importance of model invariance. Furthermore, the parity plots for three tasks are shown in Fig. 2(a–c), indicating a high correlation between true and DiffMix-predicted values. In Fig. 2(d), we visualize the DiffMix-learned mixture features (m in Fig. 1b) for ionic conductivities with principal component analysis (PCA) in two dimensions, where we observe a smooth distribution of high and low κ values. The smooth mixture-feature distribution demonstrates the effectiveness of the graph convolution step, SubNet and PairNet, and pooling, on distinguishing different mixture compositions, which can boost prediction accuracy after passing the features through MixNet.

**Fig. 2: Prediction Accuracy Analysis of DiffMix.**

Physics model capacity and temperature extrapolation

For the mixture thermodynamics tasks, so far, the polynomial order N in Equation (2) is specified as four. To study the polynomial-order dependence of model capacity, we vary the polynomial order as N = 0, 4, 9, 14 or fully remove the excess term in Equation (2). The latter essentially describes the linear mixing rule. The results are shown in Fig. 3a, b, where we also compare them with GNN-only in order to see the effectiveness of the added physics models. First, we observe the trend of decreasing testing errors when higher orders of polynomials are introduced, i.e., increasing the capacity of the mixture physics model. With N = 4, MAEs for both ${V}_{{{{\rm{m}}}}}^{{{{\rm{E}}}}}$ and ${H}_{{{{\rm{m}}}}}^{{{{\rm{E}}}}}$ get reduced by over half than those of the linear mixing model. However, the model performance plateaus as we further increase the polynomial-based model capacity. It is worth noting that the experimentation uncertainty is around 0.005 (cm³/mol) and 5 (J/mol) for the two measurements. For the excess molar volume task, the plateauing behavior may be due to the fact that DiffMix accuracy is limited by the GDL model capacity. However, for the enthalpy task, it can also be attributed to the data uncertainty, considering that the DiffMix prediction MAE is close to the measurement error. Compared with the GNN-only baseline model, in both cases, adding the zeroth-order interaction terms improves the worst performing linear mixing model significantly and DiffMix (N=0) matches the GNN-only baseline. This indicates that both the molar volume and enthalpy changes of mixing rely on the intermolecular interactions, modeled by the pair-wise interaction coefficients in R-K mixing laws. Further, Figure S 2 describes the overall decreasing trend of R-K polynomial coefficients $\{{C}_{{{{\rm{RK}}}},{{{\rm{ij}}}}}^{{{{\rm{k}}}}}\}$ when 15 polynomials are included in the physics-based R-K model, explaining the plateauing pattern of the model accuracy.

**Fig. 3: Physics Model Capacity and Extrapolation Analysis of DiffMix.**

On modeling the ionic conductivities (κ) of battery electrolytes, we test model extrapolation to higher temperatures, as shown in Fig. 3c. In Fig. 3c, we report the prediction MAEs grouped by temperatures in the range of [-30 °C, 60 °C]. Note that our training is performed on the data with temperature range [-30 °C, 20 °C]. We notice that the interpolation MAE is close to 0 for both models, consistent with the low MAE results reported in Table 1. However, in the extrapolation test on the data generated above 20 °C, non-negligible errors have been detected, and the MAE magnitudes are positively correlated with the temperature change from 20 °C. The average MAE at 60 °C goes above 1 (mS/cm), two orders of magnitudes higher than that in the interpolation case. Compared to the GNN-only baseline, we found a superior accuracy with DiffMix. The average MAE drops from 0.39 (mS/cm) to 0.24 (mS/cm), from 0.83 (mS/cm) to 0.49 (mS/cm), from 1.28 (mS/cm) to 0.75 (mS/cm), and from 1.74 (mS/cm) to 1.03 (mS/cm), at T of 30, 40, 50, 60 °C, respectively. We further compare the DiffMix prediction results with the experimental measurements, as shown in the parity plot of Fig. 3d. Both the interpolation and extrapolation testing results of DiffMix are validated by experimental measurements for the solutions of lithium hexafluorophosphate (LiPF₆) in ethylene carbonates (EC), propylene carbonates (PC), and ethyl methyl carbonates (EMC) solvent mixtures⁵⁵. In the experimentation, the salt concentration varied between 0.2 (mol/kg) and 2.1 (mol/kg), and the EC:PC ratio was varied with (EC+PC):EMC ratio fixed at 3:7 and 1:1, respectively. We find a good agreement between DiffMix predictions and experiments, even in the extrapolation test with temperatures higher than 20 °C. Quantitatively, the R² and Pearson correlation coefficient values for interpolation and extrapolation sets are (0.80, 0.94) and (0.75, 0.92), and the interpolation and extrapolation MAEs are 0.75 (mS/cm) and 1.07 (mS/cm), respectively. Based on the results in Fig. 3d, we conclude that the AEM-generated data provide an accurate basis to learn the complex electrolyte patterns via DiffMix at the given conditions.

Differentiable battery electrolyte optimization with DiffMix and robotic experimentation

Fast charging of Li-ion batteries is impacted by electrolyte ionic conductivities, and electrolyte optimization can be challenging for battery design due to high experimentation costs¹⁶. With the trained DiffMix model, we test its capability to evaluate ionic conductivities and design electrolyte mixtures for high-performing Li-ion batteries. We select three types of electrolyte solutions as test cases and evaluate their ionic conductivities at 30 °C and varying co-solvent compositions. They are LiPF₆ salt in solvent mixtures of (i) cyclic carbonates, including ethylene carbonates (EC), propylene carbonates (PC) and fluorinated ethylene carbonates (FEC), (ii) linear carbonates, including ethyl methyl carbonates (EMC), diethyl carbonates (DEC) and dimethyl carbonates (DMC) and (iii) cyclic and linear carbonates, including EC, PC, and DMC. We first show the ionic conductivity landscape of (i) in Fig. 4a by varying co-solvent compositions with fixed lithium mole fractions of 0.08, where we observe a moderate ionic conductivity peak up to 8 (mS/cm) in the EC-enriched region. Note that we treat the anions and cations separately when computing the mole fraction. Fig. 4b provides the conductivity landscape of electrolyte mixture (ii), where the highest κ values are observed in the DMC-enriched region. Here, we fixed the lithium mole fraction at 0.12 due to the low dielectric constants of linear carbonates and thus the low dissociation degree of lithium salts. According to the conductivity map of the electrolyte mixture (iii) shown in Fig. 4c, adding linear carbonate molecules into cyclic carbonate solvents can significantly increase the mixture ionic conductivities, where the maximum ionic conductivity is 14.39 (mS/cm) when PC:DMC:EC mole ratio is close to 0:0.70:0.30 with a fixed lithium-ion mole fraction of 0.08. It is worth noting that the training data is produced with a temperature lower than or equal to 20 °C (maximum is 12.6 (mS/cm)), but we are testing its generalization capability at 30 °C. We verify this result with the output of our data generator, AEM⁵², which provides the highest conductivity of 14.2 (mS/cm) at 0.082 to 0.085 lithium mole fraction with the given PC:DMC:EC ratio. This agrees well with the differentiable modeling result and indicates good generalization of DiffMix.

As previously introduced, the gradient information is readily accessible by differentiating the trained DiffMix model. To illustrate that, we show the gradient vectors as arrows in the ionic conductivity landscapes in Fig. 4a–c. In Fig. 4a, c, we observe large gradients at pure EC solvent area, indicating that adding a small number of co-solvents can significantly improve the ionic conductivity. This can be explained by EC’s being solid-like at room temperature¹¹. Another interesting observation is that pure DMC solvent area in Fig. 4c displays a much higher gradient than that in Fig. 4b, with which we conclude that adding a small quantity of high-polarity cyclic carbonate solvents (EC, PC) could enable higher ionic conductivities. The gradient field map is generated instantaneously by auto-differentiation leveraging training data generated by AEM in the trained DiffMix model, which saves expensive experimental costs by reducing the number of optimization steps. Based on the gradient information provided by DiffMix, we implement a gradient-ascent algorithm by increasing the objectives iteratively following the gradient directions, where a batch of four starting points are initialized in the mixture space. From Fig. 4a–c, our optimization algorithm robustly identified local maximum spots. This differentiable optimization framework further guides the robotic experimentation performed by our hardware setup, Clio. We extract the batch of four optimization trajectories in Fig. 4c and compare the ionic conductivities evaluated at each step by both DiffMix and Clio, as shown in Fig. 4d. DiffMix and Clio results show a good agreement between simulation and experimentation, and we show at least an 18.8% increase from the initial ionic conductivities. Note that the temperature of Clio is managed at around 27 °C, which is slightly varied from the DiffMix temperature of 30 °C. This may account for deviations between experimental and simulation data. Across Fig. 4a–d, we focus on varying solvent composition space while fixing lithium mole fractions. The fixed lithium mole fractions are selected for demonstration purposes and are close to the optimal lithium concentrations in the training data generated at lower temperatures (20 °C). We further realize an improved ionic conductivity by fixing solvent compositions and optimizing salt concentrations to prove the generalizability of our framework, as described in SI. These results elucidate the capability of differentiable modeling of battery electrolytes, with which we could efficiently explore the chemical space of multi-component electrolyte mixtures.

Discussion

In this work, focusing on battery electrolytes, we develop a GDL-based differentiable model for chemical mixtures, DiffMix, that combines the advantages of physics-based models and geometric deep learning and further guides the robotic experimentation for practical electrolyte optimization. The evaluation results on thermodynamic data of binary non-electrolyte mixtures and ion transport data of electrolyte mixtures indicate that DiffMix preserves the component-wise permutation invariance and enables more accurate and robust predictions than GNN-only and MixECFP (in Table S 1), as can be seen from the low MAEs. When extrapolated to high temperatures, DiffMix ionic conductivity predictions show superior accuracy than the GNN-only baseline, due to the incorporation of a temperature-dependent VFT model. The experimental measurements and DiffMix display a good agreement with each other, even in the extrapolation case, enabling the real-world applications of our trained model.

We further test the physics model capacity of R-K thermodynamic mixing law in DiffMix by tuning the polynomial order N in Equation (2) and observe a plateauing behavior beyond N = 4. A similar trend between excess molar volumes and enthalpies is observed that adding the zeroth-order interaction terms significantly improves the worst performing linear mixing model, indicating strong intermolecular interactions that may exist in the tested mixture systems. Although this demonstrates the flexibility of our model in terms of the function forms of physics laws, future investigation is required to explore other types of thermodynamic and kinetic laws for mixtures.

By building our model in a fully differentiable framework, gradient information is readily accessible for a trained DiffMix model. This further allows us to optimize ionic conductivity over the input space. Taking the input co-solvent composition as variables, we identify peak ionic conductivity areas for various ternary co-solvent electrolyte chemical spaces by mixing linear carbonate solvents and cyclic carbonate solvents. The simulated trajectories have been utilized to guide the robotic experimentation performed by Clio, which successfully increases ionic conductivity values by over 18.8%. It is worth noting that in this work we conduct the DiffMix-guided robotic experimentation in a two-step process, (1) training a DiffMix model with simulated AEM data and running the optimization on the modeled response surface, (2) guiding Clio with the predefined optimization trajectory. In an alternative way, especially when the simulation is not of high quality, a closed-loop optimization can be designed via retraining the DiffMix model every few iterations during experimental data collection, which may enable a more robust and adaptive optimization. Our work has expanded the modeling and optimization techniques of battery electrolyte mixtures by unifying physics laws and geometric deep-learning in a differentiable framework.

So far, our discussions on battery electrolyte optimization focus on improving their ionic conductivities, a key factor that determines fast-charging behaviors by reducing the concentration polarization and ion transfer resistance, while other factors, e.g., electrochemical stability and interfacial reactivity, may be important and need to be considered in a future work. The chemical space we explore in this work is well-benchmarked^56,57, and a similar electrolyte recipe has shown improved fast-charging performances on the device level¹⁶. However, when we move to other chemical spaces, electrochemical device measurements are required with the optimized electrolyte compositions. Future works can also be conducted on constructing comprehensive datasets for other properties beyond ionic conductivities to perform multi-objective optimization, to which our DiffMix framework is readily applicable with a modified multi-task objective function.

Methods

Data collection and generation

Thermodynamic and transport mixture property datasets were prepared for benchmarking models developed in this work. The thermodynamic datasets include excess molar enthalpy^58,59,60,61 and excess molar volume^{60,62,63,64,65,66,67,68} values curated from the literature. There are 631 data points for excess molar enthalpy, covering 34 unique mixture chemistries composed of 35 organic chemicals with varying compositions. For excess molar volume, there are 1069 binary mixture data points based on 28 unique mixtures composed of 25 organic chemicals with varying compositions. For ionic conductivities, we prepared an ionic conductivity dataset that contains 24,822 mixtures of single-salt-ternary-solvent electrolyte solutions. These electrolyte components consist of two unique salt species, including lithium hexafluorophosphate (LiPF₆), lithium bis((trifluoromethyl)sulfonyl)azanide (LiTFSI), and six organic carbonate solvents, including ethylene carbonates (EC), propylene carbonates (PC), fluorinated ethylene carbonates (FEC), ethyl methyl carbonates (EMC), diethyl carbonates (DEC) and dimethyl carbonates (DMC). The electrolyte data were generated with one salt and any arbitrary combinations of three co-solvents, with the salt concentration ranged in {0.025, 0.5, 1.0, 1.5, 2.0, 2.5, 3.0} molal and each-solvent mass fractions varying from {0, 0.2, 0.4, 0.6, 0.8, 1.0}. The temperature range of training data is [-30 °C, 20 °C] and temperature extrapolation cases, as shown in Fig. 3c, are evaluated at higher temperatures. The data generation was performed by the Advanced Electrolyte Model (AEM) that produces high-fidelity electrolyte data for the chemical species evaluated here⁵². In the collected datasets, molecule names are stored and can be converted to their Simplified Molecular Input Line Entry System (SMILES) format⁶⁹, from which we can retrieve the topology and chemical information on atoms and bonds with RDKit⁵¹. We converted all compositions into mole fractions during model training. The availability of training datasets is discussed in the data availability statement.

Model implementation and training

For GNN-only and DiffMix, the atom features considered include one hot encoding of atom type, number of heavy neighbors, formal charges, hybridization type, chirality, and number of implicit hydrogens, and numerical information on ring structures, aromaticity, atomic mass, VdW radius, and covalent radius, giving a 97-dimension feature vector. Note that no bond features are incorporated in our model, but can be included in future work. The graph convolution cell is made up of 3 torch_geometric.nn.conv.GraphConv()⁴³ steps, each of which is followed by ReLU and dropout layers (dropout rate, p = 0.25). The whole graph convolution cell ends up with a global mean pooling layer and provides graph-level embeddings for component molecules ${{{{\bf{g}}}}}_{{{{\rm{i}}}}}^{{\prime} }$. These graph-level embeddings are then concatenated with compositions and environment conditions. For DiffMix-3D models, we replaced the aforementioned graph convolution by DimeNet++^49,50 model, where we followed the Pyorch-Geometric implementation of the model (torch_geometric.nn.DimeNetPlusPlus()), and applied default parameters from the original official repository⁷⁰ (emb_size: 128; out_emb_size: 256; int_emb_size: 64; basis_emb_size: 8; num_blocks: 4; num_spherical: 7; num_radial: 6). To combine the DimeNet++ block with the original DiffMix model, the DimeNet++ block should output graph-level embeddings in the same dimension (256) as graph convolutional layers. The 3D molecular structures are constructed by RDKit conformer generator (rdDistGeom.EmbedMolecule(mol, randomSeed=123); .GetConformer().GetPositions(), where mol is a molecule object) and then optimized by AllChem.MMFFOptimizeMolecule(). In the mixture ionic conductivity model, to reduce the computational cost, we chose a different hyperparameter set: (emb_size: 16; out_emb_size: 256; int_emb_size: 8; basis_emb_size: 4; num_blocks: 2; num_spherical: 2; num_radial: 2), which is still more than 2 times slower than the DiffMix model. Note that RDKit fails to generate 3D coordinates of lithium hexafluorophosphate (LiPF₆), so 2D coordinates of which are manually downloaded from PubChem database⁷¹. In the DiffMix-3D models for ionic conductivities, we treated each salt molecule as a whole, without separating lithium cations and anions since there is no explicit charge information encoded for atoms in the DimeNet++ input.

For GNN-only model, no physics laws are incorporated, and therefore MixtureNet output is the predicted mixture property. The dimensions of SubNet, PairNet, and MixNet go as follows: [N, N, N, N], [2N, 2N, 2N, N], [2N, 2N, 4N, 2N, 1], where N = 256 + 1 + N_env and N_env is the dimension of environment conditions. For DiffMix, it is treated differently for thermodynamic and ion transport properties, since distinct mixing laws are selected. With the VFT model selected, MixNet now is changed into [2N, 2N, 4N, 2N, 3], which outputs the three physical coefficients in the VFT model. In terms of the thermodynamics of mixing, SubNet is used to obtain the per-substance embeddings and two additional neural networks with hidden-layer dimensions, [N, N] and [2N, 4N, 2N], to obtain the component-wise physical parameters {Pⁱ} and pair-wise physical parameters $\{{C}_{{{{\rm{RK}}}},{{{\rm{ij}}}}}^{{{{\rm{k}}}}}\}$, where the input-layer and output-layer dimension for the former is N and 1, and for the latter is 2N and the defined R-K polynomial order.

During training, we set the learning rate as 0.001 with a weight decay rate of 10⁻⁴ in PyTorch⁷²Adam optimizer. L1Loss is used for loss backpropagation. We also applied early stopping criteria to select the epoch with the lowest validation error to avoid overfitting. The ionic conductivity labels went through a logarithm transformation before computing the loss values to distinguish mixture properties that span multiple orders of magnitudes. All models were implemented with PyTorch⁷² and PyTorch Geometric⁷³. For each mixture property, the full dataset was randomly split into training, validation, and testing sets, in the ratio of 8: 1: 1. The cross-validation results were reported based on an ensemble of 5 models with randomly varying splits.

Differentiable optimization algorithm

With the gradient information generated from auto-differentiation, a gradient-ascent algorithm is developed to iteratively increase the electrolyte ionic conductivity, as shown in Fig. 4. With the constraint that the mole fractions of three co-solvents should be summed into 1, we only need to vary the mole fractions of the first two components, a and b. The gradient-based update is as follows:

$$\left[{x}_{{{{\rm{a}}}}}^{1},{x}_{{{{\rm{b}}}}}^{1}\right]=\left[{x}_{{{{\rm{a}}}}}^{0},{x}_{{{{\rm{b}}}}}^{0}\right]+\left[\frac{d\kappa }{d{x}_{{{{\rm{a}}}}}^{0}},\frac{d\kappa }{d{x}_{{{{\rm{b}}}}}^{0}}\right]\cdot {{{\rm{dx}}}}$$

(4)

where [${x}_{{{{\rm{a}}}}}^{0},{x}_{{{{\rm{b}}}}}^{0}$] and [${x}_{{{{\rm{a}}}}}^{1},{x}_{{{{\rm{b}}}}}^{1}$] are the initial and updated mole fractions (points) of a and b, respectively, and $[\frac{d\kappa }{d{x}_{{{{\rm{a}}}}}^{0}},\frac{d\kappa }{d{x}_{{{{\rm{b}}}}}^{0}}]$ are the gradient of ionic conductivity κ at the initial point. The adaptive step size dx is calculated by normalizing 0.02 with the L2 norm of the two-dimensional gradients. It is worth noting that the optimizer needs to observe the constraints that each dimension of mole fractions needs to be within 0 and 1. When the updated step $[{x}_{{{{\rm{a}}}}}^{1},{x}_{{{{\rm{b}}}}}^{1}]$ is outside the boundaries, we perform a two-step update as follows:

$$\left[{x}_{{{{\rm{a}}}}}^{1{{{\rm{t}}}}},{x}_{{{{\rm{b}}}}}^{1{{{\rm{t}}}}}\right]=\left[{x}_{{{{\rm{a}}}}}^{0},{x}_{{{{\rm{b}}}}}^{0}\right]+\left[\frac{d\kappa }{d{x}_{{{{\rm{a}}}}}^{0}},\frac{d\kappa }{d{x}_{{{{\rm{b}}}}}^{0}}\right]\cdot \left(1-{x}_{{{{\rm{a}}}}}^{0}-{x}_{{{{\rm{b}}}}}^{0}\right)/{{{\rm{dx}}}}0\cdot {{{\rm{dx}}}}$$

(5)

$$\left[{x}_{{{{\rm{a}}}}}^{2{{{\rm{t}}}}},{x}_{{{{\rm{b}}}}}^{2{{{\rm{t}}}}}\right]=\left[{x}_{{{{\rm{a}}}}}^{1{{{\rm{t}}}}},{x}_{{{{\rm{b}}}}}^{1{{{\rm{t}}}}}\right]+\left[\frac{d\kappa }{d{x}_{{{{\rm{a}}}}}^{0}}-\frac{d\kappa }{d{x}_{{{{\rm{b}}}}}^{0}},\frac{d\kappa }{d{x}_{{{{\rm{b}}}}}^{0}}-\frac{d\kappa }{d{x}_{{{{\rm{a}}}}}^{0}}\right]\cdot \left({x}_{{{{\rm{a}}}}}^{1}+{x}_{{{{\rm{b}}}}}^{1}-1\right)/{{{\rm{dx}}}}0\cdot {{{\rm{dx}}}}$$

(6)

where $[{x}_{{{{\rm{a}}}}}^{1{{{\rm{t}}}}},{x}_{{{{\rm{b}}}}}^{1{{{\rm{t}}}}}]$ and $[{x}_{{{{\rm{a}}}}}^{2{{{\rm{t}}}}},{x}_{{{{\rm{b}}}}}^{2{{{\rm{t}}}}}]$ are the intermediate and updated points. Here, the first step is to get closer to the boundary while the second step is designed to move along the boundary, i.e., the zero composition line of the third component. dx0 (0.02) is introduced to normalize the proxy distances of $[{x}_{{{{\rm{a}}}}}^{0},{x}_{{{{\rm{b}}}}}^{0}],[{x}_{{{{\rm{a}}}}}^{1},{x}_{{{{\rm{b}}}}}^{1}]$ against the zero-composition boundary of the third component, i.e., $(1-{x}_{{{{\rm{a}}}}}^{0}-{x}_{{{{\rm{b}}}}}^{0})$ and $({x}_{{{{\rm{a}}}}}^{1}+{x}_{{{{\rm{b}}}}}^{1}-1)$. Note that this is an approximate derivation (due to the simplified distance formula) and the updated point $[{x}_{{{{\rm{a}}}}}^{2{{{\rm{t}}}}},{x}_{{{{\rm{b}}}}}^{2{{{\rm{t}}}}}]$ may go beyond the boundary. Therefore, they are followed by clipping ${x}_{{{{\rm{a}}}}}^{2{{{\rm{t}}}}}$ and ${x}_{{{{\rm{b}}}}}^{2{{{\rm{t}}}}}$ (within 0 and 1) and being normalized by the summation of ${x}_{{{{\rm{a}}}}}^{2{{{\rm{t}}}}}$ and ${x}_{{{{\rm{b}}}}}^{2{{{\rm{t}}}}}$ (when above 1). This optimization algorithm performs well in all tested scenarios, as shown in Fig. 4. In Fig. 4, we visualize the optimization history by showing the explored trajectory every five steps until convergence (criteria: the L2 norm of the gradient is below 0.2 or up to 100 steps). When guiding the experimentation robot, we conducted one measurement every two visualized optimization steps, i.e., 10 simulation steps, until no significant ionic conductivity improvement was observed.

Automated experimental measurements of electrolyte properties

The ionic conductivity measurements in this work were done by Clio, a custom-built robotic setup¹⁶. The ionic conductivity data were measured by electrochemical impedance spectroscopy (EIS) in a PTFE fixture chamber using a PalmSens4 impedance analyzer. The electrolytes were filled into the chamber between two symmetric Pt electrodes. The impedance of the cell was measured at five frequencies between 14 kHz and 800 kHz. The resistance of the sample is determined by evaluating the real part of the impedance at the frequency where the smallest phase difference is observed during measurement. To calculate the specific ionic conductivity of the sample, a cell constant is obtained through a single point calibration using a known solution (Acetonitrile and LiPF₆). The specific ionic conductivity is then determined by dividing the inverse resistance by the cell constant. The temperature was managed via glove-box-wide heating and airflow. Temperatures were 27.2 °C ± 0.3 °C. We note that this is slightly lower than the predictions of DiffMix, thus temperature may account for deviations between experimental and modeled data.

Experimental methods: materials

The electrolyte salt (LiPF6) and solvents (PC, DMC, EC) used in this study were obtained from Linyi Gelon LIB Co. Ltd., anhydrous (<20 ppm) and battery grade (99.9% pure). The precursors and electrolyte stock solutions were prepared and stored in a dry Ar-filled glove box (<100 ppm oxygen, <10 ppm H₂O). The stock solutions were made by first mixing the solvents into the desired mass ratios, then gradually adding salts to the solvents to the designated concentrations. The mass of the solutes and solvents were measured using a Denver Instrument PI-214.1 analytical balance. All solutions were mixed with a magnetic stir bar and magnetic stir plate in a glass beaker for at least half an hour after the last visible salt. The solutions were then transferred to and stored in 60-mL amber glass vials with Sure/Seal septa lids.

Data availability

All data used for model training in this study is deposited in Github for public accession for noncommercial, research purposes at github.com/BattModels/DiffMix-NatCommData⁷⁴. Source data are provided with this paper in the same repository. Source data are provided with this paper.

Code availability

The code related to this work is implemented in Python following the methods section described in the manuscript. The code is not publicly available as it is proprietary and exclusively licensed. Further explanation of our methodology is available upon request.

References

Yu, Z. et al. Molecular design for electrolyte solvents enabling energy-dense and long-cycling lithium metal batteries. Nat. Energy 5, 526–533 (2020).
Article ADS CAS Google Scholar
Fan, X. et al. All-temperature batteries enabled by fluorinated electrolytes with non-polar solvents. Nat. Energy 4, 882–890 (2019).
Article ADS CAS Google Scholar
Bi, Z. et al. Individual nanostructure optimization in donor and acceptor phases to achieve efficient quaternary organic solar cells. Nano Energy 66, 104176 (2019).
Article CAS Google Scholar
Harillo-Baños, A., Rodríguez-Martínez, X. & Campoy-Quiles, M. Efficient exploration of the composition space in ternary organic solar cells by combining high-throughput material libraries and hyperspectral imaging. Adv. Energy Mater. 10, 1902417 (2020).
Article Google Scholar
Connors, K.A. Chemical Kinetics: The Study of Reaction Rates in Solution. Wiley (1990).
Hynes, J. T. Chemical reaction dynamics in solution. Ann. Rev. Phys. Chem. 36, 573–597 (1985).
Article ADS CAS Google Scholar
Li, D. et al. Surfactant removal for colloidal nanoparticles from solution synthesis: the effect on catalytic performance. ACS Catal. 2, 1358–1362 (2012).
Article CAS Google Scholar
Deng, Y. & Ezyske, C. M. Sulfate radical-advanced oxidation process (SR-AOP) for simultaneous removal of refractory organic contaminants and ammonia in landfill leachate. Water Res. 45, 6189–6194 (2011).
Article CAS PubMed Google Scholar
Acero, J. L., Stemmler, K. & Gunten, U. Degradation kinetics of atrazine and its degradation products with ozone and OH radicals: a predictive tool for Drinking water treatment. Environ. Sci. Technol. 34, 591–597 (2000).
Article ADS CAS Google Scholar
Altenburger, R., Scholz, S., Schmitt-Jansen, M., Busch, W. & Escher, B. I. Mixture toxicity revisited from a toxicogenomic perspective. Environ. Sci. Technol. 46, 2508–2522 (2012).
Article ADS CAS PubMed Google Scholar
Xu, K. Nonaqueous liquid electrolytes for lithium-based rechargeable batteries. Chem. Rev. 104, 4303–4418 (2004).
Article CAS PubMed Google Scholar
Xu, K. Electrolytes and interphases in li-ion batteries and beyond. Chem. Rev. 114, 11503–11618 (2014).
Article CAS PubMed Google Scholar
Meng, Y. S., Srinivasan, V. & Xu, K. Designing better electrolytes. Science 378, 3750 (2022).
Article Google Scholar
Annevelink, E. et al. Automat: Automated materials discovery for electrochemical systems. MRS Bull. 47, 1036–1044 (2022).
Article ADS Google Scholar
Dave, A. et al. Autonomous Discovery of Battery Electrolytes with Robotic Experimentation and Machine Learning. Cell Rep. Phys. Sci. 1, 100264 (2020).
Article CAS Google Scholar
Dave, A. et al. Autonomous optimization of non-aqueous li-ion battery electrolytes via robotic experimentation and machine learning coupling. Nat. Commun. 13, 5454 (2022).
Article ADS CAS PubMed Central PubMed Google Scholar
Yao, N. et al. An atomic insight into the chemical origin and variation of the dielectric constant in liquid electrolytes. Angew. Chem. Int. Ed. 60, 21473–21478 (2021).
Article CAS Google Scholar
Zhang, Y., Bier, I. & Viswanathan, V. Predicting Electrolyte Conductivity Directly from Molecular-Level Interactions. ACS Energy Lett. 7, 4061–4070 (2022).
Article CAS Google Scholar
Redlich, O. & Kister, A. T. Algebraic Representation of Thermodynamic Properties and the Classification of Solutions. Ind. Eng. Chem. 40, 345–348 (1948).
Article Google Scholar
Arrhenius, S. Über die dissociationswärme und den einfluss der temperatur auf den dissociationsgrad der elektrolyte. Z. f.ür. physikalische Chem. 4, 96–116 (1889).
Article Google Scholar
Butler, K. T., Davies, D. W., Cartwright, H., Isayev, O. & Walsh, A. Machine learning for molecular and materials science. Nature 559, 547–555 (2018).
Article ADS CAS PubMed Google Scholar
Yao, Z. et al. Machine learning for a sustainable energy future. Nat. Rev. Mater. 8, 202–215 (2023).
Article ADS PubMed Google Scholar
Pablo-García, S. et al. Fast evaluation of the adsorption energy of organic molecules on metals via graph neural networks. Nat. Comput. Sci. 3, 433–442 (2023).
Article PubMed Central PubMed Google Scholar
Levin, I., Liu, M., Voigt, C. A. & Coley, C. W. Merging enzymatic and synthetic chemistry with computational synthesis planning. Nat. Commun. 13, 7747 (2022).
Article ADS CAS PubMed Central PubMed Google Scholar
Goldman, S. et al. Annotating metabolite mass spectra with domain-inspired chemical formula transformers. Nat. Mach. Intell. 5, 965–979 (2023).
Article Google Scholar
Chen, C., Zuo, Y., Ye, W., Li, X. & Ong, S. P. Learning properties of ordered and disordered materials from multi-fidelity data. Nat. Computational Sci. 1, 46–53 (2021).
Article Google Scholar
Chen, C. & Ong, S. P. A universal graph deep learning interatomic potential for the periodic table. Nat. Computational Sci. 2, 718–728 (2022).
Article Google Scholar
Bilodeau, C. et al. Machine learning for predicting the viscosity of binary liquid mixtures. Chem. Eng. J. 464, 142454 (2023).
Article CAS Google Scholar
Jirasek, F., Bamler, R. & Mandt, S. Hybridizing physical and data-driven prediction methods for physicochemical properties. Chem. Commun. 56, 12407–12410 (2020).
Article CAS Google Scholar
Jirasek, F. et al. Making thermodynamic models of mixtures predictive by machine learning: matrix completion of pair interactions. Chem. Sci. 13, 4854–4862 (2022).
Article CAS PubMed Central PubMed Google Scholar
Greenman, K. P., Green, W. H. & Gómez-Bombarelli, R. Multi-fidelity prediction of molecular optical peaks with deep learning. Chem. Sci. 13, 1152–1162 (2022).
Article CAS PubMed Central PubMed Google Scholar
Kim, S. C. et al. Data-driven electrolyte design for lithium metal anodes. Proc. Natl Acad. Sci. USA 120, 2214357120 (2023).
Article Google Scholar
Bradford, G. et al. Chemistry-informed machine learning for polymer electrolyte discovery. ACS Cent. Sci. 9, 206–216 (2023).
Article CAS PubMed Central PubMed Google Scholar
Schoenholz, S., Cubuk, E.D. Jax md: A framework for differentiable physics. In: Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M.F., Lin, H. (eds.) Advances in Neural Information Processing Systems, vol. 33, pp. 11428–11441. Curran Associates, Inc. https://proceedings.neurips.cc/paper_files/paper/2020/file/83d3d4b6c9579515e1679aca8cbc8033-Paper.pdf (2020).
Mann, S. et al. tho PV: An end-to-end differentiable solar-cell simulator. Comput. Phys. Commun. 272, 108232 (2022).
Article CAS Google Scholar
Kasim, M. F. & Vinko, S. M. Learning the exchange-correlation functional from nature with fully differentiable density functional theory. Phys. Rev. Lett. 127, 126403 (2021).
Article ADS CAS PubMed Google Scholar
Guan, P.-W. Differentiable thermodynamic modeling. Scr. Mater. 207, 114217 (2022).
Article CAS Google Scholar
Wang, W., Wu, Z., Dietschreit, J. C. B. & Gómez-Bombarelli, R. Learning pair potentials using differentiable simulations. J. Chem. Phys. 158, 044113 (2023).
Article ADS CAS PubMed Google Scholar
Shen, C. et al. Differentiable modelling to unify machine learning and physical models for geosciences. Nat. Rev. Earth Environ. 4, 552–567 (2023).
Article ADS Google Scholar
Guan, P.-W., Viswanathan, V. System and method for material modelling and design using differentiable models, PCT/US2022/041009 (2022).
Atz, K., Grisoni, F. & Schneider, G. Geometric deep learning on molecular representations. Nat. Mach. Intell. 3, 1023–1032 (2021).
Article Google Scholar
Bronstein, M. M., Bruna, J., LeCun, Y., Szlam, A. & Vandergheynst, P. Geometric deep learning: Going beyond euclidean data. IEEE Signal Process. Mag. 34, 18–42 (2017).
Article ADS Google Scholar
Morris, C. et al. Weisfeiler and leman go neural: higher-order graph neural networks. Proc. AAAI Conf. Artif. Intell. 33, 4602–4609 (2019).
Google Scholar
Baydin, A. G., Pearlmutter, B. A., Radul, A. A. & Siskind, J. M. Automatic differentiation in machine learning: a survey. J. Mach. Learn. Res. 18, 1–43 (2018).
MathSciNet Google Scholar
Thomas, E. R. & Eckert, C. A. Prediction of limiting activity coefficients by a modified separation of cohesive energy density model and UNIFAC. Ind. Eng. Chem. Process Des. Dev. 23, 194–209 (1984).
Article CAS Google Scholar
Siegel, D. J., Nazar, L., Chiang, Y.-M., Fang, C. & Balsara, N. P. Establishing a unified framework for ion solvation and transport in liquid and solid electrolytes. Trends Chem. 3, 807–818 (2021).
Article CAS Google Scholar
Xu, J. et al. Electrolyte design for Li-ion batteries under extreme operating conditions. Nature 614, 694–700 (2023).
Article ADS CAS PubMed Google Scholar
Garca-Coln, L. S., Castillo, L. F. & Goldstein, P. Theoretical basis for the vogel-fulcher-tammann equation. Phys. Rev. B 40, 7040–7044 (1989).
Article ADS CAS Google Scholar
Gasteiger, J., Groß, J., Günnemann, S. Directional message passing for molecular graphs. In: International Conference on Learning Representations (ICLR) (2020).
Gasteiger, J., Giri, S., Margraf, J.T., Günnemann, S. Fast and uncertainty-aware directional message passing for non-equilibrium molecules. In: Machine Learning for Molecules Workshop, NeurIPS (2020).
RDKit: Open-source Cheminformatics. http://www.rdkit.org.
Gering, K. L. Prediction of electrolyte conductivity: results from a generalized molecular model based on ion solvation and a chemical physics framework. Electrochim. Acta 225, 175–189 (2017).
Article CAS Google Scholar
Gering, K. L. Prediction of electrolyte viscosity for aqueous and non-aqueous systems: Results from a molecular model based on ion solvation and a chemical physics framework. Electrochim. Acta 51, 3125–3138 (2006).
Article CAS Google Scholar
Rogers, D. & Hahn, M. Extended-connectivity fingerprints. J. Chem. Inf. Modeling 50, 742–754 (2010).
Article CAS Google Scholar
Rahmanian, F. et al. Conductivity experiments for electrolyte formulations and their automated analysis. Sci. Data 10, 43 (2023).
Article CAS PubMed Central PubMed Google Scholar
Wu, X. et al. Effects of solvent formulations in electrolytes on fast charging of li-ion cells. Electrochim. Acta 353, 136453 (2020).
Article CAS Google Scholar
Wu, X. et al. Understanding the effect of salt concentrations on fast charging performance of li-ion cells. J. Power Sources 545, 231863 (2022).
Article CAS Google Scholar
Ottani, S., Comelli, F. & Castellari, C. Densities, viscosities, and excess molar enthalpies of propylene carbonate + anisole or + phenetole at (293.15, 303.15, and 313.15) K. J. Chem. Eng. Data 46, 125–129 (2001).
Article CAS Google Scholar
Comelli, F., Francesconi, R., Bigi, A. & Rubini, K. Excess molar enthalpies, molar heat capacities, densities, viscosities, and refractive indices of dimethyl sulfoxide + esters of carbonic acid at 308.15 K and atmospheric pressure. J. Chem. Eng. Data 51, 665–670 (2006).
Article CAS Google Scholar
Francesconi, R. & Comelli, F. Excess enthalpies and excess volumes of the liquid binary mixtures of propylene carbonate + six alkanols at 298.15 K. J. Chem. Eng. Data 41, 1397–1400 (1996).
Article CAS Google Scholar
Comelli, F., Francesconi, R. & Ottani, S. Excess molar enthalpies of binary mixtures containing propylene carbonate + 23 alkanoates at 298.15 K. J. Chem. Eng. Data 43, 333–336 (1998).
Article CAS Google Scholar
Chen, F. et al. Density, viscosity, speed of sound, excess property and bulk modulus of binary mixtures of γ-butyrolactone with acetonitrile, dimethyl carbonate, and tetrahydrofuran at temperatures (293.15 to 333.15) K. J. Mol. Liq. 209, 683–692 (2015).
Article CAS Google Scholar
Francesconi, R. & Comelli, F. Excess molar enthalpies, densities, and excess molar volumes of binary mixtures containing esters of carbonic acid at 298.15 and 313.15 K. J. Chem. Eng. Data 40, 811–814 (1995).
Article CAS Google Scholar
Lu, H., Wang, J., Zhao, Y., Xuan, X. & Zhuo, K. Excess molar volumes and viscosities for binary mixtures of γ-butyrolactone with methyl formate, ethyl formate, methyl acetate, ethyl acetate, and acetonitrile at 298.15 K. J. Chem. Eng. Data 46, 631–634 (2001).
Article CAS Google Scholar
Yang, C., Xu, W. & Ma, P. Excess molar volumes and viscosities of binary mixtures of dimethyl carbonate with chlorobenzene, hexane, and heptane from (293.15 to 353.15) K and at atmospheric pressure. J. Chem. Eng. Data 49, 1802–1808 (2004).
Article CAS Google Scholar
Roy, M. N., Sinha, B. & Dakua, V. K. Excess molar volumes and viscosity deviations of binary liquid mixtures of 1,3-Dioxolane and 1,4-Dioxane with Butyl acetate, butyric acid, butylamine, and 2-butanone at 298.15 K. J. Chem. Eng. Data 51, 590–594 (2006).
Article CAS Google Scholar
Muhuri, P. K., Das, B. & Hazra, D. K. Viscosities and excess molar volumes of binary mixtures of propylene carbonate with tetrahydrofuran and methanol at different temperatures. J. Chem. Eng. Data 41, 1473–1476 (1996).
Article CAS Google Scholar
Zhao, Y., Wang, J., Xuan, X. & Lu, J. Effect of temperature on excess molar volumes and viscosities for propylene carbonate + N,N-Dimethylformamide mixtures. J. Chem. Eng. Data 45, 440–444 (2000).
Article CAS Google Scholar
Weininger, D. Smiles, a chemical language and information system. 1. introduction to methodology and encoding rules. J. Chem. Inf. Comput. Sci. 28, 31–36 (1988).
Article CAS Google Scholar
DimeNet++ configuration file. https://github.com/gasteigerjo/dimenet/blob/master/config_pp.yaml [Accessed: April 27th, 2024] (2024).
PubChem 2-dimensional Structure of Lithium hexafluorophosphate. https://pubchem.ncbi.nlm.nih.gov/compound/23688915#section=2D-Structure&fullscreen=true [Accessed: April 26th, 2024] (2024).
Paszke, A. et al. Pytorch: An imperative style, high-performance deep learning library. In: Wallach, H., Larochelle, H., Beygelzimer, A., Alché-Buc, F., Fox, E., Garnett, R. (eds.) Advances in Neural Information Processing Systems, 32 Curran Associates, Inc. https://proceedings.neurips.cc/paper/2019/file/bdbca288fee7f92f2bfa9f7012727740-Paper.pdf (2019).
Fey, M., Lenssen, J.E. Fast graph representation learning with PyTorch Geometric. In: ICLR Workshop on Representation Learning on Graphs and Manifolds (2019).
Zhu, S. et al. Differentiable Modeling and Optimization of Non-aqueous Li-based Battery Electrolyte Solutions Using Geometric Deep Learning. BattModels/DiffMix-NatCommData. https://doi.org/10.5281/zenodo.12682958 (2024).

Download references

Acknowledgements

We acknowledge funding from the Advanced Research Projects Agency-Energy (ARPA-E), U.S. Department of Energy, under Award Number DE-AR0001211. The views and opinions of authors expressed herein do not necessarily state or reflect those of the United States Government or any agency thereof. H.L., A.D., and V.V. acknowledge the support of Toyota Research Institute through the Accelerated Materials Design and Discovery program. S.Z. and V.V. acknowledge support from the Extreme Science and Engineering Discovery Environment (XSEDE) for providing computational resources, under Award Number TG-CTS180061. We also acknowledge Dr. Jay Whitacre, Dr. Andrew Li, Dr. Lei Zhang for their helpful discussions.

Author information

Pin-Wen Guan
Present address: Sandia National Laboratories, Livermore, USA

Authors and Affiliations

Department of Mechanical Engineering, Carnegie Mellon University, Pittsburgh, USA
Shang Zhu, Emil Annevelink, Hongyi Lin, Adarsh Dave, Pin-Wen Guan & Venkatasubramanian Viswanathan
Department of Mechanical Engineering, University of Michigan, Ann Arbor, USA
Shang Zhu, Hongyi Lin & Venkatasubramanian Viswanathan
Deep Forest Sciences, Palo Alto, USA
Bharath Ramsundar
Energy Storage & Technology, Idaho National Laboratory, Idaho Falls, USA
Kevin Gering
Department of Aerospace Engineering, University of Michigan, Ann Arbor, USA
Venkatasubramanian Viswanathan

Authors

Shang Zhu
View author publications
Search author on:PubMed Google Scholar
Bharath Ramsundar
View author publications
Search author on:PubMed Google Scholar
Emil Annevelink
View author publications
Search author on:PubMed Google Scholar
Hongyi Lin
View author publications
Search author on:PubMed Google Scholar
Adarsh Dave
View author publications
Search author on:PubMed Google Scholar
Pin-Wen Guan
View author publications
Search author on:PubMed Google Scholar
Kevin Gering
View author publications
Search author on:PubMed Google Scholar
Venkatasubramanian Viswanathan
View author publications
Search author on:PubMed Google Scholar

Contributions

S.Z. and V.V. designed research; S.Z., B.R. E.A., P.-W.G., and V.V., contributed to the conceptualization and methodology of the DiffMix framework; S.Z. implemented the algorithms; H.L. and A.D. designed and performed the experiments; S.Z., B.R. E.A., H.L., A.D, P.-W.G., K.G., and V.V. analyzed data and wrote the paper.

Corresponding author

Correspondence to Venkatasubramanian Viswanathan.

Ethics declarations

Competing interests

V.V., S.Z., and B.R. are inventors on a U.S. provisional patent application (application no. 63/525,925), related to predicting and optimizing mixture properties by geometric deep learning. P.-W.G. and V.V. are inventors on a patent application (US Patent Application No. 18/576,023, International Patent Application No. PCT/US2022/041009, published as WO2023034051A1), related to system and method for material modeling and design using differentiable models. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Leiting Zhang, who co-reviewed with Jackie T. Yik; and the other, anonymous, reviewer for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Zhu, S., Ramsundar, B., Annevelink, E. et al. Differentiable modeling and optimization of non-aqueous Li-based battery electrolyte solutions using geometric deep learning. Nat Commun 15, 8649 (2024). https://doi.org/10.1038/s41467-024-51653-7

Download citation

Received: 15 November 2023
Accepted: 12 August 2024
Published: 05 October 2024
DOI: https://doi.org/10.1038/s41467-024-51653-7

This article is cited by

A predictive machine learning force-field framework for liquid electrolyte development
- Sheng Gong
- Yumin Zhang
- Liang Xiang
Nature Machine Intelligence (2025)
Active learning accelerates electrolyte solvent screening for anode-free lithium metal batteries
- Peiyuan Ma
- Ritesh Kumar
- Chibueze V. Amanchukwu
Nature Communications (2025)
Chemical foundation model-guided design of high ionic conductivity electrolyte formulations
- Murtaza Zohair
- Vidushi Sharma
- Young-Hye La
npj Computational Materials (2025)