Introduction

The search for the ultimate exchange-correlation functional in density functional theory (DFT) is the most significant challenge for first-principles calculations in materials science. The flora of approximations was divided into rungs on Jacob’s ladder by Perdew and Schmidt1,2. Starting from the local density and generalized gradient approximation (LDA and GGA), the sophistication increases up to the highest chemical accuracy. At the top resides the random phase approximation (RPA), which considers exact exchange together with an approximate many-body treatment of correlations3,4,5,6,7. Being nonlocal, the latter also effectively includes dispersion interactions, while, for lower rungs, semiempirical dispersion corrections may be required. Despite the increased sophistication, the higher-level approximations do not always lead to improved results8,9,10. Evaluation against experiments is thus critical and has, conventionally, been performed for 0-K properties such as lattice constants and cohesive energies8,9,10,11,12,13,14,15,16,17,18,19,20,21, see Table 1. In a few cases, low-temperature approximations were used22, which, however, can lead to errors at elevated temperatures23. Even worse, dynamically unstable phases, such as titanium alloys, high entropy alloys, certain perovskites, and silica, cannot be treated at all.

Table 1 Ab initio studies evaluating the performance of the exchange-correlation treatment (mGGA=meta-GGA, hGGA=hybrid GGA; 0 K equilibrium properties: a0=lattice constant, B=bulk modulus, Ecoh=cohesion energy, Eform=formation energy; finite temperature (T) properties: Tq−c=transition temperature between β-quartz and β-cristobalite in SiO2, \({\bf{\Delta}}\)Sq−c=corresponding transition entropy)

Silica, SiO2, is often used as a model system by virtue of its technologically relevant polymorphs9,24. At higher temperatures, the β-phases are stable, i.e., β-quartz, β-cristobalite, and β-tridymite (P63/mmc-tridymite)25. These phases are dynamically stabilized and, thus, require explicit finite-temperature modeling. An additional challenge is the stringent requirement on the statistical and computational precision of the Gibbs energy differences when targeting phase transitions. The Gibbs energies of the β-phases are similar in magnitude and temperature dependence, so small entropy differences drive the transitions. Even minute inaccuracies of a few meV/atom can shift transition temperatures by several hundred K.

Here, we develop an approach (see Fig. 1) that facilitates the prediction of high-accuracy transition temperatures and entropies for all rungs of Jacob’s ladder. A key component is free-energy perturbation in the exchange-correlation space. The approach works genuinely at finite temperatures, enabling the treatment of dynamically stabilized phases with the same high precision as phases that are stable already at 0 K. We apply the approach to the β-phases of silica, focusing on the transition between β-quartz and β-cristobalite. We compute the transition temperature Tq−c and transition entropy \(\Delta\)Sq−c between these two phases for all rungs of Jacob’s ladder up to the RPA level. Analysis of the results provides a simple geometrical understanding of the phase-space differences between the different exchange-correlation treatments.

Fig. 1: Outline of the developed methodology.
Fig. 1: Outline of the developed methodology.
Full size image

Within the lower rungs of Jacob’s ladder of DFT, Helmholtz energy surfaces F(V, T) are computed, yielding the full set of thermodynamic properties (bottom panel). Precise free energy perturbation is then guaranteed by small second-order terms, making all rungs efficiently accessible at selected volume-temperature points (top panel). The correlation with the β-quartz–β-cristobalite Helmholtz energy difference \(\Delta\)Fq−c allows an accurate estimate of the transition temperature Tq−c and further transition characteristics.

Results

Direct upsampling for lower rung functionals

The proposed approach is based on the direct upsampling method26. Within the latter, machine-learning potentials (MLPs) are utilized to efficiently compute a high-precision Helmholtz energy surface with well-converged computational parameters (e.g., a dense volume-temperature mesh and supercells with several thousand atoms), isolating the inherent error due to the exchange-correlation approximation and enabling its evaluation. Direct upsampling can be applied well for rungs 1–3 from Jacob’s ladder. For the present case, we compute Helmholtz energy surfaces for the silica β-phases utilizing various functionals from rungs 1–3 (LDA27, GGA-PBE28, meta-GGA r2SCAN29 with and without dispersion corrections). The β-quartz LDA surface is shown in Fig. 2a as an example. The gray-shaded region marks the instability regime where the system transforms into α-quartz, highlighting the necessity of an explicit finite-temperature treatment. Various thermodynamic quantities can be extracted from the Helmholtz energy surface, e.g., the thermal expansion, as emphasized by the red line. Figure 2b compares the volume expansion for β-quartz and β-cristobalite within r2SCAN, including the D4 correction, with experiments marked by circles. The explicit finite-temperature results (solid lines) give a good prediction, while calculations for the symmetry-constrained idealized 0-K structures are meaningless (dashed lines). A comparison of various functionals from rungs 1–3 with experiments is provided in Supplementary Sec. S2. LDA shows a good agreement for the volume at Tq−c with a deviation below 1.5%, but fails to correctly predict the temperature dependence. The third rung meta-GGA functionals, including dispersion corrections, give the best results with deviations below 1%, qualitatively reproducing the temperature dependence.

Fig. 2: Calculated Helmholtz energy and volume within selected exchange-correlation functionals.
Fig. 2: Calculated Helmholtz energy and volume within selected exchange-correlation functionals.
Full size image

a Helmholtz energy surface for β-quartz within LDA. The red line marks the volume expansion at ambient pressure. The black dots indicate volume-temperature points for the calculation of the transition temperature, Tq−c, transition entropy, \(\Delta\)Sq−c, and equilibrium volume, Veq, for rungs 4 and 5 within the proposed approach. The purple crosses indicate the points used for training the machine-learning potential. b Volume expansion at ambient pressure of β-quartz and β-cristobalite within r2SCAN-D4. Finite-temperature results (solid lines) are compared with experimental data (circles)56,57,58,59,60,61 and results for symmetry-constrained idealized structures at 0 K (dashed lines). The gray areas mark the dynamical instability regime. See Supplementary Fig. S1 for volume-expansion and bulk-modulus results for various rung 1–3 functionals.

The crossing of the β-quartz and β-cristobalite Gibbs energies determines Tq−c. The Gibbs energies are obtained by a Legendre transformation of the Helmholtz energy surfaces. The resulting Tq−c’s from rungs 1–3 are compared in Fig. 3a with CALPHAD values representing experiments. The CALPHAD values vary marginally and we take the average for comparison. The spread of the ab initio predicted values is, on the other hand, significant. Without dispersion corrections, Tq−c is underestimated. LDA’s Tq−c is 251 K (22%) below CALPHAD, while PBE’s and r2SCAN’s predictions even fall out of the stability window of the phases. Including dispersion corrections shifts Tq−c upward, resulting in overestimations, e.g., 408 K (36%) for PBE-D3(BJ) or 594 K (52%) for r2SCAN-D4. The corresponding transition entropies \(\Delta\)Sq−c (differences between the slopes of the Gibbs energies at Tq−c) are shown in Fig. 3b. All rung 1–3 functionals underestimate \(\Delta\)Sq−c with the largest discrepancy of 56% for LDA.

Fig. 3: β-quartz to β-cristobalite transition properties.
Fig. 3: β-quartz to β-cristobalite transition properties.
Full size image

a Transition temperatures Tq−c and b Transition entropies \(\Delta\)Sq−c in SiO2 for rungs 1–3 functionals (green) compared with CALPHAD25,62,63 (gray). Extrapolated temperatures are indicated with brackets, and the CALPHAD average is marked with a dashed line.

Systematic approach for higher-rung predictions

In light of the discrepancies in predicting Tq−c and \(\Delta\)Sq−c observed for the functionals of rungs 1–3, an extension of the analysis to higher rungs is desirable. The difficulty is that the computational requirements for rungs 4 and 5 increase strongly. In the following, we describe a systematically extendable approach that overcomes these difficulties and enables—in successive steps—an accurate prediction of the transition temperature and entropy for rungs 4 and 5.

The approach builds on two main observations extracted from the wide range of results obtained for rungs 1–3. (1) The transition temperature Tq−c is well correlated with \(\Delta\)Fq−c, the Helmholtz energy difference between the two phases for a certain, specified condition. (2) Free energy perturbation can be utilized to efficiently obtain Helmholtz energy differences between the functionals. Using these observations, it is possible to drastically reduce the computational effort in predicting Tq−c and \(\Delta\)Sq−c within rungs 4 and 5. Observation (1) allows us to focus on a single volume-temperature point for each of the two phases for the Tq−c prediction (cf. Fig. 2a for β-quartz), reducing the number of calculations by about two orders of magnitude. The entropy \(\Delta\)Sq−c can be obtained by one additional point on each surface. Observation (2) dispenses with the necessity to fit MLPs for rungs 4 or 5. Instead, the optimized potentials from rungs 1–3 are used to generate suitable snapshots. Taken together, the prediction of Tq−c and \(\Delta\)Sq−c for rungs 4 or 5 can be obtained in a few hundred snapshots of supercells with about two hundred atoms.

The correlation between Tq−c and \(\Delta\)Fq−c is displayed in Fig. 4. The difference \(\Delta\)Fq−c (y-axis) is obtained as

$$\Delta {F}_{{\rm{q}}-{\rm{c}}}={F}_{{\rm{cristobalite}}}({V}_{{\rm{cristobalite}}}^{\exp },{T}_{{\rm{q}}-{\rm{c}}}^{\exp })-{F}_{{\rm{quartz}}}({V}_{{\rm{quartz}}}^{\exp },{T}_{{\rm{q}}-{\rm{c}}}^{\exp }),$$
(1)

where Fcristobalite and Fquartz correspond to the Helmholtz energies for a given exchange-correlation functional; further, \({T}_{{\rm{q}}-{\rm{c}}}^{\exp }\) is the experimental (or CALPHAD) transition temperature, and \({V}_{{\rm{cristobalite}}}^{\exp }\) and \({V}_{{\rm{quartz}}}^{\exp }\) are the experimental equilibrium volumes of the two phases at \({T}_{{\rm{q}}-{\rm{c}}}^{\exp }\). The Tq−c value (x-axis) corresponds to the self-consistently computed transition temperature of the respective functional. The such obtained (Tq−c, \(\Delta\)Fq−c)-pairs are shown by the dark green dots in Fig. 4. The good linear relation is quantified by the small standard deviation of the fit of 0.35 meV/atom, which propagates into an uncertainty of 55 K in the transition temperature prediction. Even the predictions from r2SCAN and PBE, which fall outside the stability regime of the phases, are reasonably captured by extrapolation. This extrapolative capacity is useful for locating functionals with low predictability of Tq−c, e.g., HSE06 (see Fig. 4). However, for the main purpose of locating functionals with high predictability, we rely only on the interpolative behavior of the linear relation. The best prediction is quantified by a vanishing \(\Delta\)Fq−c, which corresponds to the averaged CALPHAD transition temperature of 1137 K.

Fig. 4: Correlation between the predicted transition temperature and Helmholtz energy difference between β-quartz and β-cristobalite at \({T}_{{\rm{q}}-{\rm{c}}}^{\exp }\), and \({V}_{{\rm{cristobalite}}}^{\exp }\) and \({V}_{{\rm{quartz}}}^{\exp }\).
Fig. 4: Correlation between the predicted transition temperature and Helmholtz energy difference between β-quartz and β-cristobalite at 
                        $${T}_{{\rm{q}}-{\rm{c}}}^{\exp }$$
                        
                          
                            
                              T
                            
                            
                              q
                              −
                              c
                            
                            
                              exp
                            
                          
                        
                      , and 
                        $${V}_{{\rm{cristobalite}}}^{\exp }$$
                        
                          
                            
                              V
                            
                            
                              
                                c
                                r
                                i
                                s
                                t
                                o
                                b
                                a
                                l
                                i
                                t
                                e
                              
                            
                            
                              exp
                            
                          
                        
                       and 
                        $${V}_{{\rm{quartz}}}^{\exp }$$
                        
                          
                            
                              V
                            
                            
                              
                                q
                                u
                                a
                                r
                                t
                                z
                              
                            
                            
                              exp
                            
                          
                        
                      .
Full size image

The filled dark green circles show explicitly computed results and the line a linear fit of them. The hollow circles mark extrapolations out of the stability regime of the phases. The light green triangles mark predictions for rung 4 and 5 functionals (labeled in italics) utilizing the fit.

To obtain the transition entropy \(\Delta\)Sq−c, we utilize a finite difference,

$$\Delta {S}_{{\rm{q}}-{\rm{c}}}=(\Delta {F}_{{\rm{q}}-{\rm{c}}}-\Delta {F}_{{\rm{q}}-{\rm{c}}}^{{\prime} })/\Delta T,$$
(2)

where \(\Delta {F}_{{\rm{q}}-{\rm{c}}}^{{\prime} }\) is a second Helmholtz energy difference computed similarly as \(\Delta\)Fq−c in Eq. (1) but at a shifted temperature. The linear temperature dependence of the free energy difference between the two phases (see Suplementary Figs. S4, S5, S7) allows us to utilize a large enough temperature shift \(\Delta\)T (≈300 K) to guarantee a numerically stable evaluation of \(\Delta\)Sq−c.

To compute the Helmholtz energy for each of the phases in Eq. (1) for functionals of rung 4 or 5, we utilize free-energy perturbation in the exchange-correlation space (second observation). Specifically, at volume V and temperature T,

$$F(V,T)={F}^{\rm{ML}}(V,T;{\rm{xc}}^{1-3})+\Delta {F}^{\rm{up}}(V,T;{\rm{xc}}^{1-3}).$$
(3)

Here, FML(V, T; xc1–3) is the Helmholtz energy obtained within direct upsampling with an MLP fitted to an exchange-correlation (xc) functional from rung 1–3 and

$$\Delta {F}^{{\rm{up}}}(V,T;{{\rm{xc}}}^{{{1-3}}})=-{k}_{B}T\,{\rm{ln}}{\left\langle \exp \left(-\frac{\Delta E}{{k}_{B}T}\right)\right\rangle }_{{\rm{ML}}},$$
(4)

where \(\Delta\)E = EEML, with the energies E and EML calculated with a rung 4 or 5 functional and the xc1–3 MLP, respectively. The xc1–3 MLP is also used to sample the thermodynamic average in Eq. (4).

Two classes of functionals

In practice, the convergence behavior of the thermodynamic average with the number of snapshots is decisive. To quantify the convergence behavior, we make use of the second-order terms in the expansion of Eq. (4) (see Supplementary Sec. S3D). Generally, values less than 1 meV/atom allow for efficient use of Eq. (4) (around 100 snapshots for high convergence). Values for various combinations of functionals are given in the matrix in Fig. 5a. Two classes of functionals can be distinguished as highlighted by the two green areas identifying small second-order terms and thus a good overlap between the corresponding functionals in phase space. The first, larger class is composed of LDA, meta-GGA (r2SCAN), and the hybrid functional HSE06 with or without dispersion corrections. The second class consists of the GGA functionals (PBE with or without correction), vdW-DF-cx (classified as a nonlocal van der Waals functional30), and RPA.

Fig. 5: Upsampling performance between functionals.
Fig. 5: Upsampling performance between functionals.
Full size image

a Second-order terms for β-quartz / β-cristobalite, in meV/atom. The fields are colored according to the average value and qualitatively indicate the resemblance between the phase spaces of the reference and target functionals. b Correlation between average Si–O bond length and the energy difference between r2SCAN and RPA for β-quartz (blue circles). The black line shows a linear fit. Second-order terms from free-energy perturbation using full energy differences and residuals are written in the lower left corner. The values on the y-axis are offset with respect to the average energy. c Illustration of the Si–O bond length d and Si–O–Si bond angle θ in silica.

Interclass combinations of functionals show larger second-order terms (yellow to red areas in the matrix). The larger second-order terms can be understood by simple geometrical means. In Fig. 5b, we plot, as an example, the energy differences between r2SCAN (represented by the MLP) and RPA as a function of the average Si–O bond length (Fig. 5c). The correlation is good and, in particular, removing the spread in energies due to the bond-length dependence, in practice by subtracting the linear fit, leads to a second-order term of below 1 meV/atom (from −4.08 to −0.58 meV/atom). Si–O–Si bond angles are similarly well correlated as bond lengths—however, the combined account for bond lengths and angles does not improve the correlation, which shows that these two parameters are strongly correlated. The importance of bond lengths and angles has been realized since the early days of the development of classical potentials for SiO231. Here, we unveil how even fine nuances in the description of these coordinates (changes in the range of 0.01Å) affect the free-energy perturbation.

Transition properties for rung 4 and 5

We utilize the proposed approach to compute Tq−c and \(\Delta\)Sq−c for HSE06, HSE06-D4 (both rung 4), and RPA (rung 5) by upsampling, respectively, from r2SCAN for the first two and from PBE-D3(BJ) for RPA, chosen according to the smallest corresponding second-order term. The results are included in Fig. 4 by the bright green triangles. For the hybrid-GGA HSE06 functional, the extrapolated transition temperature lies far off, close to that of PBE. The D4 corrections strongly shift the transition temperature upward, resulting in an overestimation. The transition entropy is 0.060 kB and 0.084 kB, respectively. Overall, no improvement is seen compared to the rung 1–3 functionals. For RPA, our analysis (Supplementary Sec. S3C) shows that extremely well-converged computational settings are required, in particular for the plane wave cutoff (1000 eV), to obtain convergence. We also find that core polarization is important, shifting Tq−c up by 267 K. The final RPA transition temperature and entropy are 1081 K and 0.088 kB (underestimation of 5% and 25%), thus being closest to experiment among the investigated exchange-correlation treatments.

Discussion

The proposed approach is systematically extendable and—based on the provided data32 and clear-cut procedure, Supplementary Sec. S1 —straightforwardly available to the community. For example, in developing new functionals, Tq−c can be used as the first benchmark quantity. The set of dedicated snapshots (100 for each phase) along with the MLP energies for the various rung 1–3 functionals is available for download32. To evaluate a new functional, the energies for the snapshots are computed (with any independent code) and utilized in the free-energy perturbation formulas. The predictive capability of the functional can also be tested for the transition entropy by utilizing another set of snapshots. Further, we provide a third set of snapshots with which the equilibrium volume of the new functional at Tq−c can be predicted (cf. Fig. 2a and Secs. S1C, S2D). Eventually, at a larger computational cost, the full Helmholtz energy surface can be upsampled in the spirit of the direct-upsampling approach, giving access to various thermodynamic quantities (e.g., heat capacity, bulk modulus).

The observed and utilized linearity between Tq−c and \(\Delta\)Fq−c is closely related to the linear temperature dependence of \(\Delta\)Fq−c (Supplementary Figs. S4, S5, S7). A linear temperature dependence of free-energy differences between phases is well-known for many unary systems33. Within our approach, the validity of the linear assumption increases when the predictions of the evaluated exchange-correlation functional are close to experiment. Should an exchange-correlation functional yield a transition temperature strongly deviated from the experimental value, the assumption of linearity may be invalidated. A similar consideration applies to the fixed-volume assumption, which may be invalidated if the equilibrium volumes of the functional deviate strongly from experiment. In such cases, systematically adding further Helmholtz energy (V, T) points into the upsampling (as just mentioned above) relaxes the need for the linearity assumption.

We emphasize that the precision required for a meaningful comparison of the β-quartz–β-cristobalite transition exceeds the one typically achievable with MLPs (see, e.g., Supplementary Fig. S8). Therefore, even with MLPs trained on higher-rung data, inclusion of the free-energy perturbation term \(\Delta\)Fup in Eq. (3) remains essential to correct for the intrinsic errors of the MLPs.

The investigated SiO2 system entails various challenges (dynamical instability, small transition entropies, soft phonon modes34). The successful application of the introduced approach to SiO2 indicates that other material systems should be likewise treatable, thus providing a useful tool for exchange-correlation functional development and evaluation at finite temperatures.

One interesting future direction could be the combination of the presented approach with other high-accuracy methods such as CCSD(T)35, which achieve chemical accuracy of approximately 1 kcal/mol for a wide range of systems and are therefore regarded as the gold standard in computational chemistry. A practical implementation could be achieved together with the recently developed local CCSD(T) methods for periodic systems36,37 and with MLPs trained on CCSD(T) data38.

Methods

We computed densely sampled Helmholtz energy surfaces for 32 combinations of SiO2 phases and exchange-correlation functionals. Specifically, the following four phases were modeled:

  1. 1.

    β-quartz39

  2. 2.

    β-cristobalite40,41

  3. 3.

    P63/mmc-tridymite42

  4. 4.

    C2221-tridymite43

For each phase, eight different exchange-correlation functionals from rungs 1–3 of Jacob’s ladder, without and with dispersion corrections, were used:

$$\begin{array}{ll}\begin{array}{ll}1.\ {\rm{LDA}}\\ 2.\ {\rm{PBE}}\\ 3.\ {\rm{r}^{{\rm{2}}}SCAN}\end{array}\\\left.\begin{array}{ll}4.\ {\rm{PBE}\hbox{-}D3(BJ)}\\ 5.\ {\rm{vdW\hbox{-}DF\hbox{-}cx}}\\ 6.\ {\rm{r}^{{\rm{2}}}SCAN\hbox{-}D3(BJ)}\\ 7.\ {\rm{r}^{{\rm{2}}}SCAN\hbox{-}D4}\\ 8.\ {\rm{r}^{{\rm{2}}}SCAN\hbox{+}rVV10}\end{array}\quad\right\}\,{\rm{dispersion}}\ {\rm{corrected}}\end{array}$$

(References and abbreviation definitions are given below.) The full direct upsampling procedure was used to calculate Helmholtz energy surfaces for LDA, r2SCAN and PBE-D3(BJ). For all other functionals, we used the free-energy-perturbation step between exchange-correlation functionals. For rungs 1–3, we computed the full Helmholtz energy surfaces, while for the higher-rung HSE06 and RPA we used our developed procedure with a few selected volume-temperature points, as summarized in Fig. 1 and described in the Results [Eqs. (1)–(4) and related text].

Direct upsampling

The direct upsampling was performed in the extended version of ref. 44, due to the dynamical instability at 0 K of the investigated phases. In particular, effective equations of state were used as the basis for the finite-temperature calculations. These equations of state were adjusted to allow for reasonable thermodynamic integration. Further, an effective harmonic reference fitted to high temperatures was used as a reference for thermodynamic integration to a moment tensor potential (MTP)45, specifically trained for each phase. To ensure convergence with respect to supercell size, the thermodynamic integrations were performed with up to around six thousand atoms. All supercell sizes and the parameters of the effective equations of state, harmonic references and MTPs are detailed in Supplementary Sec. S3A.

The DFT calculations were performed with the projector-augmented wave (PAW) method46,47 as implemented in the Vienna ab initio simulation package (VASP)48. PBE-based PAW potentials were used for all but the LDA calculations, for which LDA-based potentials were applied. A valence-electron configuration of Si: 3s23p2; O: 2s22p4 was used. For the RPA calculations, the VASP _GW PBE PAW potentials were used, as well as an additional PAW potential for the Si core-polarization correction, see Supplementary Sec. S3C. The utilized functionals from rungs 1–3 follow:

  • LDA, local density approximation27

  • GGA-PBE, generalized gradient approximation parametrized by Perdew, Burke, Ernzerhof28

  • r2SCAN, regularized-restored strongly constrained and appropriately normed meta-GGA functional29

  • vdW-DF-cx, nonlocal van der Waals density functional with consistent exchange30 GGA-PBE-D3(BJ)49

  • r2SCAN-D3(BJ)49,50

  • r2SCAN-D450

  • r2SCAN+rVV1051

where

  • D3(BJ), van der Waals interaction of Grimme with Becke–Johnson damping49

  • D4, van der Waals interaction of Grimme52

  • rVV10, revised nonlocal van der Waals density functional53

From rungs 4 and 5, we used:

  • HSE06, Heyd-Scuseria-Ernzerhof hybrid functional54

  • HSE06-D452,55

  • RPA, random phase approximation within the adiabatic connection fluctuation-dissipation theorem3,4

Detailed DFT parameters and convergence checks are presented in Supplementary Secs. S3A and S3B.

Random phase approximation

Due to the significant computational requirements of RPA calculations, a special procedure was devised to obtain the RPA results. To compute the RPA upsampling energy \(\Delta\)Fup, around 50 snapshots are needed to provide a good statistical convergence. The size of the supercell for the upsampling is around 200 atoms, specifically 243 atoms for β-quartz and 192 atoms for β-cristobalite. In order for such calculations to be computationally feasible, we compute, in the first step of the procedure, \(\Delta {F}_{{\rm{normal}}}^{{\rm{up}}}\) with a set of normally converged parameters. The requirement on these parameters is to provide converged second-order and higher-order terms of the free-energy perturbation. In the second step, we add a first-order correction calculated with a smaller snapshot (9/24 atoms for β-quartz/β-cristobalite). Using a smaller snapshot allows us to use highly converged computational parameters that ensure the highest precision, also with respect to the relative stability between the phases. This first-order correction is strictly valid when the energy of the smaller snapshot is converged with respect to the snapshot average energy. The exact parameters used, resulting intermediate energies and detailed convergence tests can be found in Supplementary Sec. S3C. Ultimately, the upsampling Helmholtz energy is obtained as

$$\Delta {F}^{{\rm{up}}}=\Delta {F}_{{\rm{normal}}}^{{\rm{up}}}+{E}_{{\rm{high}}}-{E}_{{\rm{normal}}},$$
(5)

where Ehigh is the highly converged energy per atom of the small snapshot, and Enormal is the energy per atom of the small snapshot scaled to 243 or 192 atoms and with identical parameters to the upsampling.