Introduction

In high-entropy alloys (HEAs)1,2,3, multiple metallic elements are combined in nearly equal concentrations. This often leads to the stabilization of crystalline phases in which elements are distributed throughout the alloy in a nearly random fashion—namely, solid solution phases. This class of alloys has attracted substantial interest due to their mechanical properties. For example, extraordinary fracture resistance was observed in CrCoNi4,5 as the result of an unusual synergy of deformation mechanisms involving stacking-fault formation and phase transitions, as well as the conventional gliding of dislocations.

It has been established that chemical short-range order (SRO)—i.e., the tendency of solid solutions to not be completely random—affects various chemistry–microstructure relationships that influence mechanical properties. For example, SRO has been shown to affect dislocation mobility6,7,8, grain boundaries9,10,11, stacking-fault energy12,13,14,15, and phase stability16. Consequently, significant experimental efforts have been made to characterize SRO and its effects on materials properties11,13,14,17,18,19,20,21,22. Connecting computational results to such experiments requires high-fidelity physical models capable of capturing the intricate nature of chemical bonds leading to SRO, while also accounting for the complexity of chemical motifs in HEAs23,24,25.

In a previous work (ref. 23), we have demonstrated that small-scale atomistic simulations, i.e., sizes typical of density-functional theory (DFT) calculations, are inadequate to properly capture SRO, leading to errors of up to 25% in the prediction of Warren–Cowley parameters. In the same work, an approach for training machine learning interatomic potentials (MLPs) was demonstrated to capture SRO while simultaneously leading to an improvement in energy accuracy when compared to the state-of-the-art. Yet, fundamentally, such an approach consisted of a set of heuristics on the construction of the MLP, i.e., a reasonable and practical approach for the construction of training sets without rigorous justification. Here, we build on these results and systematically derive the design principles for the rational construction of MLPs that capture SRO.

While much of the work on MLPs for HEAs has focused on their energy accuracy over test data sets7,26,27,28,29,30,31,32,33,34,35,36,37,38, here we focus instead on their performance in reproducing SRO and its effects on materials quantities of relevance for mechanical properties. We systematically augment an initially simple MLP training set while tracking the associated effects on the SRO of the crystal and liquid phases, stacking-fault energy, and phase stability. It is demonstrated that energy accuracy on test data sets often does not correlate with accuracy in capturing such material properties, which are fundamental in enabling large-scale atomistic simulations of HEAs with high physical fidelity.

Results

Training strategy to manage chemical complexity

In order to develop the design principles for capturing SRO, we focus on the face-centered cubic (fcc) solid-solution phase of the paradigmatic CrCoNi alloy4,5,6,12,13,14,21,39,40,41,42. The MLP model chosen is the Moment Tensor Potential (MTP)26 with a radial cutoff of 5 Å, which corresponds to a distance between the 3rd and 4th coordination shell of CrCoNi. In the absence of any other criteria to guide our initial choice of ML model, we have chosen to employ MTP due to its superior performance in energy and force errors for single-element systems compared to other MLP models (as demonstrated by an independent assessment in ref. 43). An a posteriori analysis described in Supplementary Section 1 shows that MTP also has superior performance for various material properties of the CrCoNi alloy when compared to a few other ML models.

In ref. 23, we demonstrated that the first coordination shell in CrCoNi can be chemically decorated in 36,333 unique configurations (i.e., chemical motifs). The relative energy of these chemical motifs affects the frequency with which they are observed in the alloy—which is the fundamental origin of SRO—making this an important property to be reproduced by MLPs in order to capture SRO. Yet, while the first coordination shell is the dominating term in atomic interactions, many-body contributions from higher coordination shells are not negligible and must be accounted in the development of MLPs26,27,28,44,45,46. Extending the counting of unique chemical motifs to the second and third coordination shells results in ~2.5 × 107 and 6.8 × 1018 unique motifs, respectively. Comparing these numbers with the 651 independent parameters of the MLP model with the highest capacity employed here makes it clear that the chemical complexity of this magnitude leads to a landscape with many nearly degenerate minima for the MLP fitting. Here, this chemical complexity is considered by employing an ensemble training approach23,33,35: multiple potentials are fitted under identical training conditions. Variations in performance among the resulting potentials due to the nearly degenerate minima landscape are explicitly evaluated against materials properties related to SRO, including the role of the model capacity (i.e., number of independent parameters).

In the following sections, we gradually build towards a final training set by systematically augmenting an initially simple training set while evaluating the effect of the modifications on associated material properties. In order to focus on the role of chemical complexity, we employ a standard approach for accounting for thermal vibrations and thermal expansion in all potentials (described in the “Methods” section). Throughout this process, the performance is also compared to a benchmark training set referred to as “TS-0”, which was first introduced in ref. 23 as “training set without chemical sampling”. This training set was built by adapting the popular approach introduced in ref. 36 for bcc NbMoTaW to CrCoNi, which includes perfect and distorted ground state structures, slab structures, and molecular dynamics structures spanning single-element, binary, and ternary element systems. While the approach in ref. 36 was not developed with the intent of capturing SRO; it is one of the most comprehensive and popular approaches for constructing training sets for HEAs7,37, which warranted its choice as a benchmark data set.

Chemical SRO in the crystal phase

Quantification of SRO in the crystalline phase can be performed by evaluating the Warren–Cowley (WC) parameters:

$${\alpha }_{ij}=1-\frac{p(i| j)}{{c}_{i}},$$
(1)

where i and j refer to any of the three chemical elements in the alloy, ci is the average concentration of i-type atoms, and p(ij) is the conditional probability of finding a i-type atom in the first coordination shell of an j-type atom. The effectiveness of MLPs in capturing SRO will be quantified by comparing WC parameters against those obtained through DFT Monte Carlo simulations. In Fig. 1a, we show that a popular interatomic potential6 (embedded-atom model, or EAM) for CrCoNi is not capable of reproducing WC parameters. Similarly, TS-0 also falls short of reproducing DFT results within the statistical accuracy, despite resulting in a considerable improvement in comparison with EAM.

Fig. 1: Capturing chemical SRO in the crystal phase.
figure 1

a Comparison of predicted WC parameters (Eq. (1)) against DFT Monte Carlo (literature values from refs. 12,42). The EAM potential is from ref. 6, while the training sets for potentials TS-0, TS-1, and TS-2 are summarized in Table 1. Error bars are the standard error of the mean from an ensemble of 20 independent potentials. b Illustration of the intermediary configurations extracted from DFT Monte Carlo to train TS-2. c Relative error with respect to DFT (Eq. (2)) as a function of the MLP model capacity. Error bars are the 95% confidence interval from an ensemble of 20 independent potentials.

We turn now to the construction of our initial training set, named TS-1. This training set is composed of chemically random and equiatomic fcc supercells with 108 atoms, adding up to 54,540 atoms (as summarized in Table 1). Despite its simplicity, it is clear from Fig. 1a that TS-1 outperforms TS-0 for all WC parameters, which can only be attributed to the more extensive sampling of the chemical space performed in TS-1 when compared to TS-0. Despite this encouraging result, note once again that the chemical space is enormous compared to the number of independent parameters in the MLP model. Thus, it is reasonable to expect that sampling the chemical space with an approach that is better than random will lead to improved performance. We propose to accomplish this by substituting the chemically random configurations of TS-1 with intermediary configurations from DFT Monte Carlo simulations, as illustrated in Fig. 1b. We name this training set TS-2. The configurations along the Monte Carlo trajectory exhibit an increasing amount of SRO with small energetic differences among them, all associated with changes in local chemical motifs. They function as a guide for the MLP, nudging it to capture the most relevant regions of the chemical space (i.e., regions that show up frequently due to SRO, as shown in refs. 23,47), as well as the trajectory to arrive at SRO from an initially random solid solution. It can be seen in Fig. 1a that TS-2 is able to reproduce all WC parameters predicted by DFT within the statistical accuracy.

Table 1 Summary of the contents of each MLP training set

The performance of each MLP in Fig. 1a can be summarized by evaluating the relative error with respect to DFT:

$${\varepsilon }^{{\rm{SRO}}}=\frac{\mathop{\sum }\nolimits_{i=1}^{3}\mathop{\sum }\nolimits_{j=i}^{3}\left\vert {\alpha }_{ij}^{{\rm{MLP}}}-{\alpha }_{ij}^{{\rm{DFT}}}\right\vert }{\mathop{\sum }\nolimits_{i=1}^{3}\mathop{\sum }\nolimits_{j=i}^{3}\left\vert {\alpha }_{ij}^{{\rm{DFT}}}\right\vert }.$$
(2)

This relative error is shown in Fig. 1c for all three training sets as a function of the MLP model capacity (i.e., number of independent parameters), where it can be seen that the observations above regarding the superior performance of TS-2 relative to TS-0 and TS-1 hold for any model capacity. More importantly, note how TS-2 has significantly more stable behavior against random weight initialization for training, which is shown in Fig. 1c by an ensemble standard deviation of 3% at levmax = 20 compared to 29% for TS-0 and 13% for TS-1. Given the simplicity and similarity of the training sets for TS-1 and TS-2, this smaller variation within the ensemble can only be attributed to the extensive chemical sampling and the targeted sampling of motifs of relevance for SRO.

Liquid stability and SRO

The liquid phase is also afflicted by the considerable chemical complexity observed in the crystal phase. The concept of SRO becomes even more complex in the liquid because chemical and structural SRO are both present. This can be observed in the partial radial distribution functions (pRDFs) obtained from DFT molecular dynamics simulations at 2684 K, shown in Fig. 2a: the Cr–Cr and Cr–Co peaks are notably lower than other peaks, indicating that these chemical pairs are energetically unfavorable (i.e., chemical SRO). An entanglement between chemical and structural SRO leads to the variations of the first coordination shells in Fig. 2a.

Fig. 2: Capturing chemical and structural SRO in the liquid phase.
figure 2

a Partial radial distribution function (pRDF) of the liquid phase at 2684 K from DFT. b Comparison of pRDF using TS-3 (colored lines) against DFT values (black lines). c Relative error with respect to DFT (Eq. (3)) in the prediction of structural and chemical SRO in the liquid as a function of MLP model capacity. d Fraction of potentials in the ensemble with a stable liquid phase. Blue bars are for TS-0 and pink bars are for TS-3. e Distribution of nearest-neighbor distance for each atom in TS-0 and TS-3. Inset shows the breakdown of nearest-neighbor distance in TS-3 by phase. The success of TS-3 in reproducing a stable liquid phase is attributed to the inclusion of crystal configurations with atom pairs at much closer distances than TS-0.

The effects observed in Fig. 2a are addressed by augmenting TS-2 with configurations obtained from DFT molecular dynamics simulations of the liquid phase at 1800 K. The resulting training set—named TS-3 and summarized in Table 1—is able to capture the chemical and structural SRO of the pRDFs, as shown in Fig. 2b: the first coordination shell peak heights are in agreement within ± 4.9%, while the major discrepancy in the first coordination shell peak location being −0.12 Å for Cr–Cr pairs.

The performance in capturing chemical and structural SRO in the liquid can be summarized by evaluating the relative error in the absolute differences between the pRDFs with respect to DFT, similarly to what was done for the WC parameters in Eq. (2):

$${\varepsilon }^{{\rm{pRDF}}}=\frac{\mathop{\sum }\nolimits_{i=1}^{3}\mathop{\sum }\nolimits_{j=i}^{3}\mathop{\int}\nolimits_{0}^{{r}_{\max }}\left\vert {g}_{ij}^{{\rm{MLP}}}(r)-{g}_{ij}^{{\rm{DFT}}}(r)\right\vert {\rm{d}}r}{\mathop{\sum }\nolimits_{i=1}^{3}\mathop{\sum }\nolimits_{j=i}^{3}\mathop{\int}\nolimits_{0}^{{r}_{\max }}\left\vert {g}_{ij}^{{\rm{DFT}}}(r)\right\vert {\rm{d}}r},$$
(3)

where gij(r) is the pRDF between chemical elements i and j, and \({r}_{\max }=3.5\) Å is the extension of the first coordination shell (i.e., total RDF minimum between first and second peaks), which was chosen to evaluate only the short-range part of chemical and structural ordering. Using this approach one can see in Fig. 2c that TS-3 has better performance than TS-0, which is unexpected because TS-0 is trained with configurations extracted from the same simulation employed to collect the pRDF statistics in Fig. 2b (i.e., at 2684 K) while TS-3 configurations were collected at 1800 K. Notice that one is unable to detect this performance difference in reproducing materials properties by evaluating only the energy root-mean-square error: TS-0 and TS-3 result in 5.3 and 5.6 meV/atom, respectively, at 1800 K, and 7.6 and 6.6 meV/atom at 2684 K.

Another important consequence of chemical complexity is the fact that a large fraction of the ensemble of MLPs obtained with TS-0 were unstable in the liquid phase, i.e., simulations of the liquid phase with these potentials quickly encountered configurations with unphysically large values for forces and energies that led the simulation to fail. Notice that this behavior is observed despite the inclusion of liquid configurations in the training set of TS-0 potentials. The fraction of potentials in the ensemble with unstable liquid phase is shown in Fig. 2d as a function of the model capacity. Note how for levmax = 12 and 14, only one out of the five potentials trained is stable. In the same figure, it can be seen that TS-3 never results in such instabilities.

We were able to track down the source of TS-3 success in stabilizing the liquid to the inclusion of crystal configurations with atomic pair distances much closer together than those observed in TS-0, as can be seen in the histogram of Fig. 2e (see Supplementary Section 2 for a detailed breakdown of this analysis). This observation is explained by further considering the role of chemical complexity as follows. Despite its disordered structure, the liquid has well-defined structural SRO48,49 (e.g., the three coordination shells clearly visible in Fig. 2b), leading to a similar degree of chemical complexity as the crystalline phase23. Yet, differently from the crystalline phase, atoms in the liquid are frequently diffusing around, which requires going through activated transition states with higher potential energy due to the close proximity of atoms50,51. Such transition states are considered “rare events” in the course of a simulation, such as the ones used to extract configurations for TS-0 and TS-3, because they occur only every many timesteps for each atom. It is suspected that the inclusion of crystal configuration with close pair distances (Fig. 2e) in TS-3 induces the learning of the energetics of such transition states in between chemical motifs, rendering the liquid phase stable.

Stacking-fault energy and phase stability

Mechanical properties of fcc alloys are closely linked to the (111) stacking-fault energy (γsf) and the relative stability of the fcc phase with respect to the hexagonal close-packed (hcp) phase52,53,54E = EfccEhcp). For example, engineering of phase metastability with ΔE enables transformation-induced plasticity55, while lowering γsf is associated with a change in plasticity mechanism from dislocation slip to twinning. Notably, the exceptional damage tolerance4 of CrCoNi is the result of an unusual synergy between the deformation mechanisms described above.

Figure 3 shows that TS-0 is not capable of reproducing γsf or ΔE. Our strategy for capturing these two quantities with MLPs is centered around the well-established correlation between them: a fcc-to-hcp transition can be accomplished by the successive introduction of stacking faults on every other (111) plane. Thus, it is reasonable to expect that training with configurations of one of these structures (i.e., stacking faults or hcp) is enough to learn the other. We chose to employ hcp configurations instead of stacking faults because they are straightforward to simulate. Two training sets were created to compare the performance of MLPs trained with and without hcp configurations. The first training set (TS-4) includes only fcc configurations and is similar to TS-2 in all aspects except for its total size, which is reduced by 32.7% with respect to TS-2 (as shown in Table 1). This reduction is performed with the goal of accommodating an equivalent amount of hcp configurations in the second training set (TS-5), thereby achieving parity between fcc and hcp configurations while maintaining a moderate training set total size. With TS-4 and TS-5 we seek to establish the importance of hcp configurations in capturing γsf and ΔE; the liquid phase was intentionally left out of both training sets (i.e., TS-4 and TS-5) to avoid interference with this test.

Fig. 3: Capturing SRO effects on (111) stacking-fault energy (γsf) and fcc–hcp phase stability (ΔE = EfccEhcp).
figure 3

Comparison of γsf and ΔE against DFT results for a random solid solutions and b solid solutions in thermal equilibrium (i.e., with appropriate SRO as obtained through Monte Carlo simulations). The blue arrow indicates the improvements in both quantities by augmenting TS-4 with hcp configurations. Error bars are the standard error of the mean from independent Monte Carlo simulations. Figures at the bottom indicate the density distribution of γsf due to chemical fluctuations and SRO.

Figure 3a compares γsf and ΔE against DFT for a random solid solution at 500 K, where it can be seen that the introduction of the hcp phase is fundamental in capturing the correct value of γsf and ΔE. Yet, SRO in CrCoNi leads to an increase in γsf12; thus, we also compare γsf and ΔE for a solid solution in thermal equilibrium at 500 K in Fig. 3b. While the agreement in Fig. 3b might seem obvious in light of Fig. 3a, we warn that such anticipation is not warranted. Figure 3b is a much more stringent test than simply capturing the energetics of stacking faults and crystalline phases, which MLPs are well-known to be capable of capturing56,57,58,59,60. Instead, Fig. 3b demonstrates that TS-5 is capable of capturing the correct SRO and its effects on materials properties (i.e., γsf and ΔE). This is because independent Monte Carlo simulations for thermal equilibration were performed in each case (DFT, TS-4, and TS-5), leading to independent predictions of SRO configurations, which were then employed to evaluate γsf and ΔE.

Final training set

A final training set (TS-f) is constructed (see Table 1) to incorporate all elements leading to good performance on capturing SRO and its effects on the crystal phase, liquid phase, stacking faults, and phase stability. Its performance across the various material properties is evaluated through the following metric:

$${\varepsilon }^{{\rm{MLP}}}={\varepsilon }_{{\rm{fcc}}}^{{\rm{SRO}}}\times {\varepsilon }_{{\rm{hcp}}}^{{\rm{SRO}}}\times {\varepsilon }_{{\rm{liquid}}}^{{\rm{pRDF}}}\times {\varepsilon }^{{\rm{energy}}},$$
(4)

where \({\varepsilon }_{{\rm{fcc}}}^{{\rm{SRO}}}\) and \({\varepsilon }_{{\rm{hcp}}}^{{\rm{SRO}}}\) are the relative WC errors (Eq. (2)) for the fcc and hcp phases, \({\varepsilon }_{{\rm{liquid}}}^{{\rm{pRDF}}}\) is shown in Eq. (3), and εenergy is the energy root-mean-square error over a test set with a wide range of configurations, including random solid phases, thermally equilibrated solid solutions, and liquid phases (see the “Methods” section for a full description). The combined error metric εMLP is defined as the product of individual errors to fairly propagate relative uncertainties across different material properties while avoiding dependence on arbitrary normalization. The εMLP error as a function of the computational cost for different model capacities is shown in Fig. 4 along with the Pareto front. The large variation in εMLP performance within an ensemble with the same model capacity (i.e., number of independent parameters) is noteworthy (note the log-scale in the y-axis). For example, the best performing MLP with levmax = 14 has εMLP comparable to the second best out of the five MLPs in the ensemble with levmax = 20, despite the latter being ≈3.6 times more computationally expensive. Moreover, note how the potential with the lowest εenergy is often not the best performing potential for materials properties: εenergy only predicts the best performance in two out of the eight ensembles in Fig. 4.

Fig. 4: Final potential (TS-f) performance across materials properties.
figure 4

The best performing potential for a fixed computational cost falls on the Pareto front. Blue rectangles mark ensembles of identically trained potentials for different model capacities (i.e., \({{\rm{lev}}}_{\max }\)). Potentials with the lowest root-mean-square energy error (εenergy) within each ensemble are marked in green. The four ensembles on the left have levmax = 6, 8, 10, and 12. A breakdown of the MLP performance across individual error metrics is provided in Supplementary Section 3.

Material properties with size-converged SRO

Equipped with TS-f we now turn to demonstrate the effect of SRO on properties and scales that cannot be achieved in the absence of the MLP introduced here. We start by reproducing the observation, made in ref. 23, that DFT-sized calculations are not converged with respect to size and produce relative errors up to 40% in SRO because the SRO characteristic length scale can be as large as 25–30 Å. This can be seen in Fig. 5a, where the relative error of WC parameters with respect to a large calculation with 10,976 atoms is shown as a function of system size. Systems with as many as 2000 atoms (or, equivalently, linear dimensions of 28 Å) are required for size convergence, while DFT calculations for SRO investigations in the literature use one order of magnitude less atoms12,42. We further demonstrate here that similar size convergence is necessary for predicting material properties strongly influenced by SRO. As illustrated in Fig. 5c and d, DFT-size simulations yield significant errors of 10.7 mJ/m2 in stacking fault energy (γSF) and 2.84 meV/atom in phase stability (ΔE) under thermal equilibrium (i.e., with appropriate SRO as obtained through Monte Carlo simulations).

Fig. 5: Final potential (TS-f) prediction of material properties with size-converged SRO.
figure 5

a Relative error of WC parameters with system size (εsize), measured similarly to Eq. (2) but against the largest size prediction. Simulations with <2000 atoms are not converged with respect to system size. b Temperature dependence of WC parameters (Eq. (1)). c Temperature dependence of stacking-fault energy (γsf) for a random solid solution and a solid solution in thermal equilibrium (i.e., with appropriate SRO as obtained through Monte Carlo simulations). The “DFT-size” data point was evaluated with TS-f, but using a system with only 180 atoms, which is the typical size of DFT calculations (as indicated in Fig. 5a). d Temperature dependence of the fcc-hcp phase stability. See text for a full discussion of the effect of SRO and the phase transition at low temperatures leading to the kink at 600 K for solid solutions in thermal equilibrium.

The temperature dependence of the WC parameters was evaluated with calculations converged with respect to system size. Figure 5b shows that in general the WC parameters decrease in magnitude as the temperature increases, but this behavior is not monotonous for αCoCo, where an increase in magnitude with temperature is observed above 900 K. This is yet another evidence of the incompleteness of WC parameters in quantifying SRO: in ref. 23 we demonstrate that an appropriate and complete SRO metric shows smooth monotonic decrease in SRO with temperature.

The stacking-fault energy (γsf) dependence on temperature is also not trivial, as shown in Fig. 5c. Short-range order increases γsf at all temperatures with respect to a random solid solution, which is aligned with previous results12. This observation can be rationalized by the fact that stacking-fault creation in a thermally equilibrated solid solution requires the disruption of chemical motifs with lower energy than those encountered in a random solid solution. Yet, while the temperature effect on random solid solutions is a simple linear increase in γsf (due to thermal expansion), a solid solution in thermal equilibrium (i.e., with appropriate SRO as obtained through Monte Carlo simulations) presents a complicated interplay between SRO and temperature. At low temperatures, the contribution from SRO dominates and γsf decreases almost linearly with temperature, but as temperature increases above around 800 K, γsf displays a linear increase with temperature almost in parallel with the random solid solution. To our knowledge, this is the first report of the complex temperature dependence of γsf, despite its central role in rationalizing many of the mechanical properties of CrCoNi and other fcc HEAs.

The fcc-hcp relative phase stability (ΔE) is also evaluated with TS-f, as shown in Fig. 5d. In general, the presence of SRO further stabilizes the hcp phase at all temperatures, i.e., increases ΔE with respect to the random solid solution. Yet, the absolute value of ΔE decreases with increasing temperature, in agreement with previous first-principles calculations and thermodynamic models61,62, indicating that the fcc phase becomes the stable phase near the melting temperature16. At low temperatures, ΔE has a kink near 600 K for solid solutions in thermal equilibrium that is not present in random solid solutions, which is consistent with previous findings32,33,63 indicating a potential phase transition to an ordered phase at low temperatures.

Finally, the melting temperature of TS-f was evaluated using phase-coexistence molecular dynamics simulations. The agreement with experimental results is excellent, as shown in Table 2, and a marked improvement over existing interatomic potentials6.

Table 2 Comparison of melting temperature

Discussion

The most important design principle supported by the results presented here is that chemical complexity should be sampled extensively and biased towards chemical motifs of relevance for SRO. As shown in Fig. 1c, this is the most effective strategy to reduce the impact of the nearly-degenerate landscape of minima on MLP fitting, on which all other principles rely to perform well in capturing materials properties. A promising venue to amplify the benefits of this design principle will be its combination with compressed lower-dimensional descriptors of chemical information64,65,66,67,68, which were developed to alleviate the dramatic increase in MLP capacity required to accommodate an increasing number of chemical species. Since this design principle addresses chemical complexity, it is fully compatible with existing methods that optimize structural sampling and thermal fluctuations69,70.

Quantitative experimental characterization of SRO has not yet been achieved14,22,71,72,73, and the feasibility of employing SRO as a design feature for materials properties remains uncertain. In light of these observations, it is justifiable to question the relevance of SRO in the field of HEAs. One argument for its relevance is the potential ubiquity of SRO in various material properties and phenomena, as highlighted in ref. 63 by historical results in metallurgy and a review of the fundamental principles of clustering in simpler alloys. The results presented here introduce another argument for the relevance of SRO: the importance of chemical complexity in the development of high-fidelity atomistic physical models for solid solutions, as exemplified by the following design principles: the liquid phase is only rendered stable (Fig. 2) after careful consideration of its structural and chemical SRO and the inclusion of solid configurations with close atomic pair distances, which ought to also affect the solidification process (Table 2) and as-cast state of these alloys. Similarly, the stacking-fault energy and phase stability (Fig. 3) are only captured within DFT accuracy after accounting for chemical SRO equally in both fcc and hcp phases.

The results obtained here highlight the importance of modeling SRO at the appropriate length scales. The observation that DFT-sized calculations do not converge with respect to SRO (i.e., Fig. 5a) was first made in ref. 23. Yet, here we note that size convergence with respect to SRO is also important for properties such as stacking-fault energy (Fig. 5c) and fcc-hcp relative phase stability (Fig. 5d). Large-scale simulations dramatically reduce the uncertainty in the estimation of γsf—which has its origins in the chemical complexity (Fig. 3)—and reveal in Fig. 5c that γsf is negative at all temperatures. This observation supports the argument proposed in ref. 74 that positive stacking-fault free energy is the likely explanation for finite partial-dislocation separations observed experimentally.

In conclusion, our work introduces a series of design principles for the construction of training data sets that optimize MLP performance in capturing SRO and its effects on important materials properties, such as phase stability and defect energies. The effectiveness of each design principle is confirmed by comparing the performance of MLPs trained with and without the proposed approaches in reproducing associated material properties. A final training set that includes all proposed principles is produced, and its performance across various materials properties is evaluated and summarized in a single metric (Eq. (4)), thereby enabling large-scale atomistic simulations of CrCoNi and other HEAs with high physical fidelity75. These design principles are readily generalizable to other material systems of different chemistries and structures, including ordered compounds76,77,78,79 and systems with defects75. The primary remaining challenge is the high computational cost of dataset generation, which we aim to mitigate in future work.

Methods

Density-functional theory calculations

All DFT calculations were performed with the Perdew–Burke–Ernzerhof80 exchange-correlation functional and projector-augmented wave81 pseudopotentials as implemented in the Vienna ab initio simulation package (VASP82,83,84,85,86) version 6.2.1. The pseudopotentials employed (version potpaw.PBE.54) were such that the valence electrons of Cr, Co, and Ni were 3p63d54s1, 3d84s1, and 3d84s2, respectively. All DFT calculations were spin-polarized (collinear) with an initial magnetic moment configuration of 0.6μB, 2.0μB, and 1.0μB for Cr, Co, and Ni, respectively (μB is the Bohr magneton).

Separate convergence tests were performed for the energy cutoff of the plane-wave basis set, Monkhorst-Pack k-point grid density, and width value for Methfessel–Paxton smearing. The energy cutoff was varied from 180 to 520 eV at intervals of 10 eV, the k-point grid varied from 3 × 3 × 3 to 14 × 14 × 14, and the smearing width was varied from 0.03 to 0.2 eV. Convergence tests were performed iteratively until the per-atom energy was within 3 meV/atom of the best set of parameters employed, except for the smearing width convergence, where the criterion was a per-atom entropic energy contribution below 1 meV/atom. The converged set of parameters was: energy cutoff of 430 eV for all elemental systems, k-point grid of 6 × 6 × 6 for a fcc unit cell, and second-order Methfessel–Paxton smearing with a width value of 0.1 eV. The energy threshold for self-consistency and the force threshold for structure relaxation were 10−5 eV and 0.02 eV/Å, respectively.

The k-point grid for structures other than simple unit cells were scaled proportionally, e.g., k-point grid was 3 × 3 × 3 for a 2 × 2 × 2 supercell. For slab structures the k-point grid was 8 × 2 × 1, where the surface normal is along \(\hat{{\bf{z}}}\). A single Γ k-point was employed for Monte Carlo and molecular dynamics simulations, but the configurations included in the training set of MLPs had their energy and forces recalculated with the optimal k-point density obtained during convergence tests.

Molecular dynamics simulations in the canonical ensemble employed a Langevin thermostat coupled with a Parinello–Rahman barostat with a friction coefficient γ = 10 p s−1 for all atom species and lattice degree-of-freedom. All structure manipulations, visualizations, and analyses of simulations were carried out with the Python interface of Ovito87 and the Python Materials Genomics (Pymatgen)88 library. Automation of calculations was performed with the Fireworks software89.

MLP model training

The MLP model chosen for all potentials discussed here is the Moment Tensor Potential26. The Broyden–Fletcher–Goldfarb-Shanno algorithm90 was employed to optimize the cα parameters, which map the invariant basis polynomials Bα(u) to the DFT energies and forces. The cutoff radius hyper-parameter was rcut = 5 Å and a Chebyshev radial basis of size 8 Å was employed for all models, while \({{\rm{lev}}}_{\max }\) was varied from 6 to 20 to investigate the effects of model capacity. The energy and force data weights were set to 1 and 0.01, respectively. Training was carried out for a maximum of 10,000 iterations with a tolerance of 10−5 for the relative error with respect to the 50th previous iteration. All MLP simulations were performed with the large-scale atomic/molecular massively parallel simulator (LAMMPS91).

Training and test sets

In this section, we describe the data sets summarized in Table 1.

Training set TS-0 was introduced in ref. 23, where it was named “training set without chemical sampling”. The full description of TS-0 can be found in Section 3A of the supplementary material of ref. 23, here we provide a brief summary of its content. The atomic configurations in TS-0 include: single-element ground states and distorted structures, surface slabs with different orientations, and molecular dynamics simulations at different temperatures (including above the melting temperature), random binary systems with various compositions, and special quasi-random structures (SQS92,93) of ternary ab initio molecular dynamics simulations at different temperatures (including above the melting temperature).

Training set TS-1 is composed of chemically random and equiatomic fcc 3 × 3 × 3 cubic supercells with 108 atoms and orthogonal axes, aligned with \(\left\langle 100\right\rangle\) directions. A total of 505 such supercells were employed, adding up to 54,540 atoms in the data set. The effects of thermal expansion and thermal noise were accounted for by employing a strategy developed in our previous work23,94, as described next. Starting from the 0 K lattice constant (3.526 Å), each supercell was isotropically expanded to account for thermal expansion effects by assuming a linear thermal expansion coefficient of 1.56 × 10−5 K−1, which corresponds to 3% total expansion at the average melting temperature \(\left\langle {T}_{{\rm{m}}}\right\rangle =({T}_{{\rm{m}}}^{{\rm{Cr}}}+{T}_{{\rm{m}}}^{{\rm{Co}}}+{T}_{{\rm{m}}}^{{\rm{Ni}}})/3=1917\,{\rm{K}}\). Half of the supercells were isotropically expanded to their corresponding lattice parameters at 300 K, and the other half to \(0.9\left\langle {T}_{{\rm{m}}}\right\rangle\). Thermal noise was accounted for by randomly displacing each atom. The displacements were sampled from a uniform distribution within a sphere as follows. The maximum possible displacement of each atom is αdnn/2, where dnn/2 is half of the nearest neighbor distance and α controls the thermal-noise magnitude for the corresponding supercell. For 75% of the supercells α was evenly sampled in the interval [0.01, 0.30) to replicate low-temperature thermal vibrations, while the remaining 25% supercells had α [0.30, 0.55) to account for thermal vibrations near the melting temperature.

Training set TS-2 creation followed the exact same strategy as TS-1, except for employing chemical configurations extracted from Monte Carlo simulations instead of the chemically random configurations of TS-1. The Monte Carlo95 simulations started from chemically random structures with 0 K lattice parameter (3.526 Å) and ran for a total of 5000 atom-swap attempts at a temperature of 500 K. Five independent simulations were performed and 101 configurations evenly spaced in intervals of 50 steps were extracted from each for the creation of TS-2.

Training set TS-3 augments TS-2 by including configurations obtained from a molecular dynamics simulation of the liquid phase of CrCoNi at 1800 K and zero external pressure. The simulation was set up by randomly adding 72 atoms to a cubic box with an average volume per atom of 14.83 Å3 and a restriction on the minimum interatomic distance of 2.11 Å to avoid the overlapping of atoms. The simulation ran for 80 ps with a timestep of 3 fs; after 3 ps of equilibration a total of 752 snapshots were extracted at intervals of 75 fs. Note that TS-3 was introduced in ref. 23, where it was named “training set with chemical sampling”.

Training set TS-4 was created identically to TS-2, except that the fcc supercells now contain 144 atoms with the 500 K lattice constant (3.545 Å), and have orthogonal axes aligned with the [110], \([1\bar{1}2]\), and \([1\bar{1}\bar{1}]\) directions. Configurations were extracted in intervals of 100 steps for each of the five independent Monte Carlo simulations, adding up to 255 supercells or 36,720 atoms.

Training set TS-5 augments TS-4 with hcp supercells with chemical configurations extracted along five independent Monte Carlo simulations in intervals of 100 steps. The axis rotation performed in TS-4 relative to TS-2 permits us to collect the same number of supercells (255) and atoms (36,720) for the hcp phase as the fcc phase in TS-4. The Monte Carlo simulations started from chemically random structures with a = 2.507 Å (\(c=\sqrt{8/3}a\)) and ran for a total of 5000 atom-swap attempts at a temperature of 500 K. The hcp supercells undergo the same treatment as TS-1 to account for thermal vibrations and thermal expansion.

Training set TS-f was obtaining by augmenting TS-5 with atomic configurations from the liquid phase extracted from the same molecular dynamics simulation performed for TS-3, except that a total of 1020 configurations were extracted such that TS-f is balanced with respect to the number of liquid configurations (73,440) and crystal configurations (73,440, evenly split between the fcc and hcp phases).

The test set employed to compute ϵenergy (Eq. (4) and Fig. 4) was introduced in ref. 23. Its full description can be found in Section 3C of the Supplementary material of ref. 23, here we provide a brief summary of its content. This test set includes configurations and thermodynamic conditions not employed in any of the training sets above. For clarity, we divide the set into three groups. The “random solid solution” configurations included only chemically random CrCoNi supercells with various degrees of thermal noise and thermal expansion. The “thermally equilibrated solid solution” configurations included chemical ordering extracted along DFT Monte Carlo simulations at 750 and 1200 K with thermal noise and thermal expansion. Finally, the “liquid” snapshots were extracted from ab initio molecular dynamics simulations at 2684 K.

MLP Monte Carlo simulations

The MLP and EAM Monte Carlo simulations95 for Fig. 1 were identical to the DFT Monte Carlo simulations: they started from chemically random configurations and ran at 500 K for a total of 5000 steps for a supercell with 108 atoms and a lattice parameter of 3.526 Å. The Warren–Cowley parameters (Eq. (1)) are computed by averaging over the final 100 steps of each simulation. A total of 18 independent Monte Carlo simulations were performed for each of the 20 MLPs in the ensemble of each \({{\rm{lev}}}_{\max }\) to evaluate the standard error of the mean and confidence intervals shown in Fig. 1.

The MLP Monte Carlo simulations for Fig. 3 started from random chemical configurations, ran for a total of 55 steps per atom, and employed the same lattice parameters as the DFT Monte Carlo simulations for TS-4. A total of 108 independent Monte Carlo simulations were performed.

The MLP stacking-fault energy in Fig. 3 is evaluated using supercells with 180 atoms and the same lattice parameter as employed in the DFT evaluation of γsf. Figure 3a employed 108 chemically random configurations, while Fig. 3b employed only the last configuration extracted from 108 independent Monte Carlo simulations. In both cases, only a single intrinsic stacking fault is introduced in each of the 108 supercells.

The MLP Monte Carlo simulations for Fig. 5 started from random chemical configurations, ran for a total of 30 steps per atom, and employed lattice parameters that account for the thermal expansion at the associated simulation temperature. All data in Fig. 5 employed levmax = 20. The size-convergence in Fig. 5a was performed for n × n × n fcc supercells from n = 3 (108 atoms) to n = 14 (10,976 atoms). The temperature dependence of SRO states in Fig. 5b was assessed using 40 independent Monte Carlo simulations with 4000 atoms (orthogonal axis aligned with \(\left\langle 100\right\rangle\) directions) for 300–1700 K in 100 K intervals. The temperature dependence of material properties in Fig. 5c and d was evaluated with 10 independent Monte Carlo simulations with 8316 atoms for each temperature and each phase (i.e., fcc and hcp), with orthogonal axes aligned with the [110], \([1\bar{1}2]\), and \([1\bar{1}\bar{1}]\) directions for the fcc phase.

MLP liquid molecular dynamics

The MLP molecular dynamics simulations for Fig. 2a–c started from the last snapshot of the DFT simulation employed in TS-0 (at 2684 K with 72 atoms). A Nosé–Hoover thermostat and barostat were used to maintain the temperature at 2684 K and zero hydrostatic pressure for 25,000 steps with a step size of 3 fs. The element-wise partial radial distribution functions were evaluated with the configurations from the last 20,000 steps.

The liquid phase stability (Fig. 2d) was evaluated with simulations of 4000 atoms. The simulation box was a cube with edge length of 39 Å inside which the atoms were randomly distributed such that no pair of atoms was closer than 1.7 Å to avoid overlap. After relaxing the structure for 100 steps, the system was maintained at 2684 K for 20,000 steps of size 2.5 fs using a Nosé–Hoover thermostat and barostat. An MLP model is marked as having a stable liquid phase if no critical failure is encountered during this simulation.

Stacking-fault energy and fcc-hcp phase stability

The (111) stacking-fault energy (γsf) was computed as follows. The periodic boundary conditions along the \([1\bar{1}\bar{1}]\) direction are removed from supercells of the fcc phase with orthogonal axes aligned with the [110], \([1\bar{1}2]\), and \([1\bar{1}\bar{1}]\) directions. This structure is then relaxed by minimizing the potential energy at fixed volume while keeping atoms in the top and bottom close-packed planes fixed. An intrinsic stacking fault is then introduced between two \((1\bar{1}\bar{1})\) planes, which is followed by another identical structural relaxation. The γsf is evaluated by computing the energy of the relaxed structures before and after the introduction of the stacking fault and dividing the difference by the supercell cross-sectional area along \([1\bar{1}\bar{1}]\).

The DFT stacking-fault energy in Fig. 3 is evaluated using supercells with 144 atoms from the five independent Monte Carlo simulations performed for TS-4. The stacking-fault energy is evaluated for all six \((1\bar{1}\bar{1})\) planes in the final configuration of each Monte Carlo simulation.

The MLP stacking-fault energy in Fig. 3 is evaluated using supercells with 180 atoms and the same lattice parameter as employed in the DFT evaluation of γsf. Figure 3a employed 108 chemically random configurations, while Fig. 3b employed only the last configuration extracted from 108 independent Monte Carlo simulations. In both cases, only a single intrinsic stacking fault is introduced in each of the 108 supercells.

The temperature dependence of γsf for TS-f with levmax = 20 in Fig. 5c is performed using a size-converged system of 8316 atoms with 18 \((1\bar{1}\bar{1})\) planes. The stacking-fault energy is evaluated for each \((1\bar{1}\bar{1})\) plane in the final configuration of 10 independent Monte Carlo simulations.

The difference in energy per atom between the fcc and hcp phases (ΔE = EfccEhcp) is evaluated by first performing a structural relaxation (i.e., energy minimization) at fixed volume before evaluating the energies. The comparison with DFT in Fig. 3 is performed with a total of 108 supercells of 180 atoms. Meanwhile, the temperature dependence of ΔE for TS-f with levmax = 20 in Fig. 5d is evaluated using a size-converged system of 8316 atoms from 10 independent Monte Carlo simulations.

Melting temperature calculation

The melting temperature in Table 2 was computed using the phase-coexistence method96 as follows. The initial crystal–liquid interface structure had 196,608 atoms, with 25% of atoms in a 16 × 16 × 16 fcc slab with orthogonal axis aligned with the \([1\bar{1}2]\), \([1\bar{1}\bar{1}]\), and [110] directions at the center of the simulation box. The remaining 75% of atoms are randomly placed (i.e., liquid phase) within the empty space above and below the fcc slab, sharing the same xy plane with [110] normal. The initial simulation dimensions are such that both the equilibrium fcc lattice constant and average liquid density at the corresponding temperature are employed, resulting in a box size of 70.8, 100.1, and 337.1 Å. Periodic boundary conditions were applied in all directions.

This crystal–liquid system is equilibrated by first relaxing the liquid atoms' coordinates for 100 steps while keeping the solid atoms fixed. This is followed by 2 ps of equilibration of the crystal slab with a Nosé–Hoover thermostat at 0.95 × the target temperature, and a subsequent 2 ps equilibration of the liquid region at the target temperature. Finally, the liquid region is equilibrated for 2 ps at the target temperature with a Nosé–Hoover barostat to maintain zero pressure along the direction perpendicular to the crystal–liquid interface.

The phase-coexistence simulation is carried out for 300 ps with a timestep of 2 fs. A Nosé–Hoover thermostat and barostat were used to control the temperature and maintain zero pressure along the direction perpendicular to the crystal–liquid interface (the directions parallel to the interface are kept constant). To avoid drifting of the center-of-mass, the total linear momentum is zeroed every 2 ps, and the temperature is calculated by excluding the center-of-mass velocity. The phase-coexistence simulation is performed at various temperatures uniformly distributed along a 100 K window around the estimated melting temperature with 10 K increments. For each temperature, the interface velocity is measured by estimating the rate at which liquid atoms transform into solid atoms according to the polyhedral template matching87,97 approach with a root-mean-square deviation cutoff of 0.15. After an initial 20 ps of equilibration, the total number of atoms in the liquid phase is collected every 2 ps. The melting temperature is estimated as the temperature at which the crystal–liquid interface velocity is zero.