Learning grain boundary segregation behavior through fingerprinting complex atomic environments

Tavenner, Jacob P.; Gupta, Ankit; Thompson, Gregory B.; Kober, Edward M.; Tucker, Garritt J.

doi:10.1038/s43246-024-00616-y

Download PDF

Article
Open access
Published: 10 September 2024

Learning grain boundary segregation behavior through fingerprinting complex atomic environments

Communications Materials volume 5, Article number: 183 (2024) Cite this article

2200 Accesses
1 Citations
Metrics details

Subjects

Abstract

Although continuum-scale segregation is a well-documented behavior in multi-species materials, detailed site-specific behavior remains largely unexplored. This is partially due to the complexity of analyzing materials at the requisite time and length scales for describing segregation with full atomic accuracy. Here, we better evaluate the segregation behavior of disordered grain boundary (GB) atomic environments through leveraging a set of Strain Functional Descriptors (SFDs) to generate an atomic descriptor (i.e., fingerprint). Using this atomic fingerprint, we resolve key relationships between atomic structure and segregation energy. Machine learning (ML) techniques are utilized in concert with this SFD fingerprint to elucidate complex relationships relating segregation potential to changes in specific features of the local Gaussian density captured by the SFDs. Finally, we identify relationships that indicate both individual and joint structure-property correlations. Linking atomic segregation energy to key structural features demonstrates the value of higher-order descriptors for uncovering complex structure-property relationships at an atomic scale.

Learning grain boundary segregation energy spectra in polycrystals

Article Open access 11 December 2020

Atomistic and machine learning studies of solute segregation in metastable grain boundaries

Article Open access 23 April 2022

Feature engineering descriptors, transforms, and machine learning for grain boundaries and variable-sized atom clusters

Article Open access 26 January 2025

Introduction

Alloying is the primary method by which new metals are designed. In most alloys, solute distribution is heterogeneous as individual atoms segregate to different atomic configurations present along grain boundaries (GBs) or other material defects. Although segregation of atomic species often leads to desirable properties, the atomic origins of such segregation behavior are not fully understood. Properties of metal alloys can be significantly changed through alloying and subsequent processing^1,2,3. The modern design of metals relies on alloying as a critical aspect of material design to achieve unique, application-specific properties^4,5,6. Another key aspect of material design is microstructure. Stabilizing material microstructure is vital for retaining alloy properties^7,8,9,10. This is particularly important when considering nanostructured alloys, which are more susceptible to microstructural evolution^11,12,13. These effects stem from both solute strengthening and microstructural stabilization¹³. Highly-processed materials with nanograin structures often have significantly improved mechanical response, but often exhibit microstructural instability^13,14.

Microstructural stabilization stems from complex, interface-driven behaviors of segregation energy (E_seg), which are in turn determined by the character of atomic environments at microstructural defects such as interfaces^15,16,17,18. Through alloying and control of the subsequent segregation behavior, nanocrystalline (NC) systems can be stabilized, effectively unlocking their potential benefits^19,20. To better design advanced alloys and take advantage of unique properties and microstructures obtainable under segregating conditions, improving our understanding of atomic E_seg is critical.

Computational studies of microstructural stability have mainly focused on thermodynamic and solute partitioning effects to guide alloy design^21,22,23,24. Microstructural stability is significantly increased under specific thermodynamic conditions, resulting in solute partitioning between the grain and GB regions^14,24. While not directly calculating segregation behavior, the alloying effects investigated by previous studies result from localized segregation. Improved properties originating from solute segregation have been demonstrated both computationally and experimentally^14,25.

Numerous segregation models have been developed to capture the broad thermodynamics and kinetics of solute segregation in various alloy systems and predict the extent of segregation^{26,27,28,29,30,31,32,33}. While such studies improve our understanding of the global segregation behavior, they lack information regarding the atomic origins of this behavior. Atomistic segregation studies focusing on bicrystalline GB behavior for studying E_seg using molecular force-field based models^{32,33,34,35,36,37,38,39}, which limits the scope of application for resultant findings. Polycrystalline atomic simulations utilizing a combination of Molecular Statics (MS) and Monte-Carlo (MC) techniques can more accurately reproduce experimental segregation behavior in NC alloys, but are computationally expensive and usually limited to small NC samples^25,30,40.

Atomic-scale ML models demonstrate the ability for predicting site E_seg in GB bicrystals^31,41. The accuracy of input E_seg data calculated from interatomic potentials has been indicated to be within expected error compared to a DFT reference⁴². These models indicate that simple scalar atomic descriptors such as excess volume, coordination number, and Voronoi polyhedra parameters are insufficient for achieving appropriate accuracy. Improved accuracy can be achieved through the use of more complex atomic descriptors such as the Steinhardt order parameters⁴³. Further improvements can be made by implementing more advanced local atomic descriptors such as the smooth overlap of atomic positions (SOAP)^44,45. Although the use of the SOAP features improves prediction accuracy, it requires a large number of descriptors to attain such accuracy. Other studies utilize larger active-learning approaches, where the per-atom descriptors are often physically undefined^31,32,33. When implementing a dimensionally-reduced ML model, the atomic accuracy is likewise hampered, eliminating many of the gains realized by utilization of a more advanced descriptor set.

Here, we study the complex relationships between disordered atomic environments within GBs and their segregation energies in the dilute limit. The focus of this study is on the P-doping of Ni (Ni-P), chosen since it has been the subject of previous works regarding segregation behavior^25,41. Furthermore, the agreement between computational and experimental results by Gupta et al.²⁵ indicates that the NiP potential is applicable for a wide range of NiP simulations. To accurately quantify the link between local atomic environments and their properties, a set of 24 rotationally-invariant higher-order atomic descriptors (known as the Strain Functional Descriptors (SFDs)⁴⁶) are combined with modern ML methods. The resultant model allows for rapid calculation of preferential atomic sites for segregation, enabling much larger structures to be appropriately populated with segregates, rather than the current best approach of a computationally expensive random sampling of sites through methods such as MC. Such alloyed structures can be further utilized in studies investigating the fundamental mechanisms of alloy strengthening due to solute segregation, without the significant computational cost of generating the initial alloyed microstructure.

The SFDs improve upon prior higher-order descriptors due to their tuning to key characteristics of local atomic environments, which allows for classification with a minimal number of parameters and eliminates the need for post-processing the descriptors through dimensionality-reduction techniques. This isolation of individual local atomic characteristics in the SFD fingerprint helps elucidate the complex relationship between local atomic structure and segregation potential since each SFD feature isolates a single aspect of the local atomic environment. Model feature analysis is used to indicate these key structural elements, identifying critical aspects of the atomic ’fingerprint’ governing segregation behavior.

Results presented here are demonstrated to be relevant across multiple grain sizes at the nanoscale, indicating their ability to identify the fundamental structural elements driving changes in atomic E_seg. Extending beyond the dilute limit (thus enabling the full-scale implementation of this approach for creating more accurate atomic structures with the desired solute concentration) must await further developments to accurately capture behavioral changes when considering solute-solute interactions, a challenge shared by similar studies which predict E_seg values. The problem of addressing solute behavior beyond the dilute limit has been initially analyzed in regards to GB spectra⁴⁷, but detailed analysis of the evolution of such spectra and energies from an atomic perspective remains yet unrealized.

Results and Discussion

Atomic segregation energies for Ni-P systems comprised of symmetric tilt grain boundaries (STGBs), bulk amorphous metal, and GBs taken from a 6nm average grain size polycrystal were compared to determine the most appropriate training data for developing transferable predictions of E_seg. From ML models developed with each of these three datasets, performance is evaluated for predicting E_seg behavior for NC structures at larger grain sizes, demonstrating the applicability of the models used for accelerated computation. ML model behaviors are then analyzed using SHapley Additive exPlanations (SHAP) values^48,49 to identify critical structure-property relationships used by the models.

Overview of Strain Functional Descriptors (SFDs)

Efficient prediction of segregation behavior necessitates using only a-priori structural information. Relying only on the existing arrangement of atoms in the system provides orders of magnitude faster computation than MS simulations since computationally-expensive energetic relaxation steps and evaluation of acceptance criterion are bypassed. By utilizing the SFD framework to create a detailed fingerprint for each atom, a maximal amount of information regarding the local atomic environment can be encoded with a minimal number of features necessary for identifying key structure-property relationships (Fig. 1).

**Fig. 1: Schematic of atomic Structure-Property Relationships enabled by SFDs.**

To calculate the set of SFDs for each atomic environment, the Gaussian-overlap approximation method⁴⁵ was utilized to describe the local environment of each atom as an overlap of 3-D Gaussian fields with each of its neighboring atoms (Fig. 1b). From this atomic overlap, a set of spatial tensors are created to describe the density field which makes up the local environment. These tensors are subsequently decomposed into rotational and rotationally-invariant polynomial components. The orthogonality of each underlying polynomial function is verified for the relevant spherical-harmonic decomposition, such that each tensor environment can be described in the fewest possible number of polynomial functions (see section III C for further details)⁴⁶. When applied to a specific local Gaussian environment from a single atom, each polynomial function results in a single value which indicates the degree of specific spatial character for that environment.

The number of SFDs is naturally constrained by the order ℓ of the spherical harnomics used in each order of the expansion, where this can be achieved with 2ℓ + 1 terms. Identifying these terms to create an appropriate minimally-spanning set is a mathematical challenge, but this ensures that the structural features each SFD element captures are unique.

ML models using the SFDs of each atomic environment as input features are developed and subsequently analyzed using advanced analysis techniques for explaining model behavior. This enables complex structural relationships underlying atomic segregation behavior to be readily interpreted. Model interpretability combined with isolation of unique atomic features by the SFD fingerprint enables the relationships between specific atomic characteristics and segregation behavior. This improvement in the interpretability of predictive atomic behavior models is critical for furthering our understanding of underlying atomic behaviors beyond accurate predictions.

Training data analysis

When developing ML models, choice of training data is crucial for determining the proper underlying relationships between the input (SFD fingerprint), and output (E_seg). By investigating a variety of datasets independently, the most efficient and accurate method for developing a ML model for a new segregation system can be determined. The differences in accuracy between the models can then be evaluated relative to the expense of generating appropriate training data. Here, optimized gradient boosted machine (GBM) models were developed from E_seg data calculated on three different training sets: (1) a set of 36 STGBs (STGB model (Fig. 2a)), (2) an amorphous cube (Amorphous model (Fig. 2b)), and (3) GB atoms from a polycrystal with 6nm average grain size (6nm model (Fig. 2c)). These datasets were chosen to span a range of atomic structural order when considering non-crystalline environments present within FCC metallic systems.

Comparing these three models, the STGB model has the highest Coefficient of determination (R²), initially indicating that the model provides the best fit of the atomic sites within the training data to their E_seg. In other words, the SFD fingerprint provides enough distinction between atomic environments to uniquely identify each atomic site present in the dataset, given the high R² of the STGB model. Due to the very specific GB relationship of STGBs, the number of unique atomic sites is limited and does not form a continuous set in the SFD feature space - an aspect that will be discussed in more detail in sections II C and II F.

The Amorphous model dataset has significant outliers, indicating that some of the atomic environments within the amorphous system do not follow the predicted pattern regarding their E_seg as a function of the SFD fingerprint. This leads to a lower R² value for this model, though the root mean squared error (RMSE) still compares favorably to previously published results^41,44. Outliers in the model prediction are likely due to highly-disordered atomic environments where the structural (and thus energetic) changes initiated by single-atom segregation swaps extend significantly beyond the local atomic environment.

Finally, the 6nm model appears to have accurately reproduced the E_seg of each site with minimal variation, as no significant outliers are present in the dataset. This leads to both high R² and low RMSE values. The 6nm polycrystalline dataset has both a continuous distribution of segregation energies and eliminates significant outliers that are present in the bulk amorphous region.

For comparison, SOAP parameters were calculated for the 6nm polycrystalline dataset using the same parameters as Wagih and Schuh^44,45,50,51. This dataset has 1015 input features, compared to only 24 for the SFDs. This increase in input dimensionality greatly increases the computational cost of training, as noted by Wagih and Schuh⁴⁴ in the development of their dimensionally-reduced model. Despite the larger number of input parameters, this model achieved an RMSE of 0.0567eV, compared to 0.0273eV for the SFD-based model (Supplementary Fig. 1). This highlights the advantage of the SFDs being a minimally-spanning and complete descriptor set⁵², which likewise translates into greater model interpretability due to the reduced input dimensionality.

An advantage of GBM-type models is their ability for interpretability through feature analysis of the final model. Recent methods such as the SHAP approach focus on aggregating the effects of individual data points to explain tree-based model behavior^48,49. Such analysis methods provide a more detailed picture of the decisions and relationships being interpreted by the model. Each input data point is evaluated independently to determine the degree to which each feature in the input vector (atomic fingerprint) increases or decreases the final output value. This allows for further analysis and comparison between models since the effect of input features on the final model performance can be evaluated in more detail than simply ranking their final effect on total model behavior.

Across the ML models, P0_I0, P2_I2, and P4_I8 are generally among the most important features used for predicting E_seg. These features are all related as the purely radial descriptors of the SFD set, while the majority of other features describe non-radial or rotational aspects of the geometry. Due to this radial nature, the polynomial expansion of these features in Cartesian space is fairly straightforward (see Section II G for the functional forms). P0_I0 is 0^th order radial node, which here is the total spherical Gaussian weight of the neighboring atoms. P2_I2 is derived from the 2^nd order radial node, which here is a spherical Gaussian weighted by radial distance to the 2^nd power (r²). P4_I8 is derived from the 4^th order radial node, which here is a spherical Gaussian weighted by radial distance to the 4^th power (r⁴). They capture different aspects of the radial distribution of atoms in each neighborhood.

In this analysis, the atomic fingerprint input features are most similar between the 6nm and Amorphous models, with less similarity to the STGB model (Fig. 3). For the STGB model, there is a disparate distribution of SHAP values for each feature. The majority of data points have a singular SHAP value, with the remaining SHAP values distributed evenly over the total range. This indicates that the output E_seg is most often adjusted by a constant, rather than properly correlated with the relevant feature input. From this interaction, the STGB model appears to be fitting directly to the E_seg of specific sites, rather than extracting the fundamental relationships between site character and E_seg. This site-specific fitting is likewise supported by the significant number of data points that have high SHAP values for features outside of the top six features.

**Fig. 3: ML Model Feature Importances.**

In the Amorphous and 6nm models, SHAP values for a given feature are more continuously distributed, indicating a smoother transition between the identified behaviors of segregating and anti-segregating sites. Additionally, the majority of predictive variance is extracted from fewer features in both the Amorphous and 6nm models. With the Amorphous model, however, there are still several outliers with high magnitude SHAP values where the model has tried to adjust to the outlying atomic environments which do not necessarily represent the bulk trend in behavior.

Critically, with the continuous distribution and low variance of SHAP values beyond the first four features coupled with the high degree of accuracy determined by the training data comparison, the 6nm model appears to be accurately uncovering fundamental relationships between the atomic SFD fingerprint and resultant E_seg. This is vital for using ML relationships to identify physical structure-property relationships.

Since the SHAP values indicate the direction of change caused by each feature and not just the magnitude, the relationship of individual features to the resultant E_seg is directly determined. For the 6nm model, P4_I8 and P2_I2 both have a positive correlation with E_seg, while the P3_I0 has a primarily negative correlation. The primary features selected by the Amorphous and 6nm models of the P4_I8 and P2_I2 features are SFD measures of the radial density of the atomic environments (in 4^th and 2^nd respectively), while the P3_I0 measures the net 3^rd order distortion (strain gradient). This determination of the radial behavior and further details on the nature of various SFD functions is discussed in more detail in the interpretability section. Key findings from leveraging the SHAP analysis include:

6nm Model most accurately predicts E_seg
P4_I8 and P2_I2 positively correlate with E_seg
P0_I0 negatively correlates with E_seg

Application to Larger Structures

The creation and analysis of a ML model in isolation is only of limited value since without further application the model is of little practical use. The primary use of ML models predicting E_seg is to enable calculation of larger atomic systems which are otherwise computationally expensive with traditional MS techniques. To this end, each of the models developed was tested on a set of new polycrystalline samples of varying grain size from 6 to 16nm with a new random distribution of texture so as not to bias towards a possible texture-dependent E_seg relationship that would potentially skew results from the 6nm model⁵³. These grain sizes represent the majority of GB character studied by molecular dynamics techniques where such models are the most applicable. For evaluation of model performance, E_seg of all relevant GB sites was calculated with MS for these larger-grain structures. In both the MS simulation and ML predicted configurations, all polycrystals maintained a consistent distribution of segregation energies over the GB sites (Supplementary Fig. 2).

Evaluating the ML models on a common polycrystalline dataset allows for evaluation of model performance if they were to be used to evaluate E_seg of new structures. Comparing the performance of each model on the 10 nm polycrystal data, the 6 nm model is the most accurate, followed by the Amorphous and STGB models. Although R² correlation values only improve moderately between the Amorphous and 6 nm models, the RMSE is nearly halved indicating a significantly improved model performance (Fig. 4). This provides confidence that the 6 nm model is most appropriate for predicting the E_seg behaviors of unknown polycrystalline systems (and thus captures the underlying structure-property relationships most appropriately), which is consistent with the previous analysis of each model.

**Fig. 4: ML Model Prediction Accuracy.**

Investigating the changes in predictive accuracy over the range of grain sizes demonstrates the limits of the three ML models. Across all grain sizes tested, the 6nm model consistently resulted in the highest R² accuracy (Fig. 5). For each model, accuracy did not decrease with increasing grain size but stayed consistent throughout the range tested. This consistency in predictive power across grain sizes for each model indicates that the fundamental SFD atomic features relating to segregation energy are not changing with grain size.

It does not appear that any suspected changes in GB character with increasing grain size significantly influence the E_seg distributions of the GBs in these polycrystalline systems (Fig. 5). Thus, the atomic characteristics which govern E_seg are consistent throughout the grain sizes investigated in this study. Despite this consistency at the per-site level, significant variability in E_seg between discrete boundaries in the polycrystal or localized variation within a single GB can still be expected and is known to occur both computationally and experimentally²⁵. How grain size effects may be related to these GB-level differences has not been investigated in detail. Likewise, such variability in GB-level E_seg behavior likely plays an important role in determining segregation behavior beyond the dilute limit. Fully addressing this level of detail with regards to independent atomic motion driving continuous segregation from the bulk to the GB region awaits advances in computation and/or simulation procedure which will allow for more rapid determination of a complete segregation spectrum including dilute and non-dilute effects.

Computational efficiency is greatly increased utilizing the SFDs and ML approach, with per-atom computation times improved by over 150,000× (compared to the traditional MS approach) for systems with grain sizes over 10 nm (Supplementary Table 1). This drastic decrease of computational cost is critical, as these grain sizes encompass polycrystalline behavior where the largest variation in GB character would be expected to occur, given the large changes in GB curvature and energy required for even moderate change in overall grain size. By first training on a smaller dataset of atomic environments which encompass the appropriate variation typically found in a polycrystalline sample, the ML model allows for an increase in computational efficiency by avoiding both MC random sampling and MS energetic minimization steps, vastly improving scaling efficiency to larger systems.

Visualization

Although the Gaussian neighborhood utilized by the SFDs has been well-defined previously^45,54,55, the connection to individual SFD functions can be difficult to visualize. To address this challenge and illustrate the differences in segregating and anti-segregating atomic sites, surfaces representing the local atomic Gaussian density surrounding the central atom are illustrated for different sites from the 6 nm Ni polycrystal (Fig. 6). These surface representations were generated by first using a sphere around the central atom with a size scaled to the total local atomic density (P0_I0). The sphere is subsequently distorted by each Gaussian overlap vector generated to neighboring atoms, scaled by the corresponding Gaussian weighting and using a spherical Gaussian decay relative to the central atom (Gaussian Kernel Surface method, Supplementary Section 1). This visualization provides a qualitative method to compare distinct atomic environments and visualize SFD numerical features in complex local arrangements such as those found in a GB.

Local atomic features

To visualize the relationships between the key identified SFD elements and E_seg behavior, minimum E_seg (segregating), equilibrium FCC, and maximum E_seg (anti-segregating) environments are compared using this Gaussian surface representation (Fig. 6). Since these shapes represent local Gaussian density, it is important to remember that a larger structure visually represents a more compacted atomic core spatially.

Qualitatively comparing the equilibrium FCC structure to the minimum and maximum E_seg environments, the maximum E_seg structure appears closest to FCC (Fig. 6). However, relative neighbor contributions are significantly decreased, as noted by the lack of clear ’peaks’ on the surface. In the highly segregating minimum E_seg case, there several neighbors closer to the central atom as indicated by the large peak intensity, with only a marginal increase in total local atomic density (P0_I0). This indicates that the first nearest neighboring atoms are quite close to the central atom (Fig. 6b). For the FCC environment, natural slip-planes created by the triangular faces between neighbors on parallel 111 planes are readily identified on the top and front face of the image (Fig. 6b). Likewise, the 12 nearest neighbors are readily apparent in the surface representation due to the Gaussian distance scaling and their primary contribution to the local neighborhood.

Quantitatively, P0_I0 decreases from minimum to equilibrium to maximum, following what we would intuit for the base ’size’ of the environment (0^th order ’radius’, r). Compared to equilibrium conditions, both minimum and maximum atomic environments increase P2_I2 magnitude, indicating that they both have increased radial distortion compared to FCC where all of the nearest neighbors are the same distance (2^nd order r² measure). Identified as the most important feature for determining E_seg (Fig. 3), P4_I8 is decreased in the minimum environment and increased in the maximum environment, and indicates the sharpness of variation in local Gaussian density (4^th order r⁴ kurtosis measure). In the maximum energy case, variation between high and low density is relatively small compared to that of the minimum energy case, with the equilibrium lying somewhere in the middle (Fig. 6).

In terms of the local atomic environment, changes in various functions of the radial density can be summarized to indicate that E_seg is minimized in a compact environment, and maximized for a more expanded environment. By isolating various components of this variation in local density through the SFD atomic fingerprint, feature analysis indicates that localization (P4_I8) and radial distortion (P2_I2) of the local Gaussian environment outweigh the environment density (P0_I0) when determining E_seg (Fig. 6).

Fingerprint Feature Relationships

Since P4_I8 and P2_I2 have been identified as the most important local features for determining E_seg, direct comparisons between datasets can be more easily made by isolating these specific elements from the high-dimensionality atomic fingerprint composed of the entire SFD descriptor set. Conveniently, identification of precisely two atomic features from the 6nm model as carrying the majority of variation in E_seg allows for simple 2-D plots for comparison between datasets (Fig. 7).

**Fig. 7: Distribution of Atomic Environments Across SFD Features.**

Comparing the three datasets used for training data (6nm Polycrystal, Amorphous, and STGB), key differences in their distributions of atomic environments support the differences already identified in the ML model analysis. The STGB dataset has a larger range in both relevant axes, but the occupation of atomic sites is not continuous over the two-dimensional space (Fig. 7). Instead, there are isolated ’islands’ of specific atomic character with few atomic environments spanning the space between these highly occupied areas. Furthermore, the highest density location in the STGB dataset is close to the equilibrium FCC conditions, indicating that a large portion of atoms in that dataset are similar to an FCC occupancy. This is partially due to more FCC-like atoms near the GB being selected as part of the STGB region and illustrates the challenges with a binary selection of GB regions such as through Common Neighbor Analysis⁵⁶. These characteristics of the STGB dataset define the ML results discussed earlier, where the STGB model can accurately predict E_seg values within the testing data but is significantly less accurate when determining the energies of atomic sites within polycrystals (Figs. 4a, 5).

In comparison to the STGB dataset, the Amorphous and 6 nm Phase-Field datasets are more continuous in their distribution across the P2_I2-P4_I8 space (Fig. 7). Both of these distributions have a single maximum-density area from which the density of atomic environments decreases towards the maximum range of the distribution. This continuous nature of the distributions enables the most relevant atomic fingerprint features to be more easily discovered by the model. Instead of fitting primarily to the E_seg of select environments which are largely distinct from each other as in the STGB dataset, the Amorphous and 6 nm models can identify the continuous distribution of potential atomic sites as given by each of these datasets. Thus, the E_seg function identified by the Amorphous and 6nm models has a more continuous distribution based on the underlying distribution of atomic fingerprints, which more directly relates to the underlying physical phenomenon that determine the atomic E_seg landscape.

Comparing the Amorphous and 6 nm datasets, while both distributions have similar shapes, the Amorphous distribution has a smaller range (Fig. 7). This is a primary factor contributing to the decreased accuracy of the Amorphous model when predicting E_seg for polycrystalline GB environments. Although this decreased extent of the Amorphous distribution may be initially perplexing since the amorphous bulk is generally considered to be more ’disordered’ than GB environments, we note that the comparison is focused on P2_I2 and P4_I8. Since P2_I2 and P4_I8 are radial density functions, they do not fully capture all characteristics of atomic disorder which may be more significant in an amorphous structure than that of a polycrystal. In addition, the maximum density region of the Amorphous distribution is shifted slightly towards an FCC character compared to that of the 6 nm dataset. This indicates that the structural relaxation allowed in the amorphous bulk causes the local atomic environments to minimize their energy by making their local radial density of neighbor atoms as similar to FCC as possible. In the 6nm polycrystal, however, local structural re-arrangements are more restricted, leading to a slight increase in radial distortion. Although such re-arrangements of an atomic environment may be less energetically favorable individually, they enable nearby atoms to occupy a more favorable energetic state, causing the energy of the entire system to be decreased.

Radial feature analysis

In the feature analysis of the ML models, the P0_I0, P2_I2, and P4_I8 features stood out with significantly higher SHAP values, indicating their higher importance in determining E_seg behavior (Fig. 3). These features are determined as radial density components of local atomic environments in 0^th, 2^nd, and 4^th orders, respectively. By establishing their importance in dictating the segregation behavior, the feature impact analysis reveals a well-established feature of metallic systems that stems from their delocalized bonding character. This is the reason why the embedded atom method (EAM) potential forms of these systems are still widely accepted today, despite the lack of angular dependence in the EAM formulation. If a system best described by a potential form that included more significant angular behavior (and thus a system which intrinsically has important atomic radial components) was analyzed, other SFD elements of the atomic fingerprint would be expected to have higher importance when predicting trends in that system.

The complete functional form of P0_I0 is indicated by equation (1), as a simple Gaussian radial density function.

$${{{\rm{P}}}}0\_{{{\rm{I}}}}0(i)={(\sqrt{2\pi }\sigma )}^{-3} {\sum}_{j\ne i}\exp \left(-\frac{{r}_{ij}^{2}}{2{\sigma }^{2}}\right)$$

(1)

In all equations (1) through (4), r_ij is the distance between atoms i and j, and σ is the Gaussian width, which is determined such that the corresponding 3-D Gaussian volume is equal to the volume of a primitive unit cell of the reference crystal structure. P2_I2 and P4_I8 are outlined in equations (2) and (3) respectively, where constants A, B, and C are constructed to ensure orthogonality between the terms by removing the contributions of lower-order radial nodes in each term.

$${{{\rm{P}}}}2\_{{{\rm{I}}}}2(i)={(\sqrt{2\pi }\sigma )}^{-3} {\sum}_{j\ne i}\sqrt{\frac{2}{3}}\left(\frac{{r}_{ij}^{2}}{2{\sigma }^{2}}-\frac{3}{2}\right)\exp \left(-\frac{{r}_{ij}^{2}}{2{\sigma }^{2}}\right)/{{{\rm{P}}}}0\_{{{\rm{I}}}}0(i)$$

(2)

$${{{\rm{P}}}}4\_{{{\rm{I}}}}8(i)={(\sqrt{2\pi }\sigma )}^{-3} {\sum}_{j\ne i}\sqrt{\frac{2}{15}}\left(\frac{{r}_{ij}^{4}}{2{\sigma }^{4}}-5\frac{{r}_{ij}^{2}}{2{\sigma }^{2}}+\frac{15}{4}\right)\exp \left(-\frac{{r}_{ij}^{2}}{2{\sigma }^{2}}\right)/{{{\rm{P}}}}0\_{{{\rm{I}}}}0(i)$$

(3)

The polynomial forms of P2_I2 and P4_I8 are defined to provide orthogonality between the original r² and r⁴ forms and P0_I0. In general terms, such functions would define the mean, variance, and kurtosis of a distribution. However, given the limited number of points (neighbor atoms) in these functions as applied to atomic environments, the quantities can’t be readily interpreted in such a manner. However, they do capture distinct aspects of the Gaussian-weighted radial distribution.

Given these terms, it is not surprising that radial density is primarily identified to be an important factor in predicting E_seg. This can at least partially be explained by looking at the functional form of the EAM and EAM-FS potentials utilized to calculate the E_seg data on which the models were trained (4).

$${E}_{i}={F}_{\alpha }\left({\sum}_{j\ne i}{\rho }_{\alpha \beta }({r}_{ij})\right)+\frac{1}{2} {\sum}_{j\ne i}{\phi }_{\alpha \beta }({r}_{ij})$$

(4)

The functional form of both the EAM and EAM-FS models is purely radial, as indicated by equation (4), originally proposed by Daw and Baskes⁵⁷, with modification by Finnis-Sinclair⁵⁸. In summary, the most important SFD features selected by the ML models have purely radial dependence. This is consistent with the underlying computation of E_seg based on an EAM potential, which also considers purely radial terms. Since EAM potentials have been widely shown to accurately reproduce atomic energies in a large number of metallic systems, we can be confident that this relationship is representative of the underlying physics. If a system in which non-radial features dominated the energetic landscape was investigated, non-radial SFD features would be expected to have a primary contribution. This ability of the SFD and ML analysis to identify the critical elements of atomic structure is a significant advantage of this approach over previous methods.

Non-radial feature analysis

While the primary features of the 6 nm model are largely radial, one non-radial feature (P3_I0, which measures the net third order distortion, i.e., strain gradient) has a small but non-negligible impact on the model prediction. To investigate the nature of this relationship in the 6 nm model further, a joint effect of two input features can be compared to the SHAP value across the input dataset (Fig. 8). Investigating this joint effect with the feature with the highest overall SHAP values (P4_I8), we observe that the relative impact of P3_I0 increases as the value of P4_I8 increases. This effect serves to increase the E_seg when P3_I0 is low, and decrease the E_seg when P3_I0 is high (Fig. 8). Therefore, the degree of tetrahedral distortion serves to tune the model performance primarily at lower atomic densities (Fig. 6), decreasing the E_seg when there is large trigonal distortion and increasing the E_seg when distortion is small.

This E_seg dependence on non-radial components for an EAM-based potential calculation demonstrates the power of using advanced structural descriptors for mapping complex atomic properties. Although the EAM function does not directly model the effects of non-radial behaviors for a single atom, the combined effect of multi-body system behaviors over a complex energy landscape leads to effects from non-radial features of the local atomic environment. Such features are readily captured by higher-order descriptor sets, among which the SFDs are uniquely capable of deconstructing the contributions from distinct aspects of complex atomic arrangements. For non-radial contributions to E_seg, the key elements are:

P3_I0 measures net 3rd-order distortion
High P3_I0 impact when P4_I8 is large
Low P3_I0 impact when P4_I8 is small
P3_I0 negatively correlates with E_seg

It should be noted that the total SFD atomic fingerprint utilized in this work has a majority of non-radial angular terms which are utilized to uncover these more complex atomic trends. Such non-radial components are expected to be of increased importance when determining atomic behaviors in other systems where angular dependence plays a larger role in atomic behavior, such as systems with a higher degree of non-metallic or directional bonding (such as systems best described by other (angular dependent) potential formulations such as modified embedded atom method (MEAM) or Tersoff).

Implementation of the SFD atomic fingerprint coupled with the ML models investigated here illustrates this approach’s capability for uncovering complex atomic interactions. By isolating purely radial terms from terms with angular dependence, the SFD fingerprint allows a ML model to appropriately identify metallic systems as having a majority of radial behavior, with only small contributions from angular configurations. This isolation of local atomic features is particularly useful for model interpretability, enabling key relationships to be uncovered.

Methods

Simulations

Polycrystalline microstructures were initialized through a phase-field to atomic structure generation method developed by Gruber et al⁵⁹. This method results in more physically-representative microstructures which have improved grain, interface, and triple junction structure, resulting in more accurate deformation behavior. In this method, the microstructural information taken from the phase-field model is then given a random texture and grain regions populated accordingly. Polycrystals used in this study were initialized with around 40 grains, resulting in around 250 unique GBs, to sample a larger degree of GB texture. Once the initial system is populated with atoms, the polycrystalline structures are thermally equilibrated to ensure that they are in an appropriate metastable equilibrium. This process is done by a recently developed method utilizing a two-step equilibration methodology from Gupta et al.⁶⁰, which identifies and isolates the GB and near-GB regions from the inter-grain atoms. This methodology was shown to expedite the equilibration process without eliminating the selected grain topology while resulting in GB structures and associated properties comparable to structures equilibrated through traditional techniques, but with a significant reduction in required simulation time^53,59.

Atomic site E_seg in each polycrystalline system was calculated by setting a single solute atom inside an FCC grain and subsequently swapping that solute atom to each GB site, taking the difference in energy of the structures after minimization as the E_seg(5), (Fig. 9). All MS calculations were performed using LAMMPS^61,62. GB regions were identified by selecting non-FCC regions as identified by a common neighbor analysis algorithm⁶⁰. An EAM potential from Sheng was used to represent the Ni-P behavior⁶³. For the 12 nm, 14 nm, and 16 nm average grain size systems, a randomized set of 100,000 GB atoms were selected to minimize the computational time required for testing all the GB atoms present in these systems.

**Fig. 9: Schematic of E_seg Calculation.**

In the Ni-P system, site energy is well-correlated with segregation energy (Fig. 9). However, this correlation alone is a similar strength to the STGB model and alone is not a sufficient descriptor. This further highlights the significant limitations when training only on STGB data for fully describing polycrystalline GB environments. To verify that this procedure applies to a range of segregation systems and is not limited to the Ni-P system and demonstrate the capability of the SFDs for prediction when site energy has a worse correlation with the segregation energy, six previously studied Al-alloy systems were studied with 6 nm polycrystals generated by the same method. ML models developed for these systems showed similar accuracy to the results presented here, even for systems where the site energy has a significantly worse correlation with segregation energy than Ni-P (Supplementary Figs. 3, and 4).

The amorphous structure utilized a 12 × 12 × 12 nm simulation cube with 1,000,000 atoms which was homogenized at 1.8T_m before applying conjugate-gradient energy minimization to generate a single-species amorphous metallic region. The set of 36 STGBs used were generated from 〈100〉 and 〈110〉 tilt-axis symmetric boundaries⁶⁴.

E_seg in the STGB and Amorphous systems was calculated by directly changing the atom type at each reference site. This is necessary since no appropriate equilibrium FCC segregation sites are present in these simulations. Since this procedure changes the atomic makeup of the system, the difference in equilibrium energy between the solute and solvent must be taken into account by calculating reference energy values for single Ni and P atoms in perfect Ni FCC lattices. This is illustrated in Equation (5), where ΔE is the total change in system energy, ΔN^P is the change in the number of P atoms, ΔN^Ni is the change in the number of Ni atoms, and ${E}_{Ni}^{P}$ and ${E}_{Ni}^{Ni}$ are the equilibrium energies of a P and a Ni atom in an equilibrium FCC lattice, respectively.

$${E}_{seg}^{i}={{\Delta }}E+{{\Delta }}{N}^{P}{E}_{Ni}^{P}+{{\Delta }}{N}^{Ni}{E}_{Ni}^{Ni}$$

(5)

Similar formulations for calculating E_seg of individual atomic sites within a bulk system have been described by many previous authors studying segregation behavior^16,25,27,35.

Machine learning

ML models were trained using the XGBoost GBM architecture using a 5-fold cross-validation scheme⁶⁵. All model implementation and data analysis were completed with the use of python packages numpy, pandas, matplotlib, and scipy^66,67,68,69. Hyperparameters for the number of trees and maximum search depth were investigated, with default parameters from the XGBoost library found to provide accurate results (Supplementary Figs. 5, 6). Tested models were seen to be relatively robust at avoiding overfitting. When comparing the STGB, Amorphous, and 6 nm models, a linear offset term was utilized to find the best-fit line comparing the true and predicted values. This was done to account for the difference in segregation potential intrinsic to each of these calculation methods. The values were not scaled, however, keeping a slope of unity for all fit comparisons to ensure that the predicted and true distributions were not modified in shape, but only shifted in value where the shift represents the change in chemical potential between calculation methods.

SOAP parameters were calculated using QUIP and the quippy python interface^50,51. A cutoff of 6Åwas used, along with 12 radial basis functions (n_max), a maximum spherical harmonic degree (l_max) of 12, and a gaussian width (σ_at) of 1 Å, which results in 1015 SOAP features.

Utilizing these ML models for calculating E_seg provides significant speed-up for calculating atomic E_seg behavior of subsequent systems. As a reference, calculating initial E_seg training data on the 6nm polycrystal GBs took 520 core-hours while calculating SFDs and predicting E_seg of the entire 6nm polycrystal took 0.70 core-minutes. With larger systems, the efficiency improves further, with 9000 core-hours taken for the 10nm GBs and only 3.46 core-minutes for the SFDs and ML prediction (Supplementary Table 1). SHAP values used for analysis and model interpretation were calculated using the relevant TreeExplainer functions from Lundberg and Lee^48,49. Each model was evaluated with the entire dataset utilized for train/test splits.

Strain functional descriptors (SFDs)

To devise a unique descriptor of local atom environments, techniques were adapted from many different areas of atomic physics, chemistry, computer science, and computational materials science⁴⁶. The first step in describing a local atomic environment is first to identify the characteristics of local atomic neighborhoods which can then be translated into discrete descriptors of that environment. For the SFDs, the atomic mathematical framework is based on a Gaussian overlap method developed initially by Bartok et al., where each atomic neighbor is represented by a vector pointing from the central atom of interest with a length scaled by a Gaussian function of the distance between the two atoms^45,55,70. Here, we define the 3-D Gaussian with a σ such that the volume for a single Gaussian is equal to the atomic space-filling volume in the reference crystalline structure at its energetic minimum. Each atom each atom around the central atom inside a cutoff distance of 5σ is considered according to this weighting. For the NiP system used here, a σ of 0.881 Å is used. From this vector-based description of the atomic positions, a Taylor series expansion of n^th order terms (corresponding to rank n) is used to construct spatial tensors of rank 0 to 4 (representing ${{\mathbb{R}}}^{0}$ to ${{\mathbb{R}}}^{4}$) which describe the overall Gaussian geometry. Higher-order terms can be constructed, but expansion to rank 4 is sufficient for the majority of atomic environments. This construction leads to a set of 24 atomic descriptors which describe the complete nature of the local atomic environment captured by the 0^th to 4^th order Gaussian tensors.

This tensor-based description is then decomposed to form scalar values that represent various atomic distortions in geometric terms (i.e, tetrahedral or octahedral character, net shear). For the rank 0 tensor, this is trivial as it is already a scalar that represents the local Gaussian density of the atomic neighborhood (1). When considering all higher-order tensors, the Cartesian tensor formed from the initial construction (from the Gaussian overlap vectors) is first converted into a spherical-harmonic tensor. This spherical harmonic tensor is subsequently converted into a set of rotationally-invariant scalars using Clebsch-Gordan coefficients^{71,72,73,74,75}. The redundant set of all possible scalars representing a given spherical harmonic tensor is down-sampled to a unique and complete set of descriptors pre-defined for each tensor order. Finally, radial orthogonality is established (as in equations (2) and (3)) between similar scalar terms originating from different tensor orders such that each scalar carries a maximum amount of information when considering the entire set of SFDs across all orders. This independence between similar scalar features of different orders represents another advantage of the SFD formulation over previously published descriptors^76,77,78,79. Further representation of the mathematics behind these principles underlying our SFDs requires more in-depth discussion than relevant to this manuscript and awaits further publication⁴⁶.

The SFD approach follows an established method for describing physical properties in terms of irreducible spherical tensors^{71,72,73,74,75}. This step is somewhat different from the bispectrum approach of Bartok et al.⁷⁰ (SOAP) and produces a tensor representation that is very analogous to the principal quantum representation of atomic orbitals [see⁸⁰ for examples]. In the framework illustrated by Levine, the n = 4 expansion terms would be primarily described by shapes analogous to the g orbitals (l = 4). Additional lower order terms (l = 2 and l = 0 with radial nodes) are also generated that describe orthogonal subspaces in the rotationally invariant SO(3) space⁷⁵. These scalar functions can then be further reduced to rotationally invariant descriptors^{70,75,76,77,78,79}. The application of Clebsch-Gordan coupling is a useful means of achieving this rotational invariance based on the harmonic polynomial basis^70,79,81. As noted by Lo and Don⁷⁹, a large number of invariants that contain redundant information can be generated, and the reduction of these to a minimal set with compact geometric information is described by Kober et al.⁴⁶. This results in a set of 24 rotationally-invariant descriptors with an additional 12 indicating the local rotationally-dependent character of each tensor relative to a global coordinate frame. Only the 24 rotationally-invariant descriptors were used in this study.

Data availability

Data which support the ML model development and analysis of this study are available at https://doi.org/10.6084/m9.figshare.19210287.

Code availability

Code supporting this study is also available at https://doi.org/10.6084/m9.figshare.19210287.

References

Guttmann, M. The role of residuals and alloying elements in temper embrittlement. Philos. Trans. A Math. Phys. Sci. 295, 169–196 (1980).
CAS Google Scholar
Venezuela, J. & Dargusch, M. S. The influence of alloying and fabrication techniques on the mechanical properties, biodegradability and biocompatibility of zinc: acomprehensive review. Acta Biomater. 87, 1–40 (2019).
Zhao, X., Chen, H., Wilson, N., Liu, Q. & Nie, J. F. Direct observation and impact of co-segregated atoms in magnesium having multiple alloying elements. Nat. Commun. 10, 1–7 (2019).
Google Scholar
Vaidya, M., Muralikrishna, G. M. & Murty, B. S. High-Entropy Alloys by Mechanical Alloying: A Review. https://www.cambridge.org/core/journals/journal-of-materials-research/article/abs/highentropy-alloys-by-mechanical-alloying-a-review/8B9B1E7CDF719FA7C418330BFEF45912 (2019).
Chi, Y., Gu, G., Yu, H. & Chen, C. Laser surface alloying on aluminum and its alloys: a review. Opt. Laser Eng. 100, 23–37 (2018).
Long, H., Mao, S., Liu, Y., Zhang, Z. & Han, X. Microstructural and compositional design of Ni-based single crystalline superalloys - a review. J. Alloy Compd. 743, 203–220 (2018).
Shyam, A. et al. Elevated temperature microstructural stability in cast AlCuMnZr alloys through solute segregation. Mater. Sci. Eng. A 765, 138279 (2019).
Article CAS Google Scholar
Ejenstam, J., Thuvander, M., Olsson, P., Rave, F. & Szakalos, P. Microstructural stability of Fe-Cr-Al alloys at 450-550 ^∘C. J. Nucl. Mater. 457, 291–297 (2015).
Article CAS Google Scholar
Goyal, K. & Sardana, N. Phase stability and microstructural evolution of Ti2AlNb alloys-a review. In Materials Today: Proceedings, vol. 41, 951–968 (Elsevier, 2020).
Xia, W., Zhao, X., Yue, L. & Zhang, Z. Microstructural evolution and creep mechanisms in Ni-based single crystal superalloys: a review. J. Alloy Compd. 819, 152954 (2020).
Article CAS Google Scholar
Tschopp, M. A., Murdoch, H. A., Kecskes, L. J. & Darling, K. A. Bulk Nanocrystalline Metals: Review of the Current State of the art and Future Opportunities for Copper and Copper Alloys. https://link.springer.com/content/pdf/10.1007%2Fs11837-014-0978-z.pdf (2014).
Kube, S. A. et al. Combinatorial study of thermal stability in ternary nanocrystalline alloys. Acta Materialia 188, 40–48 (2020).
Article CAS Google Scholar
Mathaudhu, S. N. Building on Gleiter: the foundations and future of deformation processing of nanocrystalline metals. Metall. Mater. Trans. A: Phys. Metall. Mater. Sci. 51, 6020–6044 (2020).
Article CAS Google Scholar
Spearot, D. E., Tucker, G. J., Gupta, A. & Thompson, G. B. Perspective: Mechanical properties of stabilized nanocrystalline FCC metals. J. Appl. Phys. 126, 110901 (2019).
Article Google Scholar
Lejcek, P.Grain Boundary Segregation in Metals (Springer, 2010), springer s edn.
Biswas, A., Siegel, D. J., Wolverton, C. & Seidman, D. N. Precipitates in Al-Cu alloys revisited: Atom-probe tomographic experiments and first-principles calculations of compositional evolution and interfacial segregation. Acta Materialia 59, 6187–6204 (2011).
Article CAS Google Scholar
Baik, S. I., Olszta, M. J., Bruemmer, S. M. & Seidman, D. N. Grain-boundary structure and segregation behavior in a nickel-base stainless alloy. Scr. Materialia 66, 809–812 (2012).
Article CAS Google Scholar
Wynblatt, P. Effects of interfacial segregation on wetting in solid metal-on-metal and metal-on-ceramic systems. Acta Materialia 48, 4439–4447 (2000).
Article CAS Google Scholar
Zhou, X., Yu, X. X., Kaub, T., Martens, R. L. & Thompson, G. B. Grain Boundary Specific Segregation in Nanocrystalline Fe(Cr). Sci. Rep. 6, 34642 (2016).
Article CAS PubMed PubMed Central Google Scholar
Chookajorn, T. & Schuh, C. A. Nanoscale segregation behavior and high-temperature stability of nanocrystalline W-20 at.% Ti. Acta Materialia 73, 128–138 (2014).
Article CAS Google Scholar
Kalidindi, A. R. & Schuh, C. A. Stability criteria for nanocrystalline alloys. Acta Materialia 132, 128–137 (2017).
Article CAS Google Scholar
Abdeljawad, F. & Foiles, S. M. Stabilization of nanocrystalline alloys via grain boundary segregation: A diffuse interface model. Acta Materialia 101, 159–171 (2015).
Article CAS Google Scholar
Murdoch, H. A. & Schuh, C. A. Estimation of grain boundary segregation enthalpy and its role in stable nanocrystalline alloy design. J. Mater. Res. 28, 2154–2163 (2013).
Article CAS Google Scholar
Murdoch, H. A. & Schuh, C. A. Stability of binary nanocrystalline alloys against grain growth and phase separation. Acta Materialia 61, 2121–2132 (2013).
Article CAS Google Scholar
Gupta, A., Zhou, X., Thompson, G. B. & Tucker, G. J. Role of grain boundary character and its evolution on interfacial solute segregation behavior in nanocrystalline Ni-P. Acta Materialia 190, 113–123 (2020).
Article CAS Google Scholar
Jiang, H. & Faulkner, R. G. Modelling of grain boundary segregation, precipitation and precipitate-free zones of high strength aluminium alloys - I. The model. Acta Materialia 44, 1857–1864 (1996).
Article CAS Google Scholar
Wynblatt, P. & Chatain, D. Anisotropy of segregation at grain boundaries and surfaces (2006).
Scheiber, D., Jechtl, T., Svoboda, J., Fischer, F. D. & Romaner, L. On solute depletion zones along grain boundaries during segregation. Acta Materialia 182, 100–107 (2020).
Article CAS Google Scholar
Wagih, M. & Schuh, C. A. Thermodynamics and design of nanocrystalline alloys using grain boundary segregation spectra. Acta Materialia 217, 117177 (2021).
Article CAS Google Scholar
Wagih, M. & Schuh, C. A. Learning grain-boundary segregation: From first principles to polycrystals. Phys. Rev. Lett. 129, 46102 (2022).
Article CAS Google Scholar
Dösinger, C. et al. Efficient descriptors and active learning for grain boundary segregation. Phys. Rev. Mater. 7, 113606 (2023).
Article Google Scholar
Li, X., Li, Y. & Gao, W. An analytic descriptor for determining the effect of grain-boundary structures of metals on solute segregation. Journal of Applied Physics 135 (2024).
Mahmood, Y., Alghalayini, M., Martinez, E., Paredis, C. J. J. & Abdeljawad, F. Atomistic and machine learning studies of solute segregation in metastable grain boundaries. Sci. Rep. 12, 6673 (2022).
Article CAS PubMed PubMed Central Google Scholar
Kuo, S. M., Seki, A., Oh, Y. & Seidman, D. N. Solute-atom segregation: An oscillatory Ni profile at an internal interface in Pt(Ni). Phys. Rev. Lett. 65, 199–202 (1990).
Article CAS PubMed Google Scholar
Rajagopalan, M., Tschopp, M. A. & Solanki, K. N. Grain boundary segregation of interstitial and substitutional impurity atoms in alpha-iron. JOM 66, 129–138 (2014).
Article CAS Google Scholar
Detor, A. J. & Schuh, C. A. Grain boundary segregation, chemical ordering and stability of nanocrystalline alloys: Atomistic computer simulations in the Ni-W system. Acta Materialia 55, 4221–4232 (2007).
Article CAS Google Scholar
Wynblatt, P. & Chatain, D. Modeling grain boundary and surface segregation in multicomponent high-entropy alloys. Phys. Rev. Mater. 3, 054004 (2019).
Article CAS Google Scholar
Lee, H. B., Prinz, F. B. & Cai, W. Atomistic simulations of grain boundary segregation in nanocrystalline yttria-stabilized zirconia and gadolinia-doped ceria solid oxide electrolytes. Acta Materialia 61, 3872–3887 (2013).
Article CAS Google Scholar
Rittner, J. D. & Seidman, D. N. Solute-atom segregation to 〈110〉 symmetric tilt grain boundaries. Acta Materialia 45, 3191–3202 (1997).
Article CAS Google Scholar
Wagih, M. & Schuh, C. A. Spectrum of grain boundary segregation energies in a polycrystal. Acta Materialia (2019).
Huber, L., Hadian, R., Grabowski, B. & Neugebauer, J. A machine learning approach to model solute grain boundary segregation. npj Computational Mater. 4, 64 (2018).
Article Google Scholar
Huber, L., Grabowski, B., Militzer, M., Neugebauer, J. & Rottler, J. Ab initio modelling of solute segregation energies to a general grain boundary. Acta Materialia 132, 138–148 (2017).
Article CAS Google Scholar
Steinhardt, P. J., Nelson, D. R. & Ronchetti, M. Bond-orientational order in liquids and glasses. Phys. Rev. B 28, 784–805 (1983).
Article CAS Google Scholar
Wagih, M., Larsen, P. M. & Schuh, C. A. Learning grain boundary segregation energy spectra in polycrystals. Nat. Commun. 11, 6376 (2020).
Article CAS PubMed PubMed Central Google Scholar
Bartók, A. P., Kondor, R. & Csányi, G. On representing chemical environments. Phys. Rev. B - Condens. Matter Mater. Phys. 87, 184115 (2013).
Article Google Scholar
Kober, E. M., Tavenner, J. P., Adams, C. M. & Mathew, N. Strain functionals: A complete and symmetry-adapted set of descriptors to characterize atomistic configurations https://arxiv.org/abs/2402.04191v1 (2024).
Wagih, M. & Schuh, C. A. Grain boundary segregation beyond the dilute limit: Separating the two contributions of site spectrality and solute interactions. Acta Materialia 199, 63–72 (2020).
Article CAS Google Scholar
Lundberg, S. M. & Lee, S. I. A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems, vol. 2017-Decem, 4766–4775 https://github.com/slundberg/shap (2017).
Lundberg, S. M. et al. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2, 56–67 (2020).
Article PubMed PubMed Central Google Scholar
Csányi, G. et al. Expressive Programming for Computational Physics in Fortran 95+. IoP Comput. Phys. Newsletter Spring 2007 (2007).
Kermode, J. R. f90wrap: an automated tool for constructing deep Python interfaces to modern Fortran codes. J. Phys. Condens. Matter (2020).
Pozdnyakov, S. N. et al. Incompleteness of Atomic Structure Representations. Phys. Rev. Lett. 125, 166001 (2020).
Article CAS PubMed Google Scholar
Gupta, A. et al. On the mechanistic origins of maximum strength in nanocrystalline metals. npj Computational Mater. 6, 153 (2020).
Article CAS Google Scholar
Bartók, A. P. Gaussian Approximation Potential: an interatomic potential derived from first principles Quantum Mechanics http://arxiv.org/abs/1003.2817. (2010).
Bartõk, A. P. & Csányi, G. Gaussian approximation potentials: A brief tutorial introduction https://doi.org/10.1002/qua.24927 (2015).
Stukowski, A. Visualization and analysis of atomistic simulation data with OVITO - the Open Visualization Tool, Modelling Simul. Model. Simul. Mater. Sci. Eng. 18, 015012 (2010).
Article Google Scholar
Daw, M. S. & Baskes, M. I. Embedded-atom method: Derivation and application to impurities, surfaces, and other defects in metals. Phys. Rev. B 29, 6443–6453 (1984).
Article CAS Google Scholar
Finnis, M. W. & Sinclair, J. E. A simple empirical N-body potential for transition metals. Philos. Mag. A: Phys. Condens. Matter, Struct., Defects Mech. Prop. 50, 45–55 (1984).
Article CAS Google Scholar
Gruber, J., Lim, H., Abdeljawad, F., Foiles, S. & Tucker, G. J. Development of physically based atomistic microstructures: The effect on the mechanical response of polycrystals. Computational Mater. Sci. 128, 29–36 (2017).
Article CAS Google Scholar
Gupta, A., Rajaram, S. S., Thompson, G. B. & Tucker, G. J. Improved computational method to generate properly equilibrated atomistic microstructures. MethodsX 8, 101217 (2021).
Article CAS PubMed PubMed Central Google Scholar
Plimton, S. Fast parallel algorithms for short-range molecular dynamics. J. Comput. Phys. 117, 1–19 (1995).
Article Google Scholar
Thompson, A. P. et al. LAMMPS - a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales. Comput. Phys. Commun. 271, 108171 (2022).
Article CAS Google Scholar
Sheng, H. W., Ma, E. & Kramer, M. J. Relating dynamic properties to atomic structure in metallic glasses. JOM 64, 856–881 (2012).
Article CAS Google Scholar
Foley, D. & Tucker, G. J. Quantifying grain boundary damage tolerance with atomistic simulations. Model. Simul. Mater. Sci. Eng. 24 (2016).
Chen, T. & Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, vol. 13-17-Augu, 785–794 (Association for Computing Machinery, New York, NY, USA, 2016). https://doi.org/10.1145/2939672.2939785.
Oliphant, T. E. Python for scientific computing. Comput. Sci. Eng. 9, 10–20 (2007).
Article CAS Google Scholar
Millman, K. J. & Aivazis, M. Python for scientists and engineers (2011).
McKinney, W. Data Structures for Statistical Computing in Python. In Proceedings of the 9th Python in Science Conference, 51–56 http://conference.scipy.org/proceedings/scipy2010/mckinney.html (2010).
Hunter, J. D. Matplotlib: A 2D graphics environment. Comput. Sci. Eng. 9, 99–104 (2007).
Article Google Scholar
Bartók, A. P., Payne, M. C., Kondor, R. & Csányi, G. Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. Phys. Rev. Lett. 104, 136403 (2010).
Article PubMed Google Scholar
Coope, J. A. Irreducible Cartesian tensors. III. Clebsch-Gordan reduction. J. Math. Phys. 11, 1591–1612 (1970).
Article Google Scholar
Coope, J. A. & Snider, R. F. Irreducible Cartesian tensors. II. General formulation. J. Math. Phys. 11, 1003–1017 (1970).
Article Google Scholar
Coope, J. A. & Snider, R. F. Irreducible Cartesian tensors. J. Math. Phys. 43, 1003–1017 (1965).
Google Scholar
Mason, J. K. The relationship of the hyperspherical harmonics to SO(3), SO(4) and orientation distribution functions. Acta Crystallogr. Sect. A: Found. Crystallogr. 65, 259–266 (2009).
Article CAS Google Scholar
Jerphagnon, J., Chemla, D. & Bonneville, R. The description of the physical properties of condensed matter using irreducible tensors. Adv. Phys. 27, 609–650 (1978).
Article CAS Google Scholar
Auffray, N. Geometrical Picture of Third-Order Tensors, 22 https://doi.org/10.1007/978-3-642-36394-8 (2013).
Olive, M. & Auffray, N. Isotropic invariants of a completely symmetric third-order tensor. J. Math. Phys. 55, 40–48 (2014).
Article Google Scholar
Ahmad, F. Invariants of a cartesian tensor of rank 3. Arch. Mech. 63, 383–392 (2011).
Google Scholar
Lo, C.-H. & Don, H.-S. 3-D moment forms: their construction and application to object identification and positioning. IEEE Trans. Pattern Anal. Mach. Intell. 11, 1053–1064 (1989).
Article Google Scholar
Levine, I. N. Quantum Chemistry (Allyn & Bacon, Boston, MA, 1973), 2nd edn.
Edmonds, A. R. Angular Momentum in Quantum Mechanics (Princeton University Press, Princeton, NJ, 1974), 3rd edn.
Ramachandran, P. & Varoquaux, G. Mayavi: 3D visualization of scientific data. Comput. Sci. Eng. 13, 40–51 (2011).
Article Google Scholar

Download references

Acknowledgements

J.P.T., A. G., G.B.T., and G.J.T. are grateful for the support of the Army Research Office (ARO) through grant no. W911NF-17-1-0528. G.J.T. is grateful for the additional support of this work by the Eula Mae and John Baugh Chair in Physics at Baylor University. Simulations were performed using computational resources in the Computational Materials Science and Design Research Group at the Colorado School of Mines, HPC platforms from the High Performance Computing Center at the Colorado School of Mines, and NSF XSEDE resources under allocation number TG-DMR190011. Development of the SFD mathematical framework was performed at Los Alamos National Laboratory under Laboratory Direct Research and Development projects XWG7 and XX9A.

Author information

Authors and Affiliations

Colorado School of Mines, Department of Mechanical Engineering, Golden, CO, USA
Jacob P. Tavenner
Los Alamos National Laboratory, Los Alamos, NM, USA
Jacob P. Tavenner & Edward M. Kober
Department of Physics, Baylor University, Waco, TX, USA
Ankit Gupta & Garritt J. Tucker
University of Alabama, Metallurgical and Materials Engineering, Tuscaloosa, AL, USA
Gregory B. Thompson

Authors

Jacob P. Tavenner
View author publications
Search author on:PubMed Google Scholar
Ankit Gupta
View author publications
Search author on:PubMed Google Scholar
Gregory B. Thompson
View author publications
Search author on:PubMed Google Scholar
Edward M. Kober
View author publications
Search author on:PubMed Google Scholar
Garritt J. Tucker
View author publications
Search author on:PubMed Google Scholar

Contributions

J.P.T. assisted in development of the mathematical framework, implemented the simulations, designed, implemented the data analysis procedures and ML models, prepared the manuscript, and edited the manuscript. A.G. prepared the initial polycrystalline samples, assisted in calculation of segregation energies, and reviewed the manuscript preparation. E.M.K. conceived of and developed the initial mathematical framework for the SFDs, and reviewed the manuscript preparation. G.B.T. collaborated and provided scientific advice. G.J.T. conceived of the project, gave scientific and technical advice throughout the project, and reviewed the manuscript.

Corresponding author

Correspondence to Jacob P. Tavenner.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Materials thanks Daniel Scheiber, Yanchun Zhou and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Aldo Isidori. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Tavenner, J.P., Gupta, A., Thompson, G.B. et al. Learning grain boundary segregation behavior through fingerprinting complex atomic environments. Commun Mater 5, 183 (2024). https://doi.org/10.1038/s43246-024-00616-y

Download citation

Received: 09 December 2021
Accepted: 16 August 2024
Published: 10 September 2024
DOI: https://doi.org/10.1038/s43246-024-00616-y