A process-synergistic active learning framework for high-strength Al-Si alloys design

Cai, Jianming; Han, Mengxia; Yan, Xirui; Chen, Yan; Li, Daoxiu; Zhao, Kai; Zhang, Dongqing; Hu, Kaiqi; Sua, Heng Han; Jun, Hieng Kiat; Xie, Kewei; Liu, Guiliang; Liu, Xiangfa; Liu, Sida

doi:10.1038/s41524-025-01721-3

Download PDF

Article
Open access
Published: 14 July 2025

A process-synergistic active learning framework for high-strength Al-Si alloys design

Jianming Cai¹,
Mengxia Han²,
Xirui Yan¹,
Yan Chen¹,
Daoxiu Li²,
Kai Zhao³,
Dongqing Zhang⁴,
Kaiqi Hu⁵,
Heng Han Sua⁶,
Hieng Kiat Jun^7,8,9,
Kewei Xie²,
Guiliang Liu¹⁰,
Xiangfa Liu² &
…
Sida Liu¹

npj Computational Materials volume 11, Article number: 228 (2025) Cite this article

3083 Accesses
Metrics details

Subjects

Abstract

High-strength Al-Si alloys are important lightweight materials, but their optimal design is hindered by scarce-imbalance data, and complex compositional-process-property relationships. Traditional trial-and-error experimentation fails to explore this multi-dimensional design space, where processing routes (PRs) and composition must be co-optimized to achieve superior strength. This study introduces a process-synergistic active learning (PSAL) framework leveraging a conditional Wasserstein autoencoder (c-WAE) to enable the data-efficient design. By encoding PRs as conditional variables, the PSAL framework reveals exceptional synergistic effects across diverse PRs, significantly outperforming single-process approaches. The process-aware latent representation facilitates the efficient exploration of potential compositions across multi-PRs simultaneously. Through iterative active learning cycles integrating machine learning predictions with experimental validations, ultimate tensile strength is greatly improved: 459.8 MPa for gravity casting with T6 heat treatment within three iterations and 220.5 MPa for gravity casting with hot extrusion in a single iteration. This framework handles sparse datasets effectively, capturing complex process-composition-property relationships and establishing a new paradigm for accelerated multi-objective material design.

A rapid and effective method for alloy materials design via sample data transfer machine learning

Article Open access 22 February 2023

Effect of friction stir processing parameters on mechanical properties of different aluminum alloy plates

Article Open access 25 July 2025

Transforming machine learning model knowledge into material insights for multi-principal-element superalloy phase design

Article Open access 14 April 2025

Introduction

In modern engineering and manufacturing, Al-Si alloys are among the most widely used aluminum alloys due to their exceptional properties, including low density, high specific strength, excellent plasticity, superior electrical and thermal conductivity, and corrosion resistance^1,2,3,4. These characteristics make Al-Si alloys highly suitable for applications in the automotive, aerospace, and defense industries^5,6. As the demand for lightweight and high-performance materials continues to increase, developing and optimizing high-strength Al-Si alloys have become increasingly important⁷.

Designing high-performance Al-Si alloys remains challenging due to compositional and processing complexity. Alloying with various elements forms strengthening phases^8,9,10,11, creating a large compositional space with over 1000 possible combinations, even when considering only common metallic and rare earth elements^12,13. Additionally, Al-Si alloys can be processed through diverse manufacturing routes, such as heat treatment, hot extrusion, and rolling¹⁴. The combination of these compositional and processing variables results in a large design space, making it difficult to identify optimal solutions using traditional approaches, such as trial-and-error experiments combined with phase diagram calculations and density functional theory analysis^15,16,17,18. These methods often lead to prolonged development cycles, substantial costs, and intensive labor requirements^19,20,21.

Recent advances in machine learning (ML) have shown promise in accelerating materials design^{22,23,24,25,26,27} by directly linking alloy properties with composition and processing parameters, avoiding the cumulative errors that often arise from multiple-mechanism complexity²⁸. ML approaches have been successfully applied to various alloy designs with high performance^{29,30,31,32,33,34,35}, such as high entropy alloys^{24,33,34,35,36}, Al-based alloys^29,30,31,37, and Cu-based alloys³². These approaches primarily focused on the composition design of a specific process or the optimization of process parameters to achieve desired performance^31,32,33,35. However, they are largely limited to alloy design within a single process, requiring the construction of distinct datasets for different processes, which resulted in significant labor and time costs. Apparently, the above ML approaches cannot satisfy the demand for designing Al-Si alloys, which need diverse processes to achieve specific strength characteristics for different service environments, especially when requiring expensive and complex processes to design high-strength alloys for special applications, such as aerospace³⁸, where limited data hinders effective ML training. Although some conventional PRs, such as simple casting, can generate abundant data due to their accessibility, advanced PRs, such as specialized hot extrusion, yield limited experimental data due to their complexity and cost³⁹. This data imbalance across different PRs prevents traditional ML models from establishing robust composition-processing-property relationships⁴⁰.

Active learning has emerged as a promising solution to address small-sample challenges in alloy design^41,42, employing a surrogate model to iteratively select the most informative, though unseen, data points to accelerate high-performance alloy design⁴³. However, current active learning applications in alloy design primarily use simple surrogate models and Bayesian optimization^44,45, which are better suited for low-dimensional data and often require multiple iterations to achieve substantial performance improvements^46,47. Furthermore, these approaches typically treat different PRs independently and fail to leverage potential synergies between readily available and scarce experimental data.

To address these issues, in this work, we propose a process-synergistic active learning (PSAL) framework to enable the efficient design of high-strength Al-Si alloys through multi-process collaborative prediction. This novel framework uniquely consolidates data across different PRs, leveraging abundant data from accessible PRs to enhance prediction accuracy for complex processes with limited data. This approach integrates generative models to explore the most informative compositional space across various PRs, along with Bayesian-optimized surrogate models to select high-strength compositions. This framework significantly reduces the required number of iterations through iterative experimental validation and feedback loops while simultaneously accelerating the design of high-strength Al-Si alloys across multi-PRs.

Results

PSAL framework

To address the diverse processing requirements for optimizing Al-Si alloy performance and mitigating data imbalances associated with complex PRs, we introduce the PSAL framework. This framework enables the efficient design of high-strength alloys across multi-PRs, while minimizing iteration cycles and enhancing prediction accuracy, especially for processes with limited experimental data. The PSAL framework consists of five integrated components (Fig. 1): dataset construction (Fig. 1a), composition generation (Fig. 1b), surrogate model development (Fig. 1c), candidate selection (Fig. 1d), and experimental validation. The framework operates through an iterative workflow: dataset construction first provides a foundational dataset for downstream tasks. Next, composition generation produces initial candidate alloy formulations, followed by surrogate model development to build predictive models for alloy performance. Candidate selection then identifies the most promising formulations based on model predictions, and experimental validation tests these selected candidates, feeding results back into the model for subsequent optimization cycles.

**Fig. 1: Overview of the process-synergistic active learning framework.**

By integrating experimental and literature-derived data, a comprehensive database comprising 140 composition-process-property entries can be established. The dataset consists of seven principal alloying elements, namely, Mg, Cu, Ni, Zn, Fe, Mn, and Cr, in addition to Al and Si. These are studied across four distinct PRs: gravity casting (GC), GC with T6 heat treatment (GC + T6), GC with hot extrusion (GC + HE), and GC with combined hot extrusion and T6 treatment (GC + HE + T6). The detailed database construction methodology is presented in the Methods section.

The PSAL framework employs c-WAE architecture to generate a large number of potential Al-Si alloy compositions tailored to different processing requirements⁴⁸. The encoder transforms high-dimensional compositional data into a low-dimensional latent space while preserving critical physical attributes, thus enabling efficient representation learning. PRs, encoded as conditional variables through one-hot encoding, allow the model to establish process-specific alloy compositional clusters within distinct regions of the latent space. By sampling from these conditional regions, the model generates compositions potentially suitable for various PRs. The decoder then reconstructs these low-dimensional latent representations into specific alloy compositions. Visualization of the latent space reveals the intrinsic relationships among alloy compositions, PRs, and strength metrics, thus accelerating the identification of high-strength combinations across different PRs.

The PSAL framework develops an ensemble surrogate model by integrating the strength predictions from neural networks (NN) and extreme gradient boosting decision trees (XGBDT), leveraging their complementary advantages^49,50,51. Trained with alloy compositions and PRs as input features and strength as outputs, the model capitalizes on the NN to capture complex nonlinear relationships in high-dimensional spaces, while XGBDT enhances predictive accuracy through feature importance evaluation. For better prediction, the hyperparameters of both models are fine-tuned via Bayesian optimization and random search over iterations. The predicted strength of sampled compositions is then determined by averaging the outputs of both models with all hyperparameter configurations (Supplementary Fig. S1).

For optimizing candidate selection, a ranking criterion based on the exploration-exploitation strategy is implemented due to the probable bias and uncertainty between the predicted and experimental results. This strategy leverages the mean predicted strength value to facilitate exploration by identifying compositions with high strength. Meanwhile, the standard deviation is used to guide the selection towards candidates with low uncertainty. By integrating these two factors, the ranking criterion ensures a systematic balance between exploring new compositions and exploiting high-strength candidates (detailed ranking criterion is presented in the Methods)¹². Thus, the candidate compositions with high-ranking scores are prioritized. To ensure compositional diversity and mitigate experimental bias, alloy compositions within each validation set maintain a minimum differential of 0.5% in mass percent (MP) for at least one constituent element, as the following expression showed.

$$\left|{{\rm{MP}}}_{j,i}-{{\rm{MP}}}_{k,i}\right|\ge 0.5 \% ,$$

(1)

where j and k mean the j-th and k-th compositions in the set, respectively. The i represents the i-th element in the compositions.

The top-ranked three candidates undergo experimental validation, with results iteratively incorporated into the database for further model refinement. Through these iterative optimization cycles, the PSAL framework demonstrates exceptional effectiveness in identifying Al-Si alloy compositions with enhanced tensile strength.

Data distribution imbalance

From the analysis of the collected dataset, a significant imbalance in data volumes across different PRs is revealed (Fig. 2a). The data volumes for GC, GC + T6, GC + HE, and GC + HE + T6 PRs are 38, 70, 13, and 19, respectively. This imbalance primarily comes from the varying inherent complexity and economic considerations associated with each PR.

**Fig. 2: Dataset distribution analysis and multi-process synergistic effect on model performance.**

The GC PR is primarily used for casting ingots, which serve as precursors for heat treatment and hot extrusion. The GC + T6 PR incorporates T6 heat treatment of the cast ingots, optimizing the alloy’s grain structure and reinforcing precipitates (e.g., Si particles), thereby enhancing ultimate tensile strength (UTS), hardness, and corrosion resistance⁹. The T6 heat treatment significantly improves the UTS while maintaining cost efficiency, making it suitable for a wide range of applications, including automotive and construction industries.

In contrast, the GC + HE PR is more complex, requiring high-temperature mechanical extrusion of the cast ingots to improve the alloy’s mechanical properties. Hot extrusion demands higher equipment investment, greater energy consumption, and longer processing times, resulting in increased costs³⁹. The GC + HE + T6 PR, involving further heat treatment on the GC + HE-treated alloys, could continuously improve mechanical properties; however, it is more complex and costlier. Therefore, these processes have been typically used only when higher strength and specialized performance are required, such as in aerospace applications⁵². As a result, the data volumes for GC and GC + T6 PRs are generally larger than those for GC + HE and GC + HE + T6 PRs.

ML-based predictive models are traditionally constrained by the scarcity of experimental data. To overcome this limitation, we propose a novel approach that systematically leveraged system-level similarities and synergistic data effects to enhance model prediction accuracy, thereby enabling rapid iterative optimization of high-strength Al-Si alloys, particularly for GC + HE and GC + HE + T6 PRs.

Multi-process synergistic effect

The limited availability of process-specific experimental data can hinder the effective training and refinement of ML-based models. To address this, we propose an innovative approach that enhances model performance by systematic integration of data across multi-PRs. Crucially, the effectiveness of this approach fundamentally depends on two key factors: the existence of inherent correlations within the dataset and the potential for meaningful data synergies across different PRs.

Comprehensive correlation analysis reveals that the influence of alloy elements on mechanical properties varies significantly across different PRs (Fig. 2b and Supplementary Fig. S2a–c), indicating that single-process data alone inadequately captures the complete composition-performance relationships across various PRs and prevents direct generalization. Our multi-PRs synergistic approach addresses this limitation by enabling comprehensive mapping of composition-performance correlations across multi-PRs, thereby enhancing the predictive capability of the model.

We subsequently conduct t-distributed stochastic neighbor embedding analysis, which demonstrates that data points from different PRs exhibit overlapping distributions due to their shared alloy system and elemental composition (Fig. 2c). This intertwining indicates that the model successfully captures cross-process shared and similar characteristics⁵³, facilitating the improved alloy generalization of the model. Notably, the reconstructed data points in the 2D latent space maintain similar distribution patterns with the original dataset (Supplementary Fig. S3), validating the preservation of key compositional features and enhancing both the stability, accuracy and reliability of model predictions

Quantitative evaluation of model performance through comparative analysis of single-PR versus multi-PRs synergistic approaches reveals significant improvements in both accuracy and stability. Performance metrics, including mean absolute error (MAE) and coefficient of determination (R²) (detailed definitions provided in the Methods section), demonstrate substantially reduced prediction errors and improved prediction stability for multi-PRs synergistic data (Fig. 2d)^54,55. Specifically, for the synergistic GC PR, the XGBDT model achieves UTS prediction error of 9.3 MPa, with an error fluctuation of 6.9 MPa, both notably lower than those for single-process predictions (i.e., 17.7 and 13.2 MPa). The NN model exhibits a similar trend. The model selection and implementation details can be found in the Methods section.

Furthermore, the NN and XGBDT models both demonstrate superior performance on the multi-PRs synergistic test set, achieving R² values exceeding 0.9 (i.e., NN: 0.914, XGBDT: 0.912), significantly outperforming the single-process test sets (Fig. 2e). These findings strongly suggest that multi-PRs synergy effectively enhances both prediction accuracy and stability, which could be attributed to the increased data diversity and representativeness across multi-PRs, enabling a more robust understanding of the composition-performance relationships.

Process-aware compositional latent space

The c-WAE encoder effectively reduces dimensionality while enabling PRs to guide the organization of alloy data within distinct regions of the latent space (Fig. 3a). Four well-defined clusters can be distinguished, each corresponding to a specific PR with clear boundaries. Notably, alloys processed using GC + T6 PR demonstrate consistently superior UTS compared to other PRs, consistent with observations in Fig. 2a. Within each cluster, a systematic arrangement emerges from left to right based on Si content, with compositions featuring similar Si content naturally clustering together (Fig. 3a). This inherent organization facilitates targeted design of Al-Si alloys with varying Si content for specific PRs, such as hypoeutectic and hypereutectic Al-Si alloys^56,57.

**Fig. 3: Process-aware latent space analysis and sampling.**

A particularly significant feature of the latent space organization is the central positioning of GC PR data points, serving as a foundational reference nucleus for overall distribution. Other PRs generate clusters aligned diagonally around the central GC cluster, reflecting a hierarchical relationship in processing complexity. Notably, alloys produced via the GC + T6 PR are categorized as cast aluminum alloys, while those prepared via GC + HE and GC + HE + T6 PRs are classified as deformed aluminum alloys. The observed correlation between cluster distance from the GC reference and processing complexity aligns with established metallurgical principles (Fig. 2a)⁵⁸, where the greater the separation of the cluster from the central GC cluster, the more sophisticated the PRs, and the greater the enhancement in UTS.

This process-aware latent space organization demonstrates the proposed PSAL framework’s capability to capture and represent complex interactions among alloy compositions, PRs, and resultant UTS. The physically meaningful pattern of the latent space provided a robust foundation for sampling and designing high-strength alloy compositions through targeted exploration of specific regions associated with desired PRs and property combinations.

Latent space sampling and active learning iteration

By leveraging four distinct processing-specific clusters identified within the latent space (Fig. 3a), we aim to design high-strength Al-Si alloys using the GC + T6 and GC + HE PRs. The GC + T6 PR is selected for its superior mechanical properties, while the GC + HE PR is chosen to demonstrate the current model’s ability to effectively address the challenge of data scarcity.

The sampling workflow for the GC + T6 PR is illustrated in Fig. 3b. Initially, the Gaussian Mixture Model (GMM) is used to define the boundaries of the GC + T6 clusters (Supplementary Fig. S4). Then, the Markov Chain Monte Carlo (MCMC) method is employed to sample compositions within these clusters^59,60. During the sampling process, a trained binary classifier is used to validate the features of each sampled point (Supplementary Fig. S5). The validated compositions, which are marked as yellow symbol “+” in the GMM probability density map (Fig. 3c), are then reconstructed into high-dimensional space by the decoder, revealing their elemental and processing details.

The sampled alloy compositions and their respective PRs are employed to predict the UTS using the well-trained surrogate model and ranked by the ranking criterion ((details provided in the Methods section). Using this approach, top-ranked Al-Si alloy composition candidates can be well-identified (Supplementary Table S1), with the top three (A1, A2, and A3) selected for experimental validation (Table 1). The positions of these alloys are highlighted in the GMM probability density map (lower row of Fig. 3c).

Table 1 Composition (wt%) and UTS (MPa) of sampled Al-Si alloys in iterations

Full size table

Subsequently, alloy samples of these three compositions are cast into dog-bone-shaped specimens (Fig. 4b and Supplementary Fig. S6), and experimental tensile tests are performed to determine their actual UTS, as shown for the A1–A3 groups in Table 1. Notably, the experimental values do not fully align with the predicted range during the first iteration, underscoring the need to incorporate the new composition-performance data back into the original dataset. This feedback allows the PSAL framework to adjust model parameters and enhance prediction accuracy.

**Fig. 4: Experimental validation and mechanical performance of high-strength Al-Si alloys.**

During the second iteration, the latent space distribution and probability density of the GC + T6 PR are updated, indicating that the model could self-adjust and optimize data feature extraction for improved identification of high-strength Al-Si alloys. By the third iteration, the PSAL framework successfully predicts Al-Si alloys with UTS surpassing the original dataset. A comparison of experimental and predicted values for the GC + T6 PR over three iterations (Table 1, A1 to C3) shows that while the experimental values for one group do not fall within the predicted range in the first and second iterations, all experimental values fall within the predicted range in the third iteration. This reflects the continuous improvement in the model’s prediction accuracy and search ability for high-strength Al-Si alloys with further iterations (Fig. 4e and Supplementary Fig. S7).

For the GC + HE PR, only a single iteration is required to identify high-strength Al-Si alloys, as indicated in Table 1 (D1 to D3) and Fig. 3d. Note that the number of validated compositions (yellow “+”) for the GC + HE PR is much lower than for the GC + T6 PR, due to the smaller size of the initial GC + HE dataset. Experimental validation shows that alloys D1 and D2 outperform the original dataset, and their positions are marked within the probability density map. The need for only a single iteration can be attributed to the generally lower and more consistent performance distribution of the GC + HE PR, which enables the model to effectively capture the relationships among composition, PR, and strength, ultimately predicting Al-Si alloys with superior strength compared with the original dataset.

High-strength Al-Si alloys

Iterative optimization demonstrates the effectiveness of the PSAL framework in designing high-strength Al-Si alloys across multi-PRs within just a few iterations. As shown in Fig. 4d, f, for the GC + T6 PR, the UTS of the optimized alloys C1, C2, and C3 reach 436.6, 459.8, and 454.6 MPa, respectively, exceeding both the highest UTS reported in the literature data (425 MPa) and our own previous results (405 MPa). Similarly, for the GC + HE PR, alloys D1 and D2 achieve UTS values of 208.5 and 220.5 MPa, respectively, outperforming the literature maximum of 198 MPa as well as our prior experiments (191.8 MPa). Notably, our PSAL framework provides valuable insights into the optimal Si content ranges for high-strength Al-Si alloys, suggesting 6–8 wt% for the GC + T6 PR and around 12 wt% for the GC + HE PR. These findings aligned with previously proven values widely employed in practical applications, such as engine cylinder block and piston^61,62.

To gain deeper insights into the mechanisms responsible for the enhanced performance of alloys C2 and D2, comprehensive microstructural analyses are conducted, as shown in Fig. 4a–c. The X-ray diffractometer (XRD) results (Fig. 4a) indicate that, similar to conventional Al-Si alloys, the primary phases of both C2 and D2 predominantly consist of α-Al and eutectic Si phases. In alloy D2, the increased magnesium (Mg) content promotes the formation of amounts of Mg₂Si, which plays a critical role in improving its mechanical properties. In contrast, the relatively low concentrations of trace elements, such as iron (Fe), nickel (Ni), and manganese (Mn) in alloy C2 prevent the detection of corresponding phases in the XRD pattern. To further investigate the strengthening phases and their contribution to the alloy’s performance, scanning electron microscopy (SEM) combined with energy-dispersive X-ray spectroscopy (EDS) is employed. The SEM images and EDS mapping analyses of the tensile test specimens of the optimized Al-Si alloys are shown in Figs. 4b, c. In addition to eutectic Si, alloy C2 contains strengthening phases such as Q-Al₅Cu₂Mg₈Si₆, γ-Al₇Cu₄Ni, and α-Al(FeMn)Si. These phases are primarily located at the grain boundaries, strengthening them. The synergistic effects of these phases significantly contribute to the observed enhancement in mechanical properties, aligning with the predictions made by the PSAL framework. The SEM and EDS analysis of D2 (Fig. 4c) further corroborates the XRD findings, revealing that the microstructure primarily consists of eutectic Si and Mg₂Si phases, which are uniformly distributed within the aluminum matrix, thereby enhancing the mechanical properties of D2. Additional characterizations for other alloys (C1, C3, D1) can be found in Supplementary Fig. S8–S10. Spectral analysis of all the Al-Si alloy samples (C1–C3, D1–D2) confirms that the compositional deviations between the experimental compositions and predictions are within 0.5%, highlighting the accuracy of the framework’s predictions and the strong correlation between the designed and realized alloy compositions (Supplementary Table S2).

Discussions

The proposed PSAL framework demonstrates several key advantages in high-strength Al-Si alloy design. Primarily, the integration of multi-PRs into a unified model provides an effective solution for alloy design across diverse PRs. Instead of handling each process independently, and requiring separate models and datasets, our approach consolidates information from various PRs into a shared latent space. This integration not only provides a comprehensive understanding of how different PRs affect alloy performance, but also enhances predictive accuracy by enabling cross-pollination of insights across these PRs. For example, our PSAL framework successfully reconciles disparate mechanical properties resulting from casting versus hot extrusion and identifies high-strength compositions suitable for both processes. Notably, the universality of this approach can extend to other system-level attributes that share correlations or transferable information, such as testing temperature and stress levels of materials.

Subsequently, the PSAL framework effectively addresses the challenge of data scarcity, which often constrains alloy design studies, especially for specialized PRs. By leveraging GMM and MCMC sampling methods, the framework efficiently uses the available scarce data to generate new potential composition candidates. This capability may be particularly valuable for alloy materials with complex processing requirements, where data acquisition is often limited by experimental difficulty and cost, such as high-temperature Al-Si alloys used in aerospace applications⁵². Our framework mitigates these challenges by effectively modeling and generating high-performance candidates from limited initial data.

Finally, the PSAL framework is found to be particularly advantageous for designing alloys that require complex and expensive experimental validation. Many alloy design scenarios involve resource-intensive or time-consuming processes, such as specialized heat treatments or fatigue endurance testing. To address these challenges, our PSAL framework intentionally incorporates data from simpler and less costly experiments within the same alloy system, thereby augmenting the overall dataset and optimizing the target system. For example, when the hot extrusion process of heat-resistant Al-Cu alloy is too expensive for comprehensive exploration⁶³, data from room temperature heat treatment of the same alloy system can expand the database. This strategy not only conserves resources but also accelerates the discovery and design of high-performance materials.

Despite these advantages, the PSAL framework has several areas that warrant further improvement. One notable limitation stems from the limited availability of yield strength and elongation data in the reference dataset. During the data collection process for specific PRs, UTS data are present in nearly 100% of the collected alloy compositions, whereas yield strength and elongation data are frequently incomplete or missing. Especially in data-scarce and complex PRs, the absence of yield strength or elongation data can reach as high as 52.6% (Supplementary Table S13). Given that UTS is the most extensively studied mechanical property in aluminum alloy research, the current framework focuses on optimizing this single performance metric. Future developments should incorporate multiple performance metrics concurrently to facilitate the development of alloys with an optimal balance of properties. This would ultimately produce alloys with more comprehensive and industry-relevant performance profiles. However, these performances may also depend on other conditions, such as temperature, time, or stress levels, which are not always strictly related to PRs. Future improvements should consider a broader set of conditions in the optimization process, which would provide a more holistic view of alloy performance across different scenarios.

Another potential limitation is the reliance on surrogate models, such as NN and XGBDT, which may not always fully capture the complexity of the relationships between composition, PRs, and performance. Additionally, the optimization of process parameters has not been handled with sufficient granularity, and a more detailed treatment of processes may be required to improve the overall design accuracy. This lack of precision in process optimization could eventually necessitate the use of more sophisticated models. To address these challenges, incorporating more advanced ML techniques, such as graph neural networks or Transformer-based architectures^64,65, could enhance the prediction accuracy by better capturing the intricate interactions within the material system. Furthermore, the integration of additional data types, such as image-based information from field emission scanning electron microscopy (FESEM) with an EDS, could provide valuable insights into the microstructural behavior of the alloys. These data types would require more complex models to effectively interpret the information and improve the model’s predictive capabilities.

Unveiling the intrinsic relationships among composition, process, microstructure, and performance has long been a central challenge in materials design⁶⁶. This complexity arises from the inheritable characteristics of microstructure and performance, their dynamic evolution over time, and the nonlinear interactions between different processing steps. However, the proposed PSAL framework, with further optimization and extension, holds the potential to effectively address this issue, offering novel insights and methodologies for materials design and performance regulation.

In summary, the proposed PSAL framework offers an efficient and data-driven approach to the design of high-strength Al-Si alloys, particularly for PRs with scarce experimental data. By integrating diverse PRs into a unified model, this framework significantly enhances predictive accuracy and stability, especially for processes with limited datasets, enabling the identification of potential high-strength compositions. Experimental validations of the designed alloys, which achieve UTS surpassing existing benchmarks, demonstrate the effectiveness of this ML-driven, multi-PRs synergy approach in accurately predicting and optimizing alloy performance, even when data is limited.

Methods

Data creation and collection

The Al-Si alloys exhibited a wide range of compositions due to their various grades. To create a unified and high-quality dataset, alloy compositions and properties were compiled from our experimental data, supplemented by relevant data from existing literature with matching compositions, yielding an initial set of 170 records. To ensure dataset quality, the remaining data were stratified by processing route and randomly split into an 80% training set and a 20% test set. Within the training set, we performed five-fold cross-validation and calculated the regression residual for each sample; those that consistently exhibited large deviations across folds were cross-checked against their source publications, and if their reported processing conditions or performance values were physically implausible, they were excluded. This effort resulted in a dataset containing 140 composition-property pairs (Supplementary Table S13), with the specific process parameter ranges detailed in Supplementary Table S3. We considered four common PRs for Al-Si alloys, GC, GC with T6 heat treatment (GC + T6), GC with hot extrusion (GC + HE), and GC with hot extrusion followed by T6 heat treatment (GC + HE + T6). Due to the different parameters of each process, the high-dimensional parameter space necessitated a larger dataset, which proved to be impractical for small-sample and complex processes.

To address this, we introduced process codes (1, 2, 3, 4) to represent the processing flow and its internal parameters. In situations where literature data lacked specific processing parameters, we analyzed our accumulated experimental data to evaluate the impact of alloy composition, key processing parameters, and process codes on performance using Shapley Addictive exPlanation (SHAP) plots⁶⁷. Our findings revealed that the SHAP values for the process codes closely aligned with the combined SHAP values of multiple parameters (Supplementary Fig. S11), suggesting that the process codes could effectively replace these parameters. To remove implicit relationships between different processes, one-hot encoding was employed to convert process codes into numerical data suitable for computational analysis⁶⁸, represented as GC {0 0 0 1}, GC + T6 {0 0 1 0}, GC + HE {0 1 0 0}, and GC + HE + T6 {1 0 0 0} (Fig. 1a).

Implementation of surrogate model

In alloy composition-performance prediction, a wide variety of ML algorithms were employed. In this study, we compared four commonly used models, NN, XGBDT, random forest, and support vector machines. The dataset was randomly split into training FV² were used to evaluate the predictive accuracy of the models. The specific expressions for MAE and R² were provided below.

$${\rm{MAE}}=\frac{1}{n}\mathop{\sum}\limits_{i=1}^{n}\frac{\left|{y}_{i}-{\hat{y}}_{i}\right|}{{y}_{i}}$$

(2)

$${{\rm{R}}}^{2}=1-\frac{{\sum }_{i=1}^{n}{({y}_{i}-{\hat{y}}_{i})}^{2}}{{\sum }_{i=1}^{n}{({y}_{i}-{\bar{y}}_{i})}^{2}}$$

(3)

where y_i denoted the actual UTS value, ŷ_i was the model-predicted value, ȳ represented the mean of the actual values, and n was the sample size. MAE reflected the average absolute deviation between the predicted and actual values without considering the FV lower MAE value indicated smaller deviations between predicted and actual values, implying higher predictive accuracy of the model. R² measured the degree of fit between predicted and actual values, with a typical range between 0 and 1. An R² value of 1 represented a perfect fit, while R² = 0 indicated that the model does not explain the variance in the data.

The comparison results of the four models are shown in Supplementary Fig. S12. The NN and XGBDT models demonstrated low errors and high R² values on this dataset, indicating excellent training capability and generalization ability. Therefore, we used the same dataset to train these two models, and used the trained models to predict the performance respectively. Then, the predicted performance was combined, and the average value was the final predicted value of our surrogate model.

We applied a Bayesian optimization algorithm to fine-tune the hyperparameters of the two models, thereby improving their predictive performance. The detailed process was shown in Supplementary Fig. S1. First, we trained the NN and XGBDT models using the original dataset. It was important to note that the one-hot encoding step ensured the model allocated equal weight to each processing route. During training, Bayesian optimization was applied to explore the hyperparameters of the NN model (e.g., batch_size, learning_rate, num_hidden_layers) and the XGBDT model (e.g., colsample_bytree, learning_rate, max_depth) through 30 rounds of random parameter selection and 150 iterations of model training. A total of 180 sets of hyperparameter combinations and their corresponding minimized loss values were optimized. The models were then trained separately on the dataset and test set using these 180 hyperparameter configurations.

According to the R² value of the test set and the negative test loss in Bayesian optimization of the corresponding hyperparameters, the optimal 6 groups of hyperparameters were selected from high to low. The model hyperparameters in iterations of the PSAL framework were listed in attached Supplementary Tables S4–S9, and the corresponding model training results were also shown in Supplement Figs. S13–S18. Obviously, in each iteration, the six groups of hyperparameters optimized by Bayesian methods made the models achieve better prediction effects. Then, the trained six groups of NN models and six groups of XGBDT models were used to predict the latent space sample alloy composition, and their mean (pre_mean) was used as the predicted performance value, while their standard deviation (pre_std) was used as the floating range of the performance value.

Sampling in process-aware latent space

The PSAL framework first utilized c-WAE to reduce high-dimensional Al-Si alloy data from the dataset into a two-dimensional latent space. The architecture comprised an encoder and a decoder, both constructed from multiple fully connected layers. The encoder took a combined representation of alloy compositions and PRs, managed them through successive linear layers and ReLU activation functions⁶⁹, and output a two-dimensional representation of the latent space. The decoder then reconstructed the sampled alloy compositions from the joint representation of the latent space and PRs’ information.

The network was initialized using a specific weight initialization strategy to optimize the training process. The c-WAE model was trained with the Adam optimizer, a learning rate of 0.0005 and a weight decay of 0.0001⁷⁰, aiming to minimize a composite loss function that included reconstruction error (Recon_loss) and maximum mean discrepancy (MMD_loss)⁷¹. The model underwent training over 350 epochs, with a batch_size set to 50. Additionally, the MMD loss ensured that, while preserving the data structure, the distribution of the latent representations approximated the prior distribution. The progression of MMD_loss and Recon_loss throughout the training, depicted in Supplementary Fig. S19, demonstrated a steady decrease in error rate as the epochs increased, indicating effective training of the c-WAE model. Finally, a scatter plot visualized the data points in the latent space, with different colors representing different PRs, vividly illustrating how the model discriminated between data under various conditions in the latent space.

To further explore the latent space and generate new data points, we implemented an MCMC sampling strategy. Initially, sampling points were initialized from a trained GMM, and these points were iteratively updated using an acceptance-rejection algorithm. In each iteration, the acceptance probability of new sampling points was evaluated based on the probability score obtained from a binary classifier (similarity of the point’s features to the original data’s features) and the probability density function of the GMM.

Specifically, in MCMC sampling, the acceptance probability of a new sample point z_next was determined by comparing its relative probability to the current sample point z. Let P(z) denote the probability of the current sample point, and P(z_next) denote the probability of the newly generated sample point. These probabilities were derived by combining the probability density value of the GMM with the binary classifier’s score. If log P(z_next) exceeded log P(z), the new point was accepted, and it would be marked as yellow symbol “+” in GMM probability density map (Fig. 3c, d); otherwise, the new point’s acceptance probability was exp (log P(z_next) − log P(z)), i.e., P(z_next)/ P(z).

This process enabled efficient extraction of samples from the complex latent distribution, which were subsequently used for data reconstruction. Using the decoder part of the c-WAE model, the MCMC-sampled latent points were transformed back into the original data space. This step was performed by the conditional decoder, which took the latent variables and corresponding conditions and output the reconstructed data.

Ranking criteria

In the optimization of high-strength Al-Si alloy compositions, uncertainty played a pivotal role as it served as an indicator of how much a new composition differed from the existing dataset. By capturing this ‘gap,’’ uncertainty helped identify areas that remained unexplored, making it a valuable tool in guiding the discovery process. The process was supported by a strategic framework that balanced two essential components⁷²: exploration and exploitation. Exploration focused on identifying novel compositions by prioritizing those with higher uncertainty, thus encouraging the discovery of unconventional solutions. In contrast, exploitation honed in on compositions that demonstrated high predicted strength, enabling incremental improvements in the design process.

Even with well-trained prediction models, experimental results exhibited significant deviations from the predicted values due to the nonlinear relationship between composition and strength or the limitations of small datasets. To overcome these challenges, this study adopted the ranking criterion, which prioritized candidates by their relative position rather than their absolute values, balancing the contributions of predictions and uncertainty through a weighted approach. The ranking score (R_s) for each candidate was computed as a weighted combination of prediction and uncertainty rankings. This method enhanced the robustness and reliability of the model, ensuring more accurate alloy composition optimization. The details were as follows:

$${R}_{s}=p* {\rm{pre\_mean}}+q* {\rm{pre\_std}},$$

(4)

where p and q were represented importance assigned to prediction accuracy and uncertainty, which satisfied the relationship: p + q = 1. The pre_mean was the mean prediction of the surrogate model under several different hyperparameters, which represented the prediction accuracy, and the pre_std was the standard deviation of the surrogate model’s predictions, which represented the uncertainty. To further optimize the process, this study assigned higher weight to predicted performance (p = 0.85) compared to uncertainty (q = 0.15), reflecting a preference for refining known high-strength compositions while still allowing room for exploratory innovation. This ranking-based approach enabled a balanced exploration and exploitation, ensuring that the optimized Al-Si alloy compositions achieved both innovative breakthroughs and performance stability. The ranking criterion helped reduce the impact of model inaccuracies on candidate selection, making the high-strength Al-Si alloy optimization became not only more efficient but also more reliable. Notably, in each iteration of the active learning scheme in this study, candidate compositions were selected based on a descending rank of predicted performance (Supplementary Tables S1, S10–S12).

Material preparation and processing

Industrial-grade aluminum was melted in a graphite-clay crucible at 750 °C in an electric resistance furnace. Once fully melted, pure copper wrapped in aluminum foil and sponge zirconium were added, and the melt was held for at least 30 min. The molten alloy was poured into a metal mold preheated to 300 °C and into a cylindrical graphite mold (Φ90 mm). The metal mold was used for testing the mechanical properties of cast rods, while the graphite mold castings were used for hot extrusion. The heat treatment for cast rods involved a solution treatment at 480 °C for 6 h, increasing to 510 °C for an additional 4.5 h, followed by water quenching, and then aging at 175 °C for 6 h. The hot extrusion was performed at 510 °C with an extrusion ratio of 45:1.

Tensile testing

Room-temperature tensile tests were conducted on a universal testing machine (WDW-100D) with a crosshead speed of 2 mm/min. The room-temperature tensile specimen dimensions are shown in Supplementary Fig. S6. For each sample, tensile strength was averaged over 2–3 specimens.

Spectral composition analysis

Chemical composition significantly impacted the physical, chemical, and mechanical properties of the alloy. Each alloy composition was tested for its spectral composition before processing, performance testing, and microstructural analysis. Given that the spectral signal was sensitive to surface impurities, the sample testing surfaces were machined to ensure cleanliness and accuracy. An inductively coupled plasma optical emission spectrometer (ICP-OES, SpectroMax PE8000) was used to analyze the alloy composition after calibration with standard samples. Three to five surface points were randomly selected and averaged to obtain the actual composition. The carbon content in the alloys was determined using a carbon-sulfur analyzer (CS-2800) after grinding the alloy into fine chips and igniting it with tungsten flux in a ceramic crucible.

Microstructure observation

Samples were taken from the cast rods, mounted, ground, and polished. High-magnification morphology and point/area compositional analysis were conducted using a FESEM (FESEM, JSM-7800F) equipped with an energy-dispersive spectroscopy (EDS, OXFORD XMAX-80) detector at an accelerating voltage of 15 kV.

Software implementation

The proposed PSAL framework for high-strength Al-Si alloy design leveraged several well-established ML techniques and libraries. The GMM and cross-validation algorithms were implemented using the sklearn library. Bayesian optimization was performed using the bayes_opt library. The XGBDT model was implemented using the xgboost library. All algorithms and implementations in this work were conducted within the PyTorch v1.13.0 framework, running on an NVIDIA RTX 4090 GPU.

Data availability

The underlying code and Al-Si alloy datasets for this study can be accessed via this link: https://www.kaggle.com/code/caijianming/psal-al-si.

Code availability

The underlying code and Al-Si alloy datasets for this study can be accessed via this link: https://www.kaggle.com/code/caijianming/psal-al-si.

References

Ejiofor, J. U. & Reddy, R. G. Developments in the processing and properties of particulate Al-Si composites. JOM 49, 31–37 (1997).
Article CAS Google Scholar
Luo, Q. et al. Mechanism of Fe removal by Sn addition in Al-7Si-1Fe alloy. J. Alloy. Compd. 948, 169724 (2023).
Article CAS Google Scholar
Zhu, L., Zhang, Y., Luo, Q., Peng, L. & Li, Q. Effectively refining Al-10Si alloy via Al-Ti-Nb-B refiner with Nb2O5. J. Mater. Sci. Technol. 176, 204–210 (2024).
Article CAS Google Scholar
Zhao, Z. et al. Insights into the dual effects of Ti on the grain refinement and mechanical properties of hypoeutectic Al–Si alloys. J. Mater. Sci. Technol. 189, 44–59 (2024).
Article CAS Google Scholar
Tamuly, R., Behl, A. & Borkar, H. Effect of addition of grain refiner and modifier on microstructural and mechanical properties of squeeze cast A356 alloy. Trans. Indian Inst. Met. 75, 2395–2408 (2022).
Article CAS Google Scholar
Shiga, K., Fujiwara, T., Murakami, Y. & Omura, N. In situ X-ray imaging of α-Al(FeMn)Si grain evolution during solidification of an Al-Si alloy with electromagnetic stirring. J. Alloy. Compd. 1004, 175866 (2024).
Article CAS Google Scholar
Gao, J. & Li, Z. Current situation and prospect of computationally assisted design in high-performance additive manufactured Aluminum alloys: a review. Acta Metall. Sin. 59, 87–105 (2022).
Google Scholar
Yamasaki, S. et al. Effect of Fe addition on heat-resistant aluminum alloys produced by selective laser melting. Metals 9, 468 (2019).
Article CAS Google Scholar
Zhang, B. et al. Effect of heat treatment on the microstructure and mechanical properties of Er-containing Al–7Si–0.6 Mg alloy by laser powder bed fusion. J. Mater. Res. Technol. 18, 3073–3084 (2022).
Article CAS Google Scholar
Wang, H. et al. Deformation-induced crystalline-to-amorphous phase transformation in a CrMnFeCoNi high-entropy alloy. Sci. Adv. 7, eabe3105 (2021).
Article CAS PubMed PubMed Central Google Scholar
Yi, W., Liu, G., Lu, Z., Gao, J. & Zhang, L. Efficient alloy design of Sr-modified A356 alloys driven by computational thermodynamics and machine learning. J. Mater. Sci. Technol. 112, 277–290 (2022).
Article CAS Google Scholar
Rao, Z. et al. Machine learning–enabled high-entropy alloy discovery. Science 378, 78–85 (2022).
Article CAS PubMed Google Scholar
Gao, T., Gao, J., Zhang, J., Song, B. & Zhang, L. Development of an accurate “composition-process-properties” dataset for SLMed Al-Si-(Mg) alloys and its application in alloy design. J. Mater. Inf. 3, 6 (2023).
Article CAS Google Scholar
Luo, A. A., Sachdev, A. K. & Apelian, D. Alloy development and process innovations for light metals casting. J. Mater. Process. Technol. 306, 117606 (2022).
Article CAS Google Scholar
Mao, H., Chen, H.-L. & Chen, Q. TCHEA1: a thermodynamic database not limited for “high entropy” alloys. J. Phase Equilibra Diffus. 38, 353–368 (2017).
Article CAS Google Scholar
Körmann, F. et al. Treasure maps” for magnetic high-entropy-alloys from theory and experiment. Appl. Phys. Lett. 107, 142404 (2015).
Article Google Scholar
Rao, Z. et al. Invar effects in FeNiCo medium entropy alloys: from an Invar treasure map to alloy design. Intermetallics 111, 106520 (2019).
Article CAS Google Scholar
Liu, S. et al. A crystal glass–nanostructured Al-based electrocatalyst for hydrogen evolution reaction. Sci. Adv. 8, eadd6421 (2022).
Article CAS PubMed PubMed Central Google Scholar
Curtarolo, S. et al. The high-throughput highway to computational materials design. Nat. Mater. 12, 191–201 (2013).
Article CAS PubMed Google Scholar
Oganov, A. R., Pickard, C. J., Zhu, Q. & Needs, R. J. Structure prediction drives materials discovery. Nat. Rev. Mater. 4, 331–348 (2019).
Article Google Scholar
Ramprasad, R., Batra, R., Pilania, G., Mannodi-Kanakkithodi, A. & Kim, C. Machine learning in materials informatics: recent applications and prospects. npj Comput. Mater. 3, 54 (2017).
Article Google Scholar
Wang, C., Tan, X. P., Tor, S. B. & Lim, C. S. Machine learning in additive manufacturing: state-of-the-art and perspectives. Addit. Manuf. 36, 101538 (2020).
Google Scholar
Tabor, D. P. et al. Accelerating the discovery of materials for clean energy in the era of smart automation. Nat. Rev. Mater. 3, 5–20 (2018).
Article CAS Google Scholar
Sasidhar, K. N. et al. Enhancing corrosion-resistant alloy design through natural language processing and deep learning. Sci. Adv. 9, eadg7992 (2023).
Article CAS PubMed PubMed Central Google Scholar
Jiang, L. et al. A rapid and effective method for alloy materials design via sample data transfer machine learning. npj Comput Mater. 9, 26 (2023).
Article CAS Google Scholar
Chen, Y. et al. MatterGPT: A generative transformer for multi-property inverse design of solid-state materials. Preprint at https://doi.org/10.48550/arXiv.2408.07608 (2024).
Xiao, H. et al. An invertible, invariant crystal representation for inverse design of solid-state materials using generative deep learning. Nat. Commun. 14, 7027 (2023).
Article CAS PubMed PubMed Central Google Scholar
Shen, C. et al. Physical metallurgy-guided machine learning and artificial intelligent design of ultrahigh-strength stainless steel. Acta Mater. 179, 201–214 (2019).
Article CAS Google Scholar
Juan, Y. et al. Accelerated design of Al−Zn−Mg−Cu alloys via machine learning. T Noferr. Metal. Soc. 34, 709–723 (2024).
Article CAS Google Scholar
Huang, J., Ando, D. & Sutou, Y. Heat-resistant aluminum alloy design using explainable machine learning. Mater. Des. 243, 113057 (2024).
Article CAS Google Scholar
Chen, Z. Q., Shang, Y. H., Liu, X. D. & Yang, Y. Accelerated discovery of eutectic compositionally complex alloys by generative machine learning. npj Comput. Mater. 10, 204 (2024).
Article CAS Google Scholar
Zhang, H., Fu, H., Zhu, S., Yong, W. & Xie, J. Machine learning assisted composition effective design for precipitation strengthened copper alloys. Acta Mater. 215, 117118 (2021).
Article CAS Google Scholar
Wen, C. et al. Machine learning assisted design of high entropy alloys with desired property. Acta Mater. 170, 109–117 (2019).
Article CAS Google Scholar
Liu, S. et al. A comparative study of predicting high entropy alloy phase fractions with traditional machine learning and deep neural networks. npj Comput. Mater. 10, 172 (2024).
Article Google Scholar
Yang, C. et al. A machine learning-based alloy design system to facilitate the rational design of high entropy alloys with enhanced hardness. Acta Mater. 222, 117431 (2022).
Article CAS Google Scholar
He, J. et al. Machine learning-assisted design of high-entropy alloys with superior mechanical properties. J. Mater. Res. Technol. 33, 260–286 (2024).
Article CAS Google Scholar
Park, S. et al. High strength aluminum alloys design via explainable artificial intelligence. J. Alloy. Compd. 903, 163828 (2022).
Article CAS Google Scholar
Starke, E. A. & Staley, J. T. Application of modern aluminum alloys to aircraft. Prog. Aerosp. Sci. 32, 131–172 (1996).
Article Google Scholar
Noga, P., Skrzekut, T. & Wędrychowicz, M. Microstructure and mechanical properties of Al-Si alloys produced by rapid solidification and hot extrusion. Materials 16, 5223 (2023).
Article CAS PubMed PubMed Central Google Scholar
Kaur, H., Pannu, H. S. & Malhi, A. K. A systematic review on imbalanced data challenges in machine learning: applications and solutions. ACM Comput. Surv. 52, 79:1–79:36 (2019).
Google Scholar
Balachandran, P. V., Kowalski, B., Sehirlioglu, A. & Lookman, T. Experimental search for high-temperature ferroelectric perovskites guided by two-step machine learning. Nat. Commun. 9, 1668 (2018).
Article PubMed PubMed Central Google Scholar
Pei, X., Pei, J., Hou, H. & Zhao, Y. Optimizing casting process using a combination of small data machine learning and phase-field simulations. npj Comput. Mater. 11, 27 (2025).
Article Google Scholar
Gubernatis, J. E. & Lookman, T. Machine learning in materials design and discovery: examples from the present and suggestions for the future. Phys. Rev. Mater. 2, 120301 (2018).
Article CAS Google Scholar
Jung, J., Yoon, J. I., Park, H. K., Kim, J. Y. & Kim, H. S. Bayesian approach in predicting mechanical properties of materials: application to dual phase steels. Mater. Sci. Eng. A. 743, 382–390 (2019).
Article CAS Google Scholar
Yamawaki, M., Ohnishi, M., Ju, S. & Shiomi, J. Multifunctional structural design of graphene thermoelectrics by Bayesian optimization. Sci. Adv. 4, eaar4192 (2018).
Article PubMed PubMed Central Google Scholar
Xue, D. et al. Accelerated search for materials with targeted properties by adaptive design. Nat. Commun. 7, 11241 (2016).
Article CAS PubMed PubMed Central Google Scholar
Yuan, R. et al. Accelerated discovery of large electrostrains in BaTiO3 -based piezoelectrics using active learning. Adv. Mater. 30, 1702884 (2018).
Zhao, T., Zhao, R. & Eskenazi, M. Learning discourse-level diversity for neural dialog models using conditional variational autoencoders. In: Proc. 55th Annual Meeting of the Association for Computational Linguistics 654–664 (ACL, 2017).
Kriegeskorte, N. & Golan, T. Neural network models and deep learning. Curr. Biol. 29, R231–R236 (2019).
Article PubMed Google Scholar
Binhao, F., Hong, H. & Ziyun, Z. Improve the application of XGBDT in network abnormal traffic detection. In: Proc. IEEE International Conference on Emergency Science and Information Technology (ICESIT) 193–200 (2021).
Zhang, L., Zhang, J. & Nie, Q. DIRECT-NET: an efficient method to discover cis-regulatory elements and construct regulatory networks from single-cell multiomics data. Sci. Adv. 8, eabl7393 (2022).
Article CAS PubMed PubMed Central Google Scholar
Li, S. et al. Development and applications of aluminum alloys for aerospace industry. J. Mater. Res. Technol. 27, 944–983 (2023).
Article CAS Google Scholar
van der Maaten, L. & Hinton, G. Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008).
Google Scholar
Cabuk, K. S. et al. Chasing the objective upper eyelid symmetry formula; R2, RMSE, POC, MAE, and MSE. Int. Ophthalmol. 44, 303 (2024).
Article Google Scholar
Mohammad, N. et al. Predicting the age of field Anopheles mosquitoes using mass spectrometry and deep learning. Sci. Adv. 10, eadj6990 (2024).
Article CAS PubMed PubMed Central Google Scholar
Mohanty, P. S. & Gruzleski, J. E. Grain refinement mechanisms of hypoeutectic Al-Si alloys. Acta Mater. 44, 3749–3760 (1996).
Article CAS Google Scholar
Kang, N., Coddet, P., Liao, H., Baur, T. & Coddet, C. Wear behavior and microstructure of hypereutectic Al-Si alloys prepared by selective laser melting. Appl. Surf. Sci. 378, 142–149 (2016).
Article CAS Google Scholar
Liwen, X. et al. Progress in effects of squeeze casting process parameters and heat treatment on microstructure and properties of Al-Si alloy. Spec. Cast. Nonferrous Alloy. 41, 842–848 (2023).
Google Scholar
McLachlan, G. J. & Rathnayake, S. On the number of components in a Gaussian mixture model. Wires Data Min. Knowl. 4, 341–355 (2014).
Article Google Scholar
Mira, A. Ordering and improving the performance of Monte Carlo Markov chains. Stat. Sci. 16, 340–350 (2001).
Article Google Scholar
Wang, P., Sun, H., Zhang, Y., Zhang, J. & Li, Q. Influences of Al2.5Cu0.5Ti master alloy on micro-structure and mechanical properties of Al-7Si alloy. J. Mater. Eng. 51, 84–92 (2023).
Google Scholar
Jiang, F., Zhang, G., Zhang, Z., Suo, Z. & Wang, Y. Microstructure and mechanical properties of Al-12Si alloy under pressurized solidification. Spec. Cast. Nonferrous Alloy. 37, 1081–1083 (2017).
Google Scholar
Fu, X. et al. Effect of Mn/Ag ratio on microstructure and mechanical properties of heat-resistant Al-Cu alloys. Materials 17, 1371 (2024).
Article CAS PubMed PubMed Central Google Scholar
Wu, Z. et al. A comprehensive survey on graph neural networks. IEEE Trans. Neural Netw. Learn. Syst. 32, 4–24 (2021).
Article PubMed Google Scholar
Kitaev, N., Kaiser, L. & Levskaya, A. Reformer: the efficient transformer. In: Proc. 8th International Conference on Learning Representations (ICLR) 1–12 (ICLR, 2020).
Jiang L. Research on next-generation advanced aluminum alloys with high strength and high toughness via data-driven methods. (University of Science and Technology Beijing, 2024).
Marcílio, W. E. & Eler, D. M. From explanations to feature selection: assessing SHAP values as feature selection mechanism. In: Proc. 33rd SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) 340–347 (SIBGRAPI, 2020).
Rodríguez, P., Bautista, M. A., Gonzàlez, J. & Escalera, S. Beyond one-hot encoding: lower dimensional target embedding. Image Vis. Comput 75, 21–31 (2018).
Article Google Scholar
Petersen, P. & Voigtlaender, F. Optimal approximation of piecewise smooth functions using deep ReLU neural networks. Neural Netw. 108, 296–330 (2018).
Article PubMed Google Scholar
Zhang, Z. Improved Adam optimizer for deep neural networks. In: Proc. IEEE/ACM 26th International Symposium on Quality of Service (IWQoS) 1–2 (IEEE, 2018).
Borgwardt, K. M. et al. Integrating structured biological data by Kernel maximum mean discrepancy. Bioinformatics 22, e49–e57 (2006).
Article CAS PubMed Google Scholar
Krause, A. & Guestrin, C. Nonmyopic active learning of Gaussian processes: an exploration-exploitation approach. In: Proc. 24th International Conference on Machine Learning 449–456 (ICML, 2007).

Download references

Acknowledgements

We thank the Laboratory for Multiscale Mechanics and Medical Science of Xi’an Jiaotong University and the Key Laboratory for Liquid–Solid Structural Evolution and Processing of Materials of Shandong University for providing help. This work was supported by the open subject of the State Key Laboratory of Powder Metallurgy, Central South University (SKLPM-KF-003), a grant from the National Natural Science Foundation of China (52471142 and 52301167), the National Youth Talent Program, Ministry of Industry and Information Technology of China (GQQNKP005), the open subject of the State Key Laboratory of Solidification Processing, Northwestern Polytechnical University (SKLSP202403), the National Natural Science Foundation of China (12302140), the Fundamental Research Funds for the Central Universities of China (sxzy012023213), China Postdoctoral Science Foundation (2023M732794) and Postdoctoral Fellowship Program (Grade B) of China Postdoctoral Science Foundation (GZB20230575).

Author information

Authors and Affiliations

Laboratory for Multiscale Mechanics and Medical Science, SV LAB, School of Aerospace, Xi’an Jiaotong University, Xi’an, China
Jianming Cai, Xirui Yan, Yan Chen & Sida Liu
Key Laboratory for Liquid‒Solid Structural Evolution and Processing of Materials, Ministry of Education, Shandong University, Jinan, China
Mengxia Han, Daoxiu Li, Kewei Xie & Xiangfa Liu
School of Mechanical Engineering, Shandong University of Technology, Zibo, China
Kai Zhao
Center for Alloy Innovation and Design (CAID), State Key Laboratory for Mechanical Behavior of Materials, Xi’an Jiaotong University, Xi’an, China
Dongqing Zhang
Xi’an Rare Metal Materials Institute Co. Ltd, Xi’an, China
Kaiqi Hu
Schott Glass Malaysia, Perai, Malaysia
Heng Han Sua
Department of Mechanical and Materials Engineering, Lee Kong Chian Faculty of Engineering and Science, Universiti Tunku Abdul Rahman, Kajang, Malaysia
Hieng Kiat Jun
Centre for Sustainable Mobility Technologies, Universiti Tunku Abdul Rahman, Kajang, Malaysia
Hieng Kiat Jun
Centre for Advanced and Sustainable Materials Research, Universiti Tunku Abdul Rahman, Kajang, Malaysia
Hieng Kiat Jun
Shandong Crystal Seed Materials Co. Ltd, Linyi, China
Guiliang Liu

Authors

Jianming Cai
View author publications
Search author on:PubMed Google Scholar
Mengxia Han
View author publications
Search author on:PubMed Google Scholar
Xirui Yan
View author publications
Search author on:PubMed Google Scholar
Yan Chen
View author publications
Search author on:PubMed Google Scholar
Daoxiu Li
View author publications
Search author on:PubMed Google Scholar
Kai Zhao
View author publications
Search author on:PubMed Google Scholar
Dongqing Zhang
View author publications
Search author on:PubMed Google Scholar
Kaiqi Hu
View author publications
Search author on:PubMed Google Scholar
Heng Han Sua
View author publications
Search author on:PubMed Google Scholar
Hieng Kiat Jun
View author publications
Search author on:PubMed Google Scholar
Kewei Xie
View author publications
Search author on:PubMed Google Scholar
Guiliang Liu
View author publications
Search author on:PubMed Google Scholar
Xiangfa Liu
View author publications
Search author on:PubMed Google Scholar
Sida Liu
View author publications
Search author on:PubMed Google Scholar

Contributions

S.D.L., Y.C. and J.M.C. conceptualize the study. J.M.C. and Y.C. contribute to the methodology. S.D.L., J.M.C., X.R.Y., K.Z., D.X.L., K.W.X., K.Q.H., D.Q.Z., H.H.S. and H.K.J. are responsible for dataset construction. G.L.L., M.X.H. and X.F.L. conduct the experimental verification. J.M.C., Y.C. and S.D.L. write the original draft of the manuscript. All authors review and edit the manuscript.

Corresponding authors

Correspondence to Yan Chen or Sida Liu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Cai, J., Han, M., Yan, X. et al. A process-synergistic active learning framework for high-strength Al-Si alloys design. npj Comput Mater 11, 228 (2025). https://doi.org/10.1038/s41524-025-01721-3

Download citation

Received: 08 February 2025
Accepted: 27 June 2025
Published: 14 July 2025
DOI: https://doi.org/10.1038/s41524-025-01721-3

Subjects

Abstract

Similar content being viewed by others

A rapid and effective method for alloy materials design via sample data transfer machine learning

Effect of friction stir processing parameters on mechanical properties of different aluminum alloy plates

Transforming machine learning model knowledge into material insights for multi-principal-element superalloy phase design

Introduction

Results

PSAL framework

Data distribution imbalance

Multi-process synergistic effect

Process-aware compositional latent space

Latent space sampling and active learning iteration

High-strength Al-Si alloys

Discussions

Methods

Data creation and collection

Implementation of surrogate model

Sampling in process-aware latent space

Ranking criteria

Material preparation and processing

Tensile testing

Spectral composition analysis

Microstructure observation

Software implementation

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links