Abstract
A machine learning model was developed to predict the oxidation resistance of Ti-V-Cr burn-resistant titanium alloy, and the natural logarithm of the parabolic oxidation rate constant (lnkp) was utilized as the model output. The results show that the two algorithms based on multiple learners, gradient boosting decision tree (GBDT) and eXtreme Gradient Boosting (XGBoost), show better performance. The coefficient of determination R2 of the models are 0.98 and the maximum error is 6.57 and 6.40%, respectively. The importance and interpretability of the input features were analyzed. The trend of the model analysis results was the same as that of the experimental conclusions, which further revealed the mechanism of the influence of element content and temperature changes on the oxidation resistance of Ti-V-Cr alloys and verified the effectiveness of the model. This study is of great significance for the discovery, prediction, and quantification of new high-temperature oxidation-resistant Ti-V-Cr alloys.
Similar content being viewed by others
Introduction
The development and application of Ti-V-Cr burn-resistant titanium alloys is a direct and effective way to solve the “titanium fire” accident as well as the key basis for the successful design and production of advanced high thrust-to-weight ratio aeroengines1,2,3,4. However, the lack of high-temperature oxidation resistance is one of the main reasons restricting its development. Alloying is one of the significant ways to improve the high-temperature oxidation resistance5. With the increase in the maximum operating temperature of the titanium alloys, the alloy composition system tends to be much more complicated6. In addition, it is necessary to accurately control the alloying elements based on the high-temperature oxidation resistance and mechanical properties of the alloys. Generally, the oxidation resistance properties of titanium alloys can be quantified and evaluated by correlating them with parabolic oxidation rate constant Kp according to Wagner's theory of metal oxidation7.
The lower parabolic oxidation rate constant denotes the better oxidation resistance of alloys. Plenty of research, including isothermal oxidation weight gain experiments with different exposure durations, have been carried out to explore the effects of different alloying elements on the oxidation resistance of titanium alloys6,8,9,10. STRINGER J. et al. originally studied the oxidation behavior of commercially pure titanium in the temperature range of 1073–1473 K in 196011. Kitashima T. et al. revealed the enhancement effect of Nb, Si, Zr, and Hf on the oxidation resistance of titanium alloy and the negative effect of Sn on its oxidation resistance by studying the oxidation behavior of binary α titanium alloy Ti-X (X = Al, Zr, Si, Hf, Nb, Ge, and Sn)12,13. VOJTECH D. et al.14 and KNALSLOVA A. et al.15 demonstrated the effect of Si elements on the oxidation resistance of titanium alloys through oxidation experiments. Dai et al. summarized the high-temperature oxidation behavior experiments of titanium alloy and the effects of various alloying elements and coatings on the oxidation resistance of titanium alloy and titanium-aluminum alloy6. However, the oxidation weight gain experiment is time-consuming and laborious, and the oxidation resistance of the alloy can only be obtained at a specific combination of elements and oxidation temperature. Yang et al.12 conducted an isothermal oxidation experiment for 500 h on nearly α titanium alloys containing Ga at 650, 700, and 750 °C. It was found that the oxidation kinetics of the alloys followed the parabolic oxidation rule at 650 °C and the cubic oxidation rule at 700 and 750 °C, respectively. Cruchley et al.16 obtained seven oxidation weight gain curves as long as 5000 h at different temperatures.
In order to tackle the issues of low efficiency and time-costing of traditional optimization methods based on experimental design, several theoretical calculation methods, namely density functional theory17,18,19 including molecular dynamics simulation20,21,22, phase field model23,24,25, and phase diagram calculation (CALPHAD)26,27,28 have been utilized to study the high-temperature oxidation behavior of titanium alloys. Ohler et al.29 calculated the separation work at the interface between TiO2 and titanium substrate by the DFT method, indicating that there is a strong bond between the substrate and the oxide film. Wu et al.30 studied the diffusion of oxygen in α titanium and the impact of other solid solutions on oxygen diffusion. Bhattacharya et al.31 explored the effect of the Si element on the surface oxide film of titanium alloy through a molecular dynamics simulation method based on DFT. However, due to the limited computing ability and resources, such calculation methods have previously focused on studying the effects of individual elements on oxidation resistance and elucidating the oxidation mechanism of alloys32,33,34,35. With the increase of alloying elements, the relationship between alloy properties and component elements tends to be much more complicated36,37. Apart from that, there exists interactions between components which comes up with a greater challenge to the design and optimization of alloy components38,39. The calculation and experimental verification costs of traditional alloy design methodologies are extremely high. Under this circumstance, new alloy composition design and performance prediction methods are still to be developed.
In recent years, the rapid development of machine learning and data science has provided new possibilities for the design of multi-component alloys40. The concept of machine learning was first proposed by Samuel in 195941. Based on a large number of known data and algorithms, machine learning mimics the human learning process and realizes the prediction of the relationship between alloy composition and macroscopic properties of materials through feature extraction and model training42. With the assistance of the trained machine learning model, a single objective or multi-objective optimization process is able to be carried out to obtain the most appropriate alloy composition combination43. The active learning method inspired by machine learning and data statistics has a simple concept and clear logic, which can effectively reduce the calculation and time cost of alloy composition design44. With respect to the machine learning algorithm for aeroengine titanium alloy, there are several specific characteristics. Firstly, Ti element can form a solid solution with Al, Nb, Cr, V, Zr, Sn, and other metal elements33,45,46; Secondly, although the solid solubility of C, Si, N, O, and other nonmetallic elements is low, they can form precipitate phases in the substrate which has a significant effect on the plasticity and creep strength of the alloy47,48,49,50. Thirdly, the phase transformation process of titanium alloy substrate is fairly complicated, while different alloying elements have distinct effects on the stability of phase structure51. Subsequently, alloying elements react with each other leading to the formation of complex multi-component compounds such as Ti2AlNb and Ti2AlC52,53. Lastly, the number of alloying elements of high-temperature titanium alloys above 550 °C can reach 10 or even larger54. These factors lead to a significantly complicated and implicit relationship between the element compositions and alloy properties of aeroengine titanium alloys. Therefore, the researchers modified the machine learning model by introducing special parameters and other methods to make it more suitable for titanium alloy systems. For example, Yang et al.55 embedded the Mo equivalent and cluster equation into the model in the Ti-Mo-Nb-Zr-Sn-Ta alloy system, and five kinds of low-elastic β-Ti-alloys were designed by combining XGBoost with a genetic algorithm, which well predicted the influence of the change of Mo element content on the properties of titanium alloys. Among them, the Mo equivalent is closely related to the proportion of β phase in titanium alloy56,57. The introduction of Mo equivalent can improve the perception of physical information in the model, and its expression is shown in equation (1).
In this work, the isothermal oxidation test experiments of Ti-V-Cr burn-resistant alloys were systematically carried out. Different machine learning models were established and trained based on experimental data and data accessed from the literature. The oxidation resistance mechanism of the Ti-V-Cr alloys was analyzed by combining the SHAP interpretability analysis, feature importance ranking, and material characterization results. Additionally, the Bayesian optimization algorithm was adopted to improve the performance of the models so that the models could accurately predict the oxidation resistance of different alloys. Through this study, the efficiency of the Ti-V-Cr alloy design process will be greatly improved. The Ti-V-Cr alloy properties database and accurate prediction models were established and verified by the comparison between the experimental results and predictions of machine learning models, which, to a great extent, provide significantly accurate guidance for Ti-V-Cr burn-resistant alloy composition design.
Results and discussion
Performance of ML models
In the K-fold cross-validation, the average RMSE of the testing set was adopted as the return value of the objective function and the optimal hyperparameter combination in the model was searched in the direction of minimizing RMSE by BO. After obtaining the optimal solutions of hyperparameters of GBDT, BPNN, SVR, and XGBoost models, the machine learning model framework was built with them respectively. The K-fold cross-validation method was utilized to evaluate them again after training. Among them, the determination coefficients R2 and RMSE introduced above were selected as the evaluation indexes of the model. The corresponding R2, RMSE, and mean absolute error (MAE) of each model are shown in Table 1.
In addition, according to the results of the K-fold cross-validation method of the model, the scatter map of the predicted value and the actual value was drawn. Since the K-fold cross-validation method takes each fold as a testing set for prediction analysis, the result naturally traverses the entire dataset. Therefore, all the scatter points in the figure were used as testing sets, and lnkp was predicted by the model, which is shown in Fig. 1. The bar chart below the scatter plot shows the difference between the predicted value and the actual value when all the samples are used as the test set, which is used to analyze the distribution of the true error of the model. Combined with the results of Table 1 and Fig. 1, the performance of four different models can be analyzed.
GBDT model demonstrated excellent performance with an R2 of 0.98, indicating a very high degree of fitting to the data. Lower RMSE (0.91) and MAE (0.59) compared to the other three models further illustrated the advantages of GBDT in forecasting accuracy and stability. This result is attributed to the continuously optimizing residuals of the GBDT model in the process of gradually building multiple decision trees thus improving the predictive power of the overall model. GBDT’s step-up mechanism can effectively handle nonlinear relationships and complex data structures.
The R2 value of BPNN is 0.97, which is slightly lower than the GBDT model but still shows high fitting ability. However, its RMSE (1.09) and MAE (0.76) are relatively high, indicating that BPNN is inferior to GBDT in terms of prediction accuracy and error. BPNN can deal with complex nonlinear problems by optimizing weights through a backpropagation algorithm. Nevertheless, BPNN has high requirements on data volume and training time and it is easy to fall into local optimization resulting that its performance may not be as stable as GBDT in some cases.
The R2 value of XGBoost is 0.98, which is comparable to GBDT, implying that it also has very high fitting capability. Its RMSE (0.96) and MAE (0.59) also show excellent performance in forecasting accuracy and stability. XGBoost is optimized on the basis of traditional GBDT using regularization processing to prevent overfitting and improve training speed through parallel computation. Its efficient tree structure construction and pruning mechanism make it especially good for processing large-scale data.
It can be concluded that GBDT and XGBoost have the best performance in all evaluation indicators demonstrating that they have obvious advantages in processing complex data structures and improving prediction accuracy. These two models are more appropriate for the dataset adopted in this study. In the following work, the GBDT model will be further sorted for feature importance and analyzed for the interpretability of the model.
Feature importance analysis
Feature importance analysis is a technique used to assess the extent to which input features influence the prediction results of a machine learning model. Calculating the contribution of individual features to the model output can help identify which features play a significant role in the model.
Firstly, the most influential features can be identified, thereby optimizing the feature set, removing redundant or irrelevant features, and improving the efficiency and accuracy of the model. Secondly, through feature importance analysis, the potential mechanism and hidden patterns in the data can be obtained to understand the problem domain further. In addition, feature importance analysis can improve the interpretability of the model making the model more transparent and reliable when applied to practical issues. Therefore, feature importance analysis is not only an important part of machine learning modeling, but also a powerful tool for making data-driven decisions.
Figure 2 shows the feature importance ranking diagram of GBDT and XGBoost models where the x-axis is the feature importance degree, and the y-axis denotes the feature name. In GBDT, the three most important characteristics are temperature, Ti content, and V content. In XGBoost model, besides temperature, Zr content and Cr content are the two most influential features. According to the Arrhenius equation, oxidation temperature should be the most significant feature in the oxidation process, which is consistent with the model analysis results. The V element forms a protective layer of V2O5 on the surface as it oxidizes, preventing further oxidation. The dense ZrO2 oxide layer formed during the oxidation of Zr element can also protect the metal substrate from further oxidation. TiO2 and Cr2O3 oxides formed by Ti and Cr elements are thermal stable at high temperatures which can reduce the oxidation rate and protect the substrate metal. Subsequently, the model analysis results can be compared with the experimental results to analyze the reliability of the model and provide guidance for the design of alloy composition.
Interpretability analysis of the model
SHapley Additive exPlanations (SHAP) is a game theory-based method for explaining the predictions of machine learning models. It provides a unified measure of the interpretability of the model by calculating the contribution value of each feature to the predicted result. SHAP values reflect the influence of each feature on the model output in different input cases helping to understand the decision mechanism of the model.
The necessity of SHAP interpretability analysis lies in:
-
(1)
By quantifying the importance of features, SHAP can reveal the decision-making process of the model, making it more transparent and understandable.
-
(2)
Help identify features that have a significant impact on prediction results and guide feature engineering and model optimization.
-
(3)
Identifying potential model biases or complex relationships between features helps to find problems in the model and directions for improvement.
SHAP analysis plays a significant role in improving the interpretability, reliability, and validity of machine learning models and is a critical tool in data science and machine learning.
Figure 3 demonstrates the interpretability analysis of the XGBoost model using the SHAP method where the x-axis is the SHAP value, that is, the influence and contribution of each feature to the model output lnkp. The Y-axis is the name of each feature, sorted from top to bottom in order of the degree of contribution. The analysis results show that the higher the temperature, imply the greater the positive influence on the output lnkp. When the content of Ti and Al elements is higher, most SHAP is negative, indicating that the degree of negative influence on lnkp is greater and the oxidation resistance of the corresponding alloy is better. The higher the content of element V, the larger the lnkp output of the model is and the worse the oxidation resistance of the corresponding alloy is, which is similar to the experimental results. The SHAP value has both positive and negative of the higher the content of Cr element which is different from the rule obtained by the experiment and the specific reason needs to be further analyzed.
Results of oxidation experiments
The oxidation weight gain curve is a dynamic description of the alloy oxidation process. Based on the oxidation weight gain curve, the oxidation rate index and oxidation constant which measure the oxidation resistance of the target alloys can be calculated. This summary plots the oxidation weight gain curves for alloys and compares the oxidation resistance of alloys by comparing the shapes and relative positions of the curves. There were more than 60 groups of oxidation data obtained from the experiments, including the variation of V, Cr, and Al content and the corresponding results were demonstrated by the weight gain curves.
The data from the oxidation experiment are divided into three groups, and the average of the difference between the measured mass and the mass before oxidation is obtained as the oxidation gain.
The oxidation gain is the difference between the mass after oxidation and the mass before oxidation. The oxidation gain divided by the sample’s surface area gives the oxidation gain per unit surface area, as shown in Equation (2):
where M is the oxidation gain at the corresponding time and A is the total surface area of the sample.
Figure 4 shows the oxidation weight gain experiment of 1000 h. It can be seen from the figure that the oxidation weight gain of the listed kinds of Ti-V-Cr alloys meet the parabolic rule and conforms to Wagner oxidation theory. Therefore, it is reasonable to apply the parabolic oxidation rate constant as the model prediction index in the subsequent model.
The oxidative weight gain curve of the sample was plotted with time (h) as the horizontal coordinate and oxidative weight gain per unit surface area (g/m2) as the vertical coordinate. Tables 2–4 are the corresponding series and components of the alloys whose effects of Cr, V, and Al content on the oxidation resistance of the alloys were studied by using the control variable methodology and three sets of parallel experiments were carried out for each component with error calculated.
Non-consumable vacuum arc melting was carried out on the alloys in the table, and the samples were processed as standard samples for oxidation experiments. After the 100 h oxidation weight gain experiment, the oxidation weight gain curves of each alloy were shown in Fig. 5.
Comparative analysis shows that other conditions remain unchanged, the higher the content of Cr in the experimental range, the better the high-temperature oxidation resistance of titanium alloy. When the mass fraction of V element is less than 0.3, the higher the content of V element, the better the high-temperature oxidation resistance of titanium alloy. When the mass fraction of V element reaches 0.3, the higher content of V element cannot improve the high-temperature oxidation resistance of titanium alloy. Only from the oxidation weight gain curve, the higher the Al content on the whole, the better the oxidation resistance of titanium alloy. However, when the content of Al element is higher than 3%, increasing the content of Al element can hardly increase the oxidation resistance of titanium alloy. It was noted that there was a data anomaly in the group of experiments with TF23, namely 6% Al content. The weight gain per unit area calculated by high-temperature oxidation for 100 h is smaller than that for 75 h. In theory, when the oxide film does not fall off, the oxidation weight gain will only become larger, not smaller. Therefore, this study speculated that there may be errors in the experiment process leading to oxidative shedding. To this end, the data of three sets of parallel experiments (oxidation gain per unit area: g/m2) were checked, as shown in Table 5.
It can be seen from Table 5 that the oxidation weight gain of the three groups of parallel experiments decreases during the oxidation period from 75 to 100 h, thus the error of experimental operation can be eliminated. The reasons for this phenomenon remain to be explained in the follow-up work.
Characterization analysis and mechanism discussion
The oxidized samples were analyzed by metallography, SEM and EDS. The oxidation film thickness of each alloy was calculated by scanning electron microscopy, and the oxidation rate constant calculated by the oxidation weight gain curve was used to evaluate the oxidation resistance of different alloy components. The oxidation resistance mechanism of Ti-V-Cr alloy was discussed based on the reports in the literature, and the influence of the change of the content of each alloy element on the oxidation resistance of the alloy was analyzed, and the reliability of GBDT and XGBoost models was verified by comparing with the analysis results obtained by the machine learning model.
Three alloys, TF12, TF18, and TF24, were selected for scanning electron microscopy, surface scan, and line scan energy spectrum analysis of their cross sections. The analysis results are as follows.
As can be seen from Fig. 6 of the section scanning electron microscope, the average thickness of the oxide film of the binary alloy TF12 (Ti-12.5Cr) containing only Cr element is 1.250 μm, which is thinner than that of the other alloys containing V and Al elements in the experiment, but the oxide film is not continuous, and the phenomenon of spalling and falling off of the oxide film is common. It has a certain effect on the oxidation resistance of the alloy. EDS results showed that the oxide film was composed of Ti and Cr oxides.
Combined with the analysis results of the machine learning model and the experimental law, the higher the content of Cr element, the better the corresponding oxidation resistance of the alloy, but too much Cr element (higher than 15%) will lead to the deterioration of the plasticity and thermal stability of the alloy. Considering the comprehensive mechanical properties, the recommended design range of the Cr element is about 15%, and the oxidation resistance of the alloy is better. At the same time, it also takes into account the requirements of mechanical properties.
Figure 6b shows the SEM and EDS characteristic results of Ti-20V-15Cr-1Al alloy. The average oxide film thickness is 5.329 μm. The oxide film is composed of Ti, V, Cr, and Al oxides, and the oxide content in the outer oxide layer of the oxide film is generally higher than that in the inner oxide layer near the matrix. The content of the Al element in the oxide film is similar to that in the matrix.
Figure 6c shows the SEM and EDS characterization results of Ti-25V-15Cr-1Al alloy. The average oxide film thickness is 6.446 μm. The oxide film is composed of oxides of Ti, V, Cr, and Al elements. The distribution of Al element in the oxide film has a polarization phenomenon. In the local high Al element region, the content of Ti is significantly lower than that in other regions of the oxide film, and the content of V and Cr elements is increased to a certain extent in this region, which may indicate the formation of a new multiphase. In order to understand the spatial distribution and concentration changes of each element in oxide film and matrix, the energy spectrum line scanning of these three alloys was further carried out.
Figure 7 shows the line sweep results of Ti-12.5Cr alloy. In the oxide film region, the content of oxygen elements gradually decreased from outside to inside, and was relatively low in the matrix, and the value of cps was close to 0. The content of Ti element increased gradually from outside to inside the oxide film. The content of Cr was different from its changing trend, from outside to inside, it increased first and then decreased, and remained stable in the matrix.
Figure 8 shows the line scanning results of Ti-20V-15Cr-1Al alloy. In the oxide film region, the content of O element remains at a relatively stable level. The contents of the V element and Cr element were maintained at a stable level in the oxide film.
Figure 9 shows the line scanning results of Ti-25V-15Cr-1Al alloy, which are similar to the results of Ti-20V-15Cr-1Al. In the oxide film region, the content of O element remained at a relatively stable level. The content of the V element and Cr element in the oxide film increased steadily, and the increasing rate was slow. Different from Ti-12.5Cr alloy, the content of Ti element in the oxide film is more evenly distributed, and there is no obvious upward trend, and only a sudden increase occurs near the matrix.
The following discussions and conclusions can be drawn from the prediction model and experimental study of the oxidation resistance of Ti-V-Cr flame retardant titanium alloy. (1) A prediction model for the oxidation resistance of Ti-V-Cr flame retardant titanium alloy was constructed. Due to sparse data when oxidation gain was used as the prediction index, the R2 of the model reached only 0.81, and the corresponding algorithm was ANN; When the parabolic oxidation rate constant lnkp was used as the predictor, the data quality was higher, the R2 determination coefficient of the model reached 0.98, and the maximum error was only 6.40%. The corresponding algorithm was the extreme gradient lifting decision tree XGBoost. (2) In the current oxidation experiment, the higher the content of Cr, the better the high-temperature oxidation resistance of titanium alloy. When the mass fraction of V element is less than 30%, the higher the content of V element, the better the oxidation resistance of the alloy. When the mass fraction of V element reaches 30%, the higher the content of V element can not significantly improve the high-temperature oxidation resistance of titanium alloy. The higher the content of Al, the better the oxidation resistance of the corresponding titanium alloy. (3) The feature importance and SHAP interpretability analysis results of the machine learning model show that the factors that significantly affect the parabolic oxidation rate constant of the alloy are temperature, Ti content, Al content, V content, and Cr content. SHAP analysis shows that Ti and Al elements are beneficial to improving the oxidation resistance of the alloy. The oxidation resistance of the alloy will be reduced with the increase of V content. The SHAP value of the Cr element is positive and negative, which shows that the anti-oxidation performance can be improved as a whole, which is consistent with the experimental law.
Methods
Data source and quality
So as to tackle the issue of sparse data of isothermal oxidation experiments when only the Ti-V-Cr alloy system is considered. This work investigated and collected the isothermal oxidation data of high-temperature titanium alloys above 550 °C and high entropy alloys containing Ti at different temperatures that meet Wagner’s metal oxidation theory and basically obey the parabolic oxidation rule in literature. Apart from that, more than 60 groups of isothermal oxidation experiments at 600 °C were carried out, varying the content of V, Cr, and Al in the Ti-V-Cr alloy system according to the oxidation weight gain method. The results were included in the database combined with the data accessed from literature. The final database contains 156 sets of data6,58,59,60,61,62,63,64,65,66,67,68,69,70,71, each of which includes element composition (wt,%), oxidation temperature (°C), oxidation time (h), oxidation weight gain per unit area (mg/cm2), and parabolic oxidation rate constant kp. The parabolic oxidation rate constant kp is an intuitional and quantitative index to assess the oxidation resistance of specific alloys, whereas the oxidation weight gain per unit area can only linearly obtain the overall oxidation resistance during a time period. The oxidation time and oxidation weight gain per unit area were therefore utilized only to calculate the kp values in this research. The machine learning models adopted 19 features (including 18 alloy composition and oxidation temperature) as inputs and parabolic oxidation rate constant kp as the prediction target.
After the database was established, the units of the data were converted. The alloy atomic ratio content (at,%) was uniformly converted into a mass fraction (wt%); The unit of oxidation temperature is uniformly converted to °C. The unit of the parabolic oxidation rate constant kp is unified to mg2cm−4s−1.
All data needed to be normalized after being converted into unified units. The purpose of normalization processing is to scale the data with different characteristics into the same range. Through this approach, the model training could be more stable and efficient. The accuracy of the model is able to be improved to a certain extent. In this paper, the Min-Max normalization method was used to scale the data to the range of 0 and 1. The Min-Max normalization equation is shown in equation (3)
where \({x}^{{\prime} }\) represents the normalized value of the data; \(x\) is the original value; \({x}_{\min }\) is the minimum value of the feature; \({x}_{m{ax}}\) indicates the maximum value of the feature.
Due to the magnitude of feature kp being on the order of 10−6 to 10−7 and the significant difference between its minimum and maximum values, a natural logarithm transformation (lnkp) was applied resulting in a range of −18.13 to 4.61 for lnkp. Table 6 presents statistical information for all features in the dataset, including lnkp as the target for machine learning models.
The distribution of the dataset significantly impacts the accuracy and robustness of models. In addition to the information revealed in Table 6. Table 7 demonstrates the data distribution information of the whole 19 features, which are the inputs of machine learning models as well.
So as to have an intuitional knowledge about the data distribution of both 19 features and the target lnkp, a 3D scatter diagram of all the samples within the dataset after being processed by PCA, which includes the information of all 19 features as well (Fig. 10) was plotted. Figure 11 illustrates the distribution of lnkp. Except for the range of −15 to −10 (data obtained in experiments of this study), the values of lnkp are approximately uniformly distributed across the intervals facilitating the subsequent model establishment.
Model establishment and training
In this research, machine learning models were developed using both single learners algorithm (BPNN, SVR) and ensemble learners algorithm (GBDT, XGBoost, RF). Their prediction performance was compared horizontally. The evaluation metrics for the machine learning models were the coefficient of determination R2 and root mean squared error (RMSE). RMSE is a commonly utilized metric to assess the prediction performance of regression models representing the square root of the average squared difference between predicted and actual values. A lower RMSE indicates better model performance with the calculation formula shown in Equation (4):
where n represents the total number of samples which is the number of observations in the dataset as well; yi denotes the actual value of the ith observation.
The coefficient of determination R2, also known as the goodness of fitting, measures the proportion of variance in the dependent variable that is predicted from the independent variables. The R2 value ranges from 0 to 1 with values closer to 1 indicating a better fit of the model to the data. It is calculated as shown in equation (5).
In the equations, \(S{S}_{{res}}\) represents the residual sum of squares, and \(S{S}_{{tot}}\) represents the total sum of squares. These are calculated using the following formulas, as shown in equations (6) and (7):
where \(\bar{y}\) is the average value of the actual value.
All the machine learning models included in this study were verified by K-fold crossover validation method. The K-fold crossover validation method is often used to evaluate machine learning models with small data sets. In the data partitioning stage, the method divides the dataset into K subsets of roughly equal size called collapses. In the model training and verification stage, for each fold, each fold is taken as the verification set, respectively and the remaining K-1 folds are taken as the training set. After the training is complete, the remaining fold (validation set) is used to evaluate the performance of the model. This process is repeated K times while using a different fold as the verification set at each time to make sure that each fold is utilized as a validation set once. After all K tests, the performance of the model on each verification set is averaged to get the overall performance evaluation of the model. There are several advantages of the K-fold method:
-
(a)
Throughout the process, the model was trained and validated, transversing all the data, and every data point was used for both training and validation, making the most of the limited data available;
-
(b)
Through multiple training and verification, the performance of the model can be evaluated more reliably and the contingencies caused by data division can be avoided. Additionally, the generalization ability and accuracy of the model can be improved.
Bayesian optimization on hyperparameters of machine learning model
Bayesian optimization(BO) is an effective method for optimizing hyperparameters of machine learning models. It is especially suitable for high-dimensional and non-convex optimization problems. Through this method, the hyperparameter combinations close to global optimal solutions can be found in relatively few iterations. BO works by building a surrogate model (usually a Parzen estimator of a Gaussian process or tree structure) and optimizing the surrogate model to find the optimal hyperparameters.
The basic steps of (BO) are as follows: (1) Firstly, a set of hyperparameters is selected, and the corresponding model performance is calculated by Latin hypercube sampling or other methodologies. The initial sample set provides the initial data for Bayesian optimization; (2) Utilize the data of the initial sample set to build a surrogate model (such as the Gaussian process). This surrogate model is an approximation of the objective function (i.e., model performance or RMSE); (3) The next hyperparameter combination is determined by optimizing the Acquisition Function. The acquisition function balances exploration (trying new areas) and exploitation (fine searching in areas that are known to be optimal); (4) Evaluate the performance of the newly selected hyperparameter combination in the actual model, add the results to the sample set, and eventually update the surrogate model. (5) Repeat steps (3) and (4) until the preset maximum number of iterations is reached or a satisfactory combination of hyperparameters is found. In this study, the artificial neural network (BPNN), support vector regression (SVR), gradient lifting decision tree (GBDT) and extreme gradient lifting decision tree (XGBoost) were all optimized by using the library “scikit-optimize” in python. For example, in GBDT, the optimized hyperparameters are: learning rate (0.01–1.0), maximum depth of each tree (1–10), number of trees (n estimators: 50–500), and subsample ratio during tree training (subsample:). 0.5–1.0), min samples split of internal nodes (min samples split: 2–20), and min samples leaf required (min samples leaf: 1–20). The method was realized through optuna in python by setting the number of optimization steps to 300 performing BO for the four models. Figure 12 demonstrates the variations of RMSE values in the process of optimization for four machine learning models.
Experimental methods
This experiment refers to the “Test method for the determination of oxidation resistance of steel and superalloy” and the standard indicates that this method can also be used to test the oxidation resistance of titanium alloys. The experiments in this research originally implemented more than 60 Ti-V-Cr alloys production by non-consumable vacuum arc melting and oxidation resistance tests varying the content of V, Cr, and Al. Considering the establishment of machine learning models, the performance of ML models would be compromised by utilizing 60+ groups of data. Hence, the oxidation data and behaviors of high-temperature Ti-alloys such as Ti811, IMI834, and Ti65 were collected, together with the experimental 60+ groups of data (156 samples in total), as the training dataset as well.
Experimental equipment: sample heating furnace, analytical balance, and sample container. The temperature measurement accuracy of the heating furnace is 0.5 °C while the temperature difference in the furnace is less than 5 °C. The average temperature fluctuation in the furnace is less than 3 °C, and the oxidation atmosphere is maintained in the furnace. The accuracy of the analytical balance is 0.0001 g. The container used for the test is a small crucible, and the sample is tilted in the crucible. The contact area between the surface area and the container is kept to a minimum.
During the experiment, the crucible was placed on the tray according to the fixed position. The orientation and the position of the crucible was kept, and the orientation of the tray was marked when taking out the measurement. Before the experiment, the sample was polished flat and left for 1 day. The residue in the crucible was removed and roasted at 650 °C for 5 h, and the quality was measured after being taken out and left for 2 h. Then it was roasted again and taken out to stand and measure the quality. If the difference between the two measurements is not greater than 0.0002 g, it is considered that the crucible has reached constant weight and can be put into the drying chamber for backup. Pallets were roasted in the same way. The experimental temperature was 600 °C while the total experiment time was 100 h and the sample was taken out every 25 h after standing, weighed and then returned to the heating furnace. The experiment was repeated with three samples for each component. The quality of the sample was measured before heating at 25, 50, 75, and 100 h. The oxidation weight gain was calculated, and the oxidation weight gain curve was drawn. Subsequently, the oxidation rate constant was calculated from the oxidation weight gain curve. After the experiment was completed, metallographic samples were prepared by grinding plate method, and metallographic images, scanning electron microscope images and energy spectrum images were obtained by optical microscope and electron microscope.
Data availability
The training dataset, including the oxidation data obtained by the experiments and the data accessed from previous relevant literature and the machine learning companies with the optimization codes, are publicly available at the following GitHub repository: https://github.com/yuanzhisun/DatasetAndCodes.
Code availability
The training dataset including the oxidation data obtained by the experiments and the data accessed from previous relevant literature and the machine learning companied with the optimization codes are publicly available at the following GitHub repository: https://github.com/yuanzhisun/DatasetAndCodes.
References
Cao, J., Huang, X., Mi, G., Sha, A. & Wang, B. Research progress on application technique of Ti-V-Cr burn resistant titanium alloys. J. Aeronaut. Mater 34, 92–97 (2014).
Shao, L. Combustion behavior and mechanism of typical titanium alloys for aircraft. University of Science and Technology Beijing (2022).
Mi, G. et al. Research progress on non-isothermal oxidation behavior of high-temperature titanium alloys for aero engines. Aero. Manuf. Technol. 62, 88–92 (2019).
Mi, G. Foreword to the “titanium fire technology. Column. Mater. Eng 52, 2 (2024).
Taylor, C. D. & Tossey, B. M. High temperature oxidation of corrosion resistant alloys from machine learning. Npj Mater. Degrad. 5, 1–10 (2021).
Dai, J., Zhu, J., Chen, C. & Weng, F. High temperature oxidation behavior and research status of modifications on improving high temperature oxidation resistance of titanium alloys and titanium aluminides: a review. J. Alloys Compd. 685, 784–798 (2016).
Aghaeian, S., Nourouzi, F., Sloof, W. G., Mol, J. M. C. & Böttger, A. J. Predicting the parabolic growth rate constant for high-temperature oxidation of steels using machine learning models. Corros. Sci 221, 111309 (2023).
Li, W. et al. Effect of different oxidation modes on the growth of oxide layer of TC11 titanium alloy. J. Mater. Res. Technol. 32, 4265–4277 (2024).
Optasanu, V. et al. The influence of Si on the high-temperature oxidation of near-alpha titanium alloys. High Temp. Corros. Mater. https://doi.org/10.1007/s11085-024-10310-6 (2024).
Valenza, T. C., Weber, P. K. & Marquis, E. A. Role of nitrogen in the high-temperature oxidation of titanium alloys. Corros. Sci. 235, 112164 (2024).
Stringer, J., Cowgill, M. G. & Griffiths, N. C. Oxidation of titanium–iron alloys. Nature 185, 304–304 (1960).
Kitashima, T., Yamabe-Mitarai, Y., Iwasaki, S. & Kuroda, S. Effects of Ga and Sn additions on the creep strength and oxidation resistance of near-α Ti alloys. Metall. Mater. Trans. A 47, 6394–6403 (2016).
Yang, Y. et al. Effects of Ga, Sn addition and microstructure on oxidation behavior of near-α Ti alloy. Oxid. Met. 88, 583–598 (2017).
Vojtěch, D., Novák, P., Macháč, P., Morťaniková, M. & Jurek, K. Surface protection of titanium by Ti5Si3 silicide layer prepared by combination of vapour phase siliconizing and heat treatment. J. Alloys Compd. 464, 179–184 (2008).
Knaislová, A. et al. High-temperature oxidation of Ti–Al–Si alloys prepared by powder metallurgy. J. Alloys Compd. 810, 151895 (2019).
Cruchley, S., Evans, H. E., Taylor, M. P., Hardy, M. C. & Stekovic, S. Chromia layer growth on a Ni-based superalloy: sub-parabolic kinetics and the role of titanium. Corros. Sci. 75, 58–66 (2013).
Itakura, M., Yamaguchi, M., Egusa, D. & Abe, E. Density functional theory study of solute cluster growth processes in Mg-Y-Zn LPSO alloys. Acta Mater 203, 116491 (2021).
Ko, W.-S., Park, K. B. & Park, H.-K. Density functional theory study on the role of ternary alloying elements in TiFe-based hydrogen storage alloys. J. Mater. Sci. Technol. 92, 148–158 (2021).
Chakraborty, P. et al. Design and development of low density, high strength ZrNbAlVTi high entropy alloy for high temperature applications. Int. J. Refract. Met. Hard Mater. 113, 106222 (2023).
Bao, H., Xu, H., Li, Y., Bai, H. & Ma, F. The interaction mechanisms between dislocations and nano-precipitates in CuFe alloys: a molecular dynamic simulation. Int. J. Plast. 155, 103317 (2022).
Bahramyan, M., Mousavian, R. T. & Brabazon, D. Molecular dynamic simulation of edge dislocation-void interaction in pure Al and Al-Mg alloy. Mater. Sci. Eng. A 674, 82–90 (2016).
Arrousse, N. et al. Corrosion protection studies of different alloys in 1 M HCl by benzimidazole derivative: combined molecular dynamic simulations/DFT. J. Environ. Chem. Eng. 11, 109642 (2023).
Mai, W., Soghrati, S. & Buchheit, R. G. A phase field model for simulating the pitting corrosion. Corros. Sci. 110, 157–166 (2016).
Zhang, J., Chadwick, A. F., Chopp, D. L. & Voorhees, P. W. Phase field modeling with large driving forces. Npj Comput. Mater 9, 1–9 (2023).
Zheng, S., Zheng, D., Ni, Y. & He, L. Improved phase field model of dislocation intersections. Npj Comput. Mater. 4, 1–8 (2018).
Ueberricke, L., Murata, T., Ikeda, H., Nakane, S. & Deubener, J. Crystal growth in oxide melts—from CALPHAD thermodynamic modeling to statistical prediction. Acta Mater 273, 119960 (2024).
Wu, D. et al. Designing high-strength titanium alloy using pseudo-spinodal mechanism through diffusion multiple experiment and CALPHAD calculation. J. Mater. Sci. Technol. 74, 78–88 (2021).
Luo, Q., Zhai, C., Sun, D., Chen, W. & Li, Q. Interpolation and extrapolation with the CALPHAD method. J. Mater. Sci. Technol. 35, 2115–2120 (2019).
Ohler, B., Prada, S., Pacchioni, G. & Langel, W. DFT simulations of titanium oxide films on titanium metal. J. Phys. Chem. C 117, 358–367 (2013).
Wu, H. H. & Trinkle, D. R. Solute effect on oxygen diffusion in α-titanium. J. Appl. Phys. 113, 223504 (2013).
Bhattacharya, S. K., Sahara, R., Kitashima, T., Ueda, K. & Narushima, T. First principles study of oxidation of Si-segregated α-Ti(0001) surfaces. Jpn. J. Appl. Phys. 56, 125701 (2017).
Shen, C. et al. Discovery of marageing steels: machine learning vs. physical metallurgical modelling. J. Mater. Sci. Technol. 87, 258–268 (2021).
Biermair, F., Razumovskiy, V. I. & Ressel, G. Influence of alloying on thermodynamic properties of AlCoCrFeNiTi high entropy alloys from DFT calculations. Comput. Mater. Sci 202, 110952 (2022).
Kang, K., Kwon, S., Lee, C., Hong, D. & Lee, H. M. Hierarchical analysis of alloying element effects on gas nitriding rate of Fe alloys: a DFT, microkinetic and kMC study. Acta Mater 174, 173–180 (2019).
Song, T. Y., Liu, R. L., Li, L. Z., Bian, C. X. & Yan, M. F. DFT investigation of carbon-expanded α phase with different alloying element. Vacuum 202, 111199 (2022).
Zhang, R. et al. Short-range order and its impact on the CrCoNi medium-entropy alloy. Nature 581, 283–287 (2020).
Chen, Y. et al. Origin of strong solid solution strengthening in the CrCoNi-W medium entropy alloy. J. Mater. Sci. Technol. 73, 101–107 (2021).
Zhou, P., Erning, J. W. & Ogle, K. Interactions between elemental components during the dealloying of Cu-Zn alloys. Electrochim. Acta 293, 290–298 (2019).
Zhou, Y., Wu, J., Ma, W., Lei, Y. & Dai, Y. Experimental determination of interaction coefficients of components in ternary Si-Fe-Al alloy solution. J. Alloys Compd. 751, 257–265 (2018).
Hart, G. L. W., Mueller, T., Toher, C. & Curtarolo, S. Machine learning for alloys. Nat. Rev. Mater. 6, 730–755 (2021).
Samuel, A. L. Some studies in machine learning using the game of checkers. IBM J. Res. Dev. 3, 210–229 (1959).
Castro, J. L., Mantas, C. J. & Benı́tez, J. M. Neural networks with a continuous squashing function in the output are universal approximators. Neural Netw. 13, 561–563 (2000).
Gubernatis, J. E. & Lookman, T. Machine learning in materials design and discovery: examples from the present and suggestions for the future. Phys. Rev. Mater. 2, 120301 (2018).
Yuan, R. et al. Accelerated discovery of large electrostrains in BaTiO3-based piezoelectrics using active learning. Adv. Mater. 30, 1702884 (2018).
Belov, N. A., Akopyan, T. K., Belov, V. D., Gershman, J. S. & Gorshenkov, M. V. The effect of Cr and Zr on the structure and phase composition of TNM gamma titanium aluminide alloy. Intermetallics 84, 121–129 (2017).
Wu, H., Fan, G., Geng, L., Cui, X. & Huang, M. Nanoscale origins of the oriented precipitation of Ti3Al in TiAl systems. Scr. Mater. 125, 34–38 (2016).
Pang, J. et al. Diffusion of carbon and nitrogen in TC4 titanium alloy plasma electrolytic saturation. Chem. Phys. 505, 12–18 (2018).
Liu, Z., Zhu, P., Mao, X., Zhao, Y. & Xin, S. Effect of Si addition on the microstructure and creep properties of the forged titanium alloy. Mater. Chem. Phys. 317, 129212 (2024).
Guo, R. et al. Microstructure and mechanical properties of powder metallurgy high temperature titanium alloy with high Si content. Mater. Sci. Eng. A 777, 138993 (2020).
Kondoh, K., Kariya, S., Khantachawana, A., Alhazaa, A. & Umeda, J. Quantitative strengthening evaluation of powder metallurgy titanium alloys with substitutional Zr and interstitial O solutes via homogenization heat treatment. Materials 14, 6561 (2021).
Fu, X. et al. Atomic-scale observation of non-classical nucleation-mediated phase transformation in a titanium alloy. Nat. Mater. 21, 290–296 (2022).
Zhang, F., Du, M., Fan, K., Ye, C. & Zhang, B. Fabrication and mechanical properties of network structured titanium alloy matrix composites reinforced with Ti2AlC particulates. Mater. Sci. Eng. A 776, 139065 (2020).
Zhang, S. et al. The effect of post heat treatment on microstructural evolution and mechanical properties of Ti2AlNb intermetallic alloy prepared by point-forging and laser-deposition. Mater. Sci. Eng. A https://doi.org/10.1016/j.msea.2024.147331 (2024).
Cao, S. et al. Interaction between Al and other alloying atoms in α-Ti for designing high temperature titanium alloy. Comput. Mater. Sci. 197, 110620 (2021).
Yang, F. et al. Cluster-formula-embedded machine learning for design of multicomponent β-Ti alloys with low Young’s modulus. Npj Comput. Mater. 6, 1–11 (2020).
Guo, Y. et al. Relative strength of β phase stabilization by transition metals in titanium alloys: The Mo equivalent from a first principles study. Mater. Today Commun. 35, 106123 (2023).
Wang, C. et al. The equivalence of titanium alloys defined via β phase decomposition paths. J. Mater. Sci. https://doi.org/10.1007/s10853-024-10026-7 (2024).
Zhang, N. & Zhu, L. Cyclic oxidation behavior of rare earth elements La and Y modified CMSX-4 single crystal superalloy. YanTai University (2023).
Li, Z. & Li, W. & Wang, Q. Oxidation resistance Of Ti-based multiprincipal alloys. Yanshan University (2022).
Feng, Y. & Wang, Q. & Chen, Z. High temperature cyclic oxidation and deformation behavior of Ti65 titanium alloy plate. University of Science and Technology of China (2022).
Zhang, M., Liu, Y., Zheng, Z., Cao, J. & Huang, X. Effect of compound alloying on high temperature oxidation resistance property of Ti2AlNb alloy. J. Mater. Eng. 50, 93–100 (2022).
Zhou, Z., Min, X., Bai, P., Wang, E. & Zhang, W. High temperature oxidation and phase precipitation behavior of β-type Ti-Mo-O alloys. Rare Met. Mater. Eng. 52, 186–194 (2023).
Cui, W. & Wei, H. Oxidation behaviour of IMI834and Ti−1100 alloys at high temperature of 550 °C–750 °C. Rare Met. Mat. Eng. 31, 35 (1997).
Zhou, Z., Li, H., Liu, Y., Niu, T. & Li, H. Effect of Ti on the corrosion behavior of (FeCrCoNi)100−xTix alloy. Corros. Sci. 209, 110807 (2022).
Che, J. et al. Effects of minor additions of elements into Ti1100 on elevated temperature oxidation- and corrosion-resistance. Rare Met. Mater. Eng. 47, 1471–1477 (2018).
Liu, Y. et al. Electronic structure calculation of influence of Cr content on structural stability and corrosion resistance of Ti-V-Cr alloy. J. Liaoning Univ. 47, 245–252 (2020).
Wang, F. & Wang, H. High temperature oxidation behavior of Ti811 alloy. Foundry Tech. 32, 507–510 (2011).
Tang, M., Zhao, Y., Hong, Q. & Wang, Z. Oxidation behavior of Ti600 alloy at high temperature. Rare Met. Mater. Eng. 41, 1226–1230 (2012).
Jia, W., Zeng, W., Liu, J., Wang, Q. & Zhou, Y. Oxidation behavior of near α-titanium alloy Ti60. Rare Met. Mater. Eng. 39, 781–786 (2010).
Ran, L., Yang, F., Liu, J. & Li, H. Study on high temperature oxidation kinetics of Ti40 alloy. Mater. Protect. 51, 48–53 (2018).
Wu, C., Zhang, S., Guo, H., Li, Z. & Zhang, D. The effect of high temperature oxidation on microstructure and properties of Ti60 alloy. Gas Turbine Test. Res. 36, 55–62 (2023).
Acknowledgements
This work was supported by the National Natural Science Foundation of China [grant number U2141222]; Independent innovation project of Aero Engine Corporation of China (CXPT-2022-034).
Author information
Authors and Affiliations
Contributions
Y.S. wrote the main manuscript, carried out the investigation and experiments, and established and validated the machine learning models. Y.S. and G.M. collected the data and provided the concept and the methodology. G.M and P.L. reviewed, edited the manuscript, and were responsible for this project and funding. L.H. reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Sun, Y., Mi, G., Li, P. et al. Prediction of oxidation resistance of Ti-V-Cr burn resistant titanium alloy based on machine learning. npj Mater Degrad 9, 3 (2025). https://doi.org/10.1038/s41529-025-00553-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41529-025-00553-2