Predicting CaO activity in multiple slag system using improved whale optimization algorithm and categorical boosting

Xin, Zi-cheng; Zhang, Jiang-shan; Liu, Qing

doi:10.1038/s41598-025-93980-9

Published: 19 March 2025

Scientific Reports volume 15, Article number: 9533 (2025)

Abstract

The activity of slag components is one of the primary factors influencing the thermodynamic properties of slag. In this study, a model was established to predict the activity of CaO, a(CaO), in multiple slag systems using an improved whale optimization algorithm (IWOA) and Categorical Boosting (CatBoost). The input variables, listed in descending order of their influence on a(CaO), were w(CaO), w(SiO₂), temperature, w(MgO), and w(Al₂O₃). The optimal IWOA-CatBoost model was evaluated against known models and methods. It achieved the highest R² (0.9200), the lowest RMSE (0.0042), and the lowest MAE (0.0030) in predicting a(CaO), outperforming FactSage, the ion and molecule coexistence theory (IMCT), and a genetic algorithm-backpropagation (GA-BP) neural network. Accurate calculation of slag component activity is of great significance for analyzing the thermodynamic properties of slag. The approach and algorithm used to develop the a(CaO) prediction model can also be applied to predicting the activity of other slag components or to other metallurgical applications (e.g., predicting molten steel temperature, steel composition, and alloy yield).

Introduction

As the steel industry moves toward higher-end, smarter, and greener production, the requirements for product quality continue to increase¹. Slag has a significant impact on the quality of molten steel, and desulfurization is one of its primary tasks in the ladle furnace (LF) refining process^2,3,4. The desulfurization reaction occurs primarily at the steel-slag interface, and the activity of each slag component significantly affects the desulfurization process. In addition, activity theory can explain a range of critical phenomena in metallurgical processes, such as phase equilibria and phase transformations, element migration, and the direction of chemical reactions. As research on the physicochemical properties of slag has deepened, metallurgists have paid increasing attention to the activity of slag components.

Metallurgists have developed a range of thermodynamic models for molten slag, including the complete ionic solution model⁵, the regular solution model⁶, the new generation solution geometrical model⁷, and the ion and molecule coexistence theory (IMCT)⁸. Chang et al.⁹ calculated the liquidus temperature, activity, and cooling crystallization process of slag using the FactSage thermodynamic software, and analyzed the mechanism by which Al₂O₃ affects slag viscosity. Tang et al.¹⁰ used FactSage to calculate the activity of slag components in the CaO–MgO–Al₂O₃–SiO₂ refining slag system in order to explore the thermodynamic equilibrium relationships among refining slag, molten steel, and inclusions. Guo et al.¹¹ developed an activity calculation model for the CaO–SiO₂–MgO–Al₂O₃ slag system based on the IMCT, and validated this model using experimental data. However, the FactSage thermodynamic software typically assumes that reactions reach equilibrium. In actual metallurgical processes, the reactions between slag and metal do not always achieve thermodynamic equilibrium, especially when reactions are rapid or the system is in a non-equilibrium state. As a result, the calculated activity value may not accurately reflect the situation in the actual process. Additionally, most models and methods rely on assumed conditions and overlook the effects of interactions between components on activity, resulting in deviations between calculated and experimental values. Therefore, the thermodynamic models should be further developed and improved¹¹.

With the rapid development of machine learning theory, Wu et al.¹² established an activity prediction model for multiple slag systems based on a genetic algorithm (GA)-back propagation (BP) neural network and demonstrated good agreement between the model’s calculated values and experimental values. However, that study validated the accuracy of the constructed model solely by comparing calculated values with literature values in graphical form, without a comprehensive comparison against known models and methods, and it employed the traditional GA-BP neural network algorithm. Since the emergence of deep learning, deep neural networks have been widely applied across diverse industries¹³. However, nearly all deep neural network algorithms currently require large data sets for effective training. In practice, limited data availability and high data collection costs are common, so learning from small-sample data sets is particularly important. In this study, Categorical Boosting (CatBoost), an improved decision tree algorithm, was applied to the prediction of a(CaO) in multiple slag systems with a small-sample data set. The CatBoost algorithm can achieve high predictive accuracy with small sample sets¹⁴. Jin et al. and Gu et al.^15,16 validated the feasibility of CatBoost for small-sample prediction in the contexts of the blasting fragment large block percentage ratio (a regression study with 36 data samples) and fault diagnosis of photovoltaic arrays (a classification study with 55 and 110 training samples), respectively.

Based on the above analysis, a(CaO) data obtained using the same activity measurement method were first collected in this study. Next, a correlation analysis was performed to assess the effect of various factors on a(CaO). The convergence factor of the standard whale optimization algorithm was then improved to enhance the global search ability in the early stage and the local optimization speed in the later stage, and a prediction model of a(CaO) was established based on the improved whale optimization algorithm (IWOA) and CatBoost (IWOA-CatBoost). Finally, various statistical evaluation metrics were employed to compare the established model against existing models and methods (FactSage, IMCT, and GA-BP), demonstrating the accuracy of the established model. The modeling approach presented in this study can also be applied to predicting the activity of other slag components.

Data collection and data analysis

Slag is a multi-component melt mainly composed of different oxides and is a typical by-product of steelmaking. Refining slag plays an important role in the steelmaking process, as sulfur is primarily removed from the molten steel through chemical reactions at the steel-slag interface. The activity of CaO in the slag also has a significant impact on the CaO content of inclusions^17,18. Therefore, studying the activity of slag components is of great significance. The following analysis, based on metallurgical mechanisms, examines the impact of various factors (temperature and slag composition, including CaO, SiO₂, MgO, and Al₂O₃ content) on CaO activity, desulfurization reactions, and inclusion control.

Regarding the impact of the different factors on CaO activity, the activity of CaO increases as the temperature increases. A possible reason is that SiO₂ reacts with CaO to form compounds such as CaSiO₃ and Ca₂SiO₄; according to Le Chatelier’s principle and the IMCT, these compounds decompose at higher temperature to form Ca²⁺ and O²⁻, which increases the activity of Ca²⁺ and O²⁻ and thus raises the activity of CaO¹⁹. As the CaO content increases, the activity of CaO also increases. Meanwhile, a large amount of CaO exists in the slag in the form of Ca²⁺ and O²⁻, which causes the activity of the other components in the slag to gradually decrease¹⁷. As the SiO₂ content increases, the activity of CaO gradually decreases¹⁹. Under constant basicity, the activity of CaO first increases and then decreases as the MgO content increases²⁰. As the Al₂O₃ content increases, the activity of CaO decreases. Under basic slag conditions, Al₂O₃ behaves as an acidic oxide. It combines with CaO to form the CaO·Al₂O₃ compound, which reduces the free CaO content in the slag. In addition, some of the free O²⁻ is consumed when aluminates form, leading to a decrease in the CaO activity in the slag²¹.

Regarding the influence of the various factors on desulfurization reactions and inclusion control, as the CaO content increases, more O²⁻ is provided, the optical basicity increases, the sulfur capacity of the slag increases, and the sulfur distribution ratio between steel and slag increases, all of which facilitate desulfurization²². At a fixed Al₂O₃ content, as the CaO content increases, the CaO activity increases while the Al₂O₃ activity decreases, which facilitates the slag’s adsorption of Al₂O₃ inclusions and enhances its deoxidation capacity. However, if the CaO content becomes excessively high, the increased CaO activity results in a higher CaO content in the inclusions²³. An excessively high CaO content also leads to the precipitation of solid phase particles from the slag, which increases the viscosity, reduces fluidity, and deteriorates the desulfurization kinetics of the slag²⁴. Under basic slag conditions, SiO₂ is a stronger acidic oxide than Al₂O₃, and an excess of SiO₂ reduces the desulfurization efficiency of the slag. Under specific conditions, SiO₂ increases the viscosity of the slag and decreases its surface tension. To promote the infiltration, adsorption, and dissolution of inclusions, it is crucial to minimize the surface tension of the slag while ensuring that the viscosity remains stable¹⁹. MgO, being a basic oxide, can provide O²⁻, but its desulfurization capacity is slightly lower than that of CaO²⁴. Under certain conditions, the desulfurization capacity of the slag improves as the MgO content increases. However, once the MgO content surpasses a certain threshold, further increases reduce the desulfurization capacity. This is due to the high melting point of MgO (2800 °C), which, with increasing content, reduces slag fluidity and deteriorates the desulfurization kinetics. Al₂O₃ itself lacks desulfurization capacity, and in basic slags it acts as an acidic oxide. As the Al₂O₃ content increases, the effective CaO content in the slag decreases, leading to a reduced desulfurization capacity. This also hampers the removal of Al₂O₃ inclusions from the molten steel²². On the other hand, increasing the Al₂O₃ content within a certain range can reduce the viscosity of the slag, improve its fluidity, and enhance the desulfurization kinetics²⁵.

Commonly used experimental methods for determining slag component activity include the vapor pressure method, the chemical equilibrium method, the partition coefficient method, and the electromotive force method. Metallurgical researchers usually use the chemical equilibrium method. Accordingly, 123 experimental data sets of a(CaO) measured by this method were collected for modeling, as shown in Table 1.

Table 1 Experimental data used for the calculation of a(CaO).

A scatter matrix diagram and Pearson correlation coefficients were used to visualize the data sets and quantify the correlations among the variables, as shown in Fig. 1. In Fig. 1, the histograms along the diagonal display the distribution of each individual variable, while the scatter plots in the lower triangle illustrate the relationships between pairs of variables. For example, the left-most plot in the bottom row of Fig. 1 shows the relationship between temperature and a(CaO). The temperature values were 1823 K and 1873 K. The ranges of w(CaO), w(SiO₂), w(MgO), and w(Al₂O₃) were 3.9–55.6%, 7.1–87.6%, 0–48.2%, and 0–56.2%, respectively. The input variables, listed in descending order of their influence on a(CaO), are w(CaO), w(SiO₂), temperature, w(MgO), and w(Al₂O₃). The a(CaO) increased with increasing temperature, and a higher w(CaO) also increased a(CaO), whereas w(SiO₂) had a negative effect on a(CaO). Compared with w(CaO), w(SiO₂), and temperature, w(MgO) and w(Al₂O₃) had a smaller impact on a(CaO).
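A ranking of this kind can be sketched in a few lines of Python. The snippet below is only an illustration of ordering inputs by the absolute value of their Pearson correlation with the target; the numbers are hypothetical toy values rather than the data of Table 1, and the helper names `pearson` and `rank_by_influence` are assumptions introduced here.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def rank_by_influence(features, target):
    """Sort feature names by |r| with the target, strongest first."""
    scores = {name: pearson(col, target) for name, col in features.items()}
    return sorted(scores.items(), key=lambda kv: abs(kv[1]), reverse=True)

# Hypothetical toy values for illustration only, NOT the data of Table 1.
features = {
    "w(CaO)":  [10.0, 20.0, 30.0, 40.0, 50.0],
    "w(SiO2)": [60.0, 45.0, 40.0, 30.0, 20.0],
    "w(MgO)":  [5.0, 12.0, 4.0, 9.0, 7.0],
}
a_cao = [0.001, 0.003, 0.006, 0.012, 0.020]

ranking = rank_by_influence(features, a_cao)
for name, r in ranking:
    print(f"{name:8s} r = {r:+.3f}")
```

The sign of each coefficient gives the direction of the effect (negative for w(SiO₂) in this toy example), while the absolute value is what determines the order.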

Establishment of a(CaO) prediction model

In this study, the data set was first randomly divided into a training data set (80%) and a testing data set (20%). Then, an a(CaO) calculation model based on IWOA-CatBoost was developed using the training data set. Subsequently, based on the same testing data set, the calculated values of a(CaO) were obtained using FactSage¹⁰, IMCT¹¹, GA-BP neural network algorithm¹², and IWOA-CatBoost. Finally, the accuracy of the established model was evaluated using R², RMSE, MAE, and scatter plots. The modeling workflow is shown in Fig. 2.
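As a rough illustration of the 80/20 split, the following sketch shuffles the 123 sample indices and holds out a fifth of them. The seed and the rounding rule are assumptions made here; the text does not state the exact number of samples in each subset.

```python
import random

def split_indices(n_samples, test_fraction=0.2, seed=0):
    """Shuffle sample indices and split them into train and test lists."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Assumed rounding rule: 123 * 0.2 = 24.6 is rounded to 25 test samples.
    n_test = round(n_samples * test_fraction)
    return indices[n_test:], indices[:n_test]

train_idx, test_idx = split_indices(123)
print(len(train_idx), len(test_idx))
```

Under these assumptions the split contains 98 training and 25 testing samples, and the same `test_idx` can be reused so that every model is scored on an identical testing set.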

Figure 3 presents the IWOA-CatBoost modeling flowchart. The specific steps of the IWOA-CatBoost modeling are outlined as follows:

(1) Collect CaO activity data under different conditions from the existing literature (a total of 123 data sets);
(2) Divide the 123 experimental data sets into a training data set (80%) and a testing data set (20%), with temperature, w(CaO), w(SiO₂), w(MgO), and w(Al₂O₃) as the input variables and CaO activity as the output variable of the model;
(3) Set the whale population size n and the maximum number of iterations max_iter, and define the value ranges for the CatBoost hyperparameters: learning_rate, depth, n_estimators, l2_leaf_reg, subsample, bagging_temperature, and colsample_bylevel;
(4) Assign a set of CatBoost hyperparameters to each whale individual and initialize the whale population;
(5) Calculate the fitness value of each whale individual to determine the current best individual and the optimal value of the whale population;
(6) Use the IWOA to update the positions of the population individuals, updating the parameters a, D, A, and C, where a is the improved nonlinear convergence factor that coordinates the algorithm’s global search and local optimization;
(7) Recalculate the fitness values and, by comparing them, update the optimal solution of each whale individual and of the whale population, thereby obtaining a new population;
(8) Determine whether the algorithm meets the termination condition (minimizing the prediction error of CaO activity). If it does, proceed to (9); otherwise, return to (6);
(9) Obtain the optimal hyperparameters (learning_rate, depth, n_estimators, l2_leaf_reg, subsample, bagging_temperature, colsample_bylevel);
(10) Establish the model using the optimal hyperparameter combination;
(11) Evaluate the IWOA-CatBoost model using the testing set;
(12) Output the optimal hyperparameter combination (learning_rate, depth, n_estimators, l2_leaf_reg, subsample, bagging_temperature, colsample_bylevel) and the model evaluation metrics (R², RMSE, MAE).
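Steps (3) to (5) can be pictured with the sketch below, which encodes each whale as a point in the unit hypercube and decodes it into CatBoost hyperparameters. The unit-cube encoding, the search ranges in `BOUNDS`, and the function names are all assumptions made for illustration; the actual ranges are those of Table 3, and `fitness` stands for any user-supplied error measure (for example, a validation RMSE).

```python
import random

# Hypothetical search ranges (lower, upper, is_integer); NOT the values of Table 3.
BOUNDS = {
    "learning_rate":       (0.01, 0.5, False),
    "depth":               (2, 10, True),
    "n_estimators":        (100, 1000, True),
    "l2_leaf_reg":         (1.0, 10.0, False),
    "subsample":           (0.5, 1.0, False),
    "bagging_temperature": (0.0, 1.0, False),
    "colsample_bylevel":   (0.5, 1.0, False),
}

def init_population(n_whales, rng):
    """Step (4): each whale is a vector in [0, 1]^d, one entry per hyperparameter."""
    return [[rng.random() for _ in BOUNDS] for _ in range(n_whales)]

def decode(position):
    """Map a unit-cube position to concrete hyperparameter values."""
    params = {}
    for u, (name, (lo, hi, is_int)) in zip(position, BOUNDS.items()):
        u = min(max(u, 0.0), 1.0)  # clip positions that left the search box
        value = lo + u * (hi - lo)
        params[name] = int(round(value)) if is_int else value
    return params

def best_whale(population, fitness):
    """Step (5): return the whale whose decoded hyperparameters give the lowest error."""
    return min(population, key=lambda pos: fitness(decode(pos)))

population = init_population(n_whales=5, rng=random.Random(0))
print(decode(population[0]))
```

Clipping inside `decode` is one simple way to keep updated positions within the defined ranges; other boundary-handling rules would work equally well in this sketch.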

Improved whale optimization algorithm

The whale optimization algorithm (WOA) is an intelligent optimization algorithm proposed by Mirjalili et al.²⁸. It is characterized by a simple principle, few parameters to set, and strong global search ability. The whale optimization process is divided into three main stages: encircling prey, the bubble-net attacking method, and searching for prey. The synergy of these three stages makes the whale optimization algorithm an effective tool for obtaining optimal solutions in different scenarios²⁹.

(1) Encircling prey.

The search range of the WOA is the entire solution space. Since the location of the optimal solution is unknown, the current best candidate solution is assumed to be the target prey. Once the prey is determined, the other whales update their positions toward the target prey. This behavioral model is shown in Eq. (1).

$$\begin{cases} X(t + 1) = X^{*}(t) - A \cdot D \\ D = \left| C \cdot X^{*}(t) - X(t) \right| \end{cases} \tag{1}$$

where t represents the iteration number; X^*(t) represents the position of the current optimal solution; X(t) represents the position of the whale; X(t + 1) represents the position of the whale at the next moment; D represents the distance between the position of the whale and the current optimal solution. A and C represent the parameters for updating the position of the whales, as shown in Eq. (2).

$$\begin{cases} A = a(2r_{1} - 1) \\ C = 2r_{2} \\ a = 2(1 - t/t_{\max}) \end{cases} \tag{2}$$

where r₁ and r₂ are random numbers in the range [0, 1]; t_max is the maximum number of iterations; and a is the convergence factor, which decreases linearly from 2 to 0 as t increases.
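Eqs. (1) and (2) translate almost directly into code. The sketch below assumes that positions are plain lists of floats and that a single scalar A and C is drawn per whale, which is a simplification chosen here; the function names are likewise illustrative.

```python
import random

def linear_a(t, t_max):
    """Eq. (2): convergence factor decreasing linearly from 2 to 0."""
    return 2.0 * (1.0 - t / t_max)

def coefficients(a, rng):
    """Eq. (2): A = a(2*r1 - 1) and C = 2*r2, with r1, r2 drawn from [0, 1)."""
    A = a * (2.0 * rng.random() - 1.0)
    C = 2.0 * rng.random()
    return A, C

def encircle(x, x_best, A, C):
    """Eq. (1) per dimension: D = |C*x_best - x|, then x_new = x_best - A*D."""
    return [xb - A * abs(C * xb - xi) for xi, xb in zip(x, x_best)]

rng = random.Random(0)
A, C = coefficients(linear_a(t=10, t_max=100), rng)
print(encircle([1.0, 2.0], [3.0, 4.0], A, C))
```

Because A lies in [−a, a], the step toward the best solution shrinks as a decreases over the iterations.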

(2) Bubble-net attacking method.

During the predation phase, the constriction encirclement and spiral ascent are performed simultaneously, bringing the prey close to the sea surface for hunting. This model of hunting behavior is shown in Eq. (3).

$$X(t + 1) = D \cdot e^{bl} \cdot \cos(2\pi l) + X^{*}(t) \tag{3}$$

where b is a constant that defines the shape of the logarithmic spiral, and l is a random number in [−1, 1].
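A minimal sketch of the spiral update in Eq. (3) follows. It assumes that D here is the per-dimension distance |X*(t) − X(t)| between the whale and the current best solution, as in the original WOA formulation, since the text does not redefine D for this equation; the default b = 1 is also an assumption.

```python
import math

def spiral_update(x, x_best, l, b=1.0):
    """Eq. (3) per dimension: x_new = D * exp(b*l) * cos(2*pi*l) + x_best.

    D is taken as |x_best - x| (assumption, see the lead-in).
    """
    factor = math.exp(b * l) * math.cos(2.0 * math.pi * l)
    return [abs(xb - xi) * factor + xb for xi, xb in zip(x, x_best)]

print(spiral_update([1.0], [3.0], l=0.0))   # factor is 1, so the result is [5.0]
print(spiral_update([1.0], [3.0], l=0.25))  # cosine term is ~0, so the result is ~[3.0]
```

The random number l therefore decides how far along the spiral, and on which side of the best solution, the new position lands.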

(3) Search for prey.

The decision to perform either a global search (when |A|≥ 1) or a local search (when |A|< 1) is based on the value of |A|. When performing a global search, an individual whale is randomly selected to ensure the balance between local optimization and global search. The model is shown in Eq. (4).

$$\begin{cases} X(t + 1) = X_{\text{rand}}(t) - A \cdot D_{1} \\ D_{1} = \left| C \cdot X_{\text{rand}}(t) - X(t) \right| \end{cases} \tag{4}$$

where X_rand represents the position of a randomly selected whale individual within the population.
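The switch between global and local search can be sketched as follows. The function names are illustrative, and positions are again assumed to be lists of floats with scalar A and C.

```python
import random

def explore(x, x_rand, A, C):
    """Eq. (4) per dimension: D1 = |C*x_rand - x|, then x_new = x_rand - A*D1."""
    return [xr - A * abs(C * xr - xi) for xi, xr in zip(x, x_rand)]

def choose_reference(population, x_best, A, rng):
    """Pick the position a whale moves relative to, based on |A|."""
    if abs(A) >= 1.0:
        return rng.choice(population)  # global search around a random whale
    return x_best                      # local search around the current best

rng = random.Random(0)
population = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]
x_best = [1.0, 1.0]
reference = choose_reference(population, x_best, A=1.2, rng=rng)
print(explore([0.5, 0.5], reference, A=1.2, C=1.0))
```

Since |A| can only reach 1 while the convergence factor a is at least 1, the global search branch is available only while a remains large, which is why the schedule of a matters.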

The description above covers the standard WOA, in which the balance between local optimization and global search significantly influences optimization accuracy and convergence speed. This balance is controlled by the parameter A. The primary factor influencing A is the convergence factor a, which decreases linearly from 2 to 0 as t increases in the standard WOA. This linear schedule may lead to inadequate exploration of feasible solutions in the early stage and slow convergence in the later stage³⁰. To address this problem, a piecewise nonlinear convergence factor was proposed to improve both the exploration capability in the early stage and the convergence speed in the later stage, as shown in Eq. (5). In the early iterations, the IWOA maintains a large convergence factor with a slow reduction rate to ensure a global search within the feasible solution space. In the later stages, the convergence factor is small, and its reduction rate is rapid to enhance the speed of local optimization.

$$a = \begin{cases} 1 + \cos\left[ \left( \dfrac{2t + t_{\max}}{2 t_{\max}} \right) \pi - \dfrac{\pi}{2} \right], & t \le \dfrac{t_{\max}}{2} \\ 1 + \cos\left[ \left( \dfrac{2t - t_{\max}}{2 t_{\max}} \right) \pi + \dfrac{\pi}{2} \right], & t > \dfrac{t_{\max}}{2} \end{cases} \tag{5}$$
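To see how Eq. (5) differs from the linear schedule, the sketch below evaluates both at a few iteration counts. The branches are implemented exactly as written in Eq. (5); the value t_max = 100 is an arbitrary choice for illustration.

```python
import math

def linear_a(t, t_max):
    """Standard WOA: a decreases linearly from 2 to 0."""
    return 2.0 * (1.0 - t / t_max)

def improved_a(t, t_max):
    """Eq. (5), implemented branch by branch exactly as written."""
    if t <= t_max / 2:
        angle = (2 * t + t_max) / (2 * t_max) * math.pi - math.pi / 2
    else:
        angle = (2 * t - t_max) / (2 * t_max) * math.pi + math.pi / 2
    return 1.0 + math.cos(angle)

t_max = 100  # arbitrary illustrative value
for t in range(0, t_max + 1, 25):
    print(f"t={t:3d}  linear={linear_a(t, t_max):.4f}  improved={improved_a(t, t_max):.4f}")
```

With these definitions, the improved factor stays above the linear one during the first half of the run (for example, about 1.71 versus 1.50 at a quarter of the iterations) and below it during the second half (about 0.29 versus 0.50 at three quarters), while both start at 2, cross at 1, and end at 0. It is also worth noting that, as written, both branches of Eq. (5) simplify algebraically to a = 1 + cos(πt/t_max).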

Categorical boosting (CatBoost) algorithm

CatBoost is a gradient boosting algorithm developed by the Russian company Yandex that uses oblivious trees as its base learners. Among boosting algorithms, CatBoost demonstrates higher computational accuracy and shorter training times than XGBoost. Furthermore, CatBoost effectively addresses the overfitting issue present in LightGBM through its ordered boosting method³¹. Therefore, the prediction model of a(CaO) was established based on CatBoost.

CatBoost has the following characteristics³²: (1) It uses the Ordered Target Statistics method to encode categorical features. The data are randomly permuted several times to generate different permutation sequences. For each permutation, the categorical value of a sample is replaced by the average target value of the preceding samples in that permutation that belong to the same category, so that the feature is presented as a numerical variable. This approach enhances the modeling capability for categorical features. (2) To solve the problem of gradient estimation bias, the first stage uses the ordered boosting method to obtain unbiased gradient estimates; in the second stage, the traditional gradient boosting decision tree algorithm is used for optimization. This method effectively reduces the bias caused by gradient estimation, thereby solving the problem of prediction shift and improving the accuracy and generalization ability of the model.
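The idea behind ordered target statistics can be illustrated with a short, library-independent sketch. It processes samples in one given order (standing in for one random permutation) and encodes each categorical value using only the targets seen earlier. The smoothing terms `prior` and `weight` and the toy data are assumptions introduced here and are not taken from the CatBoost implementation.

```python
def ordered_target_statistic(categories, targets, prior, weight=1.0):
    """Encode each sample from the targets of EARLIER samples of the same category.

    The input order plays the role of one random permutation.
    """
    sums, counts, encoded = {}, {}, []
    for cat, y in zip(categories, targets):
        s, n = sums.get(cat, 0.0), counts.get(cat, 0)
        encoded.append((s + weight * prior) / (n + weight))
        # The sample's own target is added only after it has been encoded,
        # so it never leaks into its own feature value.
        sums[cat] = s + y
        counts[cat] = n + 1
    return encoded

# Hypothetical toy data for illustration only.
cats = ["A", "B", "A", "A", "B"]
ys = [1.0, 0.0, 3.0, 2.0, 4.0]
print(ordered_target_statistic(cats, ys, prior=2.0))  # [2.0, 2.0, 1.5, 2.0, 1.0]
```

In the present study all five inputs (temperature and the four oxide contents) are numerical, so this encoding step is background on the algorithm rather than something the a(CaO) model depends on.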

Model evaluation

Coefficient of determination (R²), root mean square error (RMSE), and mean absolute error (MAE) are adopted as the performance evaluation criteria for different models³³. Table 2 shows the performance evaluation criteria.

Table 2 Performance evaluation criteria.
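For reference, the three criteria can be computed as in the sketch below. It uses the standard textbook definitions, which are assumed here to match those listed in Table 2; the sample values are hypothetical.

```python
import math

def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

def rmse(y_true, y_pred):
    """Root mean square error."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

# Hypothetical values for illustration only.
y_true = [1.0, 2.0, 3.0, 4.0]
y_pred = [2.0, 2.0, 3.0, 3.0]
print(r2_score(y_true, y_pred), rmse(y_true, y_pred), mae(y_true, y_pred))
```

A higher R² and lower RMSE and MAE indicate better agreement between calculated and experimental values.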

Results and discussion

Hyperparameter optimization of CatBoost

The improved whale optimization algorithm (IWOA) was used to explore and optimize selected hyperparameters of CatBoost; the corresponding ranges and optimal values are shown in Table 3. The optimized hyperparameters are as follows:

(1) Learning rate (learning_rate): controls the step size of each boosting iteration and therefore the convergence speed of the algorithm. A smaller learning rate makes the model more stable but may require more training iterations to reach optimal performance.
(2) Tree depth (depth): the maximum depth of each tree. Increasing the depth raises the complexity of the model and can enhance its performance, but an excessive depth may lead to overfitting.
(3) Number of trees (n_estimators): the number of trees in the model. Increasing the number of trees raises the model’s complexity and can improve its performance, but too many trees may lead to overfitting.
(4) L2 regularization coefficient (l2_leaf_reg): controls the degree of regularization in the model. In general, smaller values make the model more prone to overfitting, while larger values make it more prone to underfitting.
(5) Subsample ratio (subsample): the fraction of the training data that is randomly sampled, which reduces overfitting by introducing diversity into the training set.
(6) Bagging temperature (bagging_temperature): controls how strongly the training samples are randomly re-weighted in each iteration (Bayesian bootstrap). A smaller value makes the sample weights more uniform, which can reduce the variance of the model and improve its stability.
(7) Feature sampling ratio (colsample_bylevel): the proportion of features considered at each split selection. A smaller ratio can reduce the model’s variance, thereby improving its stability³⁴.

Other hyperparameters were left at the default values of the CatBoost Python library, so that the optimization effort was concentrated on the key parameters.

Table 3 Best parameters for CatBoost within the parameter boundaries.

Comparison of IWOA-CatBoost model with other models and methods

Different performance evaluation metrics were used to compare FactSage, IMCT, and the GA-BP model with the optimal IWOA-CatBoost model, as shown in Table 4.

Table 4 Performance evaluation of different models and methods.

Table 4 shows the performance of the various models and methods. The IWOA-CatBoost model performed better than FactSage, IMCT, and the GA-BP model on every metric, achieving the highest R² (0.9200), the lowest RMSE (0.0042), and the lowest MAE (0.0030) in predicting a(CaO). Its R² was 0.3804 higher than that of FactSage, 0.4309 higher than that of IMCT, and 0.1625 higher than that of the GA-BP model. Its RMSE and MAE were 0.0088 and 0.0053 lower than those of FactSage, 0.0053 and 0.0028 lower than those of IMCT, and 0.0032 and 0.0020 lower than those of the GA-BP model, respectively. Possible explanations for these results are as follows: (1) FactSage is a thermodynamic calculation software widely used in metallurgy for slag system analysis. It uses relevant databases to calculate the thermodynamic properties of slag³⁵. However, it may not capture the complex non-linear interactions between slag components as effectively as data-driven models such as CatBoost. Moreover, FactSage’s predictive accuracy is highly dependent on the quality and extent of the thermodynamic databases it uses³⁶, which may limit its applicability and accuracy in predicting CaO activity compared with machine learning models that incorporate real-time data for modeling and prediction. (2) IMCT is a classical model used for predicting slag properties, based on a transfer function that models the relationship between slag composition and activity. While it provides reasonable estimates in many cases, it is limited by its linear relationships and least squares method, which can lead to less accurate predictions in complex slag systems where interactions are more intricate. (3) GA-BP is a hybrid model combining a genetic algorithm for optimization and a backpropagation neural network for prediction. While GA-BP performs well in capturing non-linear relationships, its performance depends heavily on the fine-tuning of the network’s hyperparameters. Additionally, the genetic algorithm optimization may lead to overfitting or convergence to local minima, which could affect the model’s predictive accuracy³⁷. (4) The IWOA maintains a large convergence factor with a slow reduction rate in the early iterations to ensure a global search within the feasible solution space. In the later stages, the convergence factor is small, and its reduction rate is rapid to enhance the speed of local optimization. CatBoost’s ability to handle feature interactions and non-linearities makes it more adaptable to complex slag systems³². IWOA optimizes the hyperparameters of CatBoost, which reduces the risk of overfitting compared with GA-BP and avoids the reliance on thermodynamic databases that characterizes FactSage.
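The gaps quoted in this comparison imply the absolute metric values of the three reference methods. The short calculation below derives them from the numbers stated in the text only; the results are a back-calculation rounded to four decimals, not values read from Table 4.

```python
# Metrics reported for the IWOA-CatBoost model.
IWOA = {"R2": 0.9200, "RMSE": 0.0042, "MAE": 0.0030}

# Reported gaps: how much higher its R2 is, and how much lower its RMSE and MAE are.
GAPS = {
    "FactSage": {"R2": 0.3804, "RMSE": 0.0088, "MAE": 0.0053},
    "IMCT":     {"R2": 0.4309, "RMSE": 0.0053, "MAE": 0.0028},
    "GA-BP":    {"R2": 0.1625, "RMSE": 0.0032, "MAE": 0.0020},
}

def implied_baselines(best, gaps):
    """Back out each reference method's metrics from the reported differences."""
    return {
        name: {
            "R2": round(best["R2"] - gap["R2"], 4),
            "RMSE": round(best["RMSE"] + gap["RMSE"], 4),
            "MAE": round(best["MAE"] + gap["MAE"], 4),
        }
        for name, gap in gaps.items()
    }

baselines = implied_baselines(IWOA, GAPS)
for name, metrics in baselines.items():
    print(name, metrics)
```

This gives R² values of 0.5396 (FactSage), 0.4891 (IMCT), and 0.7575 (GA-BP), with RMSE/MAE of 0.0130/0.0083, 0.0095/0.0058, and 0.0074/0.0050, respectively. The GA-BP value of 0.7575 agrees with the R² quoted for that model in the discussion of Fig. 4.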

In addition, scatter plots, confidence intervals, and absolute error plots were used to evaluate the performance of the various models and methods. Figure 4 compares the experimental and calculated a(CaO) for the different models and methods using the same testing data set. The closer the points lie to the 45-degree diagonal line, the smaller the error between the calculated and experimental values. The coefficient of determination (R²) was used to evaluate each model’s goodness of fit, with a value closer to 1 indicating a stronger fitting ability. The confidence interval, shown as light blue shading and set at 95%, reflects the uncertainty in the estimation results. A narrower confidence interval indicates greater stability in the model’s predictive performance³⁸. In Fig. 4, the scatter points of the IWOA-CatBoost prediction model of a(CaO) lie closer overall to the 45-degree dotted diagonal line than those of FactSage, IMCT, and the GA-BP model. According to the R² values, the IWOA-CatBoost model (R² = 0.9200) demonstrates the strongest fitting ability, followed by the GA-BP model (R² = 0.7575), whereas FactSage and IMCT display relatively weaker fitting performance. Additionally, as shown in Fig. 4, the IWOA-CatBoost model has the narrowest confidence interval, suggesting greater stability in its predictions. The slope of the regression line of the IWOA-CatBoost model is also closer to 1 than those of the other models, and the scatter of predicted versus actual values is relatively concentrated, reflecting high prediction accuracy. Based on these evaluation metrics, the overall performance of the IWOA-CatBoost model is superior to that of the other models.

Figure 5 compares the absolute errors of the different models and methods. Compared with FactSage, IMCT, and GA-BP, the absolute error of the a(CaO) values calculated by the IWOA-CatBoost model is closer overall to the zero reference line. Taken together, the scatter plots, confidence intervals, and absolute error plots show that the IWOA-CatBoost model has advantages over FactSage, IMCT, and the GA-BP model in terms of confidence interval width, stability, fitting ability, and generalization performance. In the future, a dedicated activity database could be developed and shared to further optimize the hyperparameters of the model established in this study. The approach and algorithm used to develop the a(CaO) prediction model can also be applied to predicting the activity of other slag components or to other metallurgical applications (e.g., predicting molten steel temperature, steel composition, and alloy yield).

Conclusions

Through the collection and analysis of experimental data of a(CaO), a feasible model was established to predict the a(CaO) using the IWOA-CatBoost. The following conclusions can be drawn.

(1) Correlation analysis showed that the input variables, listed in descending order of their influence on a(CaO), were w(CaO), w(SiO₂), temperature, w(MgO), and w(Al₂O₃). The optimal hyperparameters of the IWOA-CatBoost model were a learning_rate of 0.3276, depth of 8, n_estimators of 684, l2_leaf_reg of 8.6729, subsample of 0.6301, bagging_temperature of 0.5256, and colsample_bylevel of 0.8468.
(2) The performance of the optimal IWOA-CatBoost model was evaluated and compared with that of existing models and methods. The IWOA-CatBoost model achieved the highest R² (0.9200), the lowest RMSE (0.0042), and the lowest MAE (0.0030) in predicting a(CaO), demonstrating superior stability, fitting accuracy, and generalization capability, thereby supporting its feasibility for calculating a(CaO). The method used to establish the a(CaO) prediction model can also be applied to predicting the activity of other slag components or to other metallurgical applications.

Data availability

For data inquiries, please contact Zicheng Xin (sklxzc@163.com).

Acknowledgements

This project is funded by the National Natural Science Foundation of China (Grant Numbers 52374321 and 51974023), the State Key Laboratory of Advanced Metallurgy, University of Science and Technology Beijing (Grant Number 41621005), and the Youth Science and Technology Innovation Fund of Jianlong Group-University of Science and Technology Beijing (Grant Number 20231235).

Author information

Authors and Affiliations

State Key Laboratory of Advanced Metallurgy, University of Science and Technology Beijing, Beijing, 100083, China
Zi-cheng Xin, Jiang-shan Zhang & Qing Liu
School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, 100083, China
Zi-cheng Xin

Contributions

Zicheng Xin, Qing Liu: Conceptualization, Project Administration, Supervision, Methodology, Funding Acquisition; Jiangshan Zhang: Data Curation, Formal Analysis, Validation; Zicheng Xin, Qing Liu: Writing—original draft, Writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Corresponding authors

Correspondence to Jiang-shan Zhang or Qing Liu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Xin, Zc., Zhang, Js. & Liu, Q. Predicting CaO activity in multiple slag system using improved whale optimization algorithm and categorical boosting. Sci Rep 15, 9533 (2025). https://doi.org/10.1038/s41598-025-93980-9

Received: 18 November 2024
Accepted: 11 March 2025
Published: 19 March 2025
Version of record: 19 March 2025
DOI: https://doi.org/10.1038/s41598-025-93980-9
