Abstract
The activity of slag components is one of the primary factors influencing the thermodynamic properties of slag. In this study, a feasible model was established to predict the a(CaO) using improved whale optimization algorithm (IWOA) and Categorical Boosting (CatBoost). The effects of other variables on a(CaO) were listed in descending order of influence as follows: w(CaO), w(SiO2), temperature, w(MgO), and w(Al2O3). And the IWOA-CatBoost model achieved the highest R2 value of 0.9200, lowest RMSE of 0.0042, and lowest MAE of 0.0030 in predicting the a(CaO). The performance of the optimal IWOA-CatBoost model was evaluated and compared with that of known models. The results demonstrate that the IWOA-CatBoost model outperformed existing models and methods, such as the Factsage, ion and molecule coexistence theory, and genetic algorithm—backpropagation neural network. The accurate calculation of slag component activity is of great significance to the analysis of the thermodynamic properties of slag. Meanwhile, the approach and algorithm used to develop the a(CaO) prediction model can also be applied to predicting the activity of other slag components or other metallurgical applications (e.g., predicting molten steel temperature, steel composition, and alloy yield).
Similar content being viewed by others
Introduction
With the advancement of the steel industry toward higher-end, smarter and greener production, the requirements for product quality continue to increase1. Slag has a significant impact on the quality of molten steel, and desulfurization is one of its primary tasks in LF refining process2,3,4. The desulfurization reaction primarily occurs through interfacial reactions between the steel and slag, and the activity of each slag component significantly impacts the desulfurization process. In addition, activity theory can explain a range of critical phenomena in metallurgical processes, such as phase equilibria and phase transformations, element migration, and the direction of chemical reactions. With the deepening research on physicochemical properties of slag, the research on slag component activity has been paid more attention by metallurgists.
Metallurgists have developed a range of thermodynamic models for molten slag, including the complete ionic solution model5, the regular solution model6, the new generation solution geometrical model7, and the ion and molecule coexistence theory (IMCT)8. Chang et al.9 calculated the liquidus temperature, activity, and cooling crystallization process of slag using the FactSage thermodynamic software, and analyzed the effect mechanism of Al2O3 on the slag viscosity. Tang et al.10 calculated the slag components activity in the CaO–MgO–Al2O3–SiO2 refining slag system to explore the thermodynamic equilibrium relationships among refining slag, molten steel, and inclusions using the FactSage. Guo et al.11 developed an activity calculation model for the CaO–SiO2–MgO–Al2O3 slag system based on the IMCT, and validated this model using experimental data. However, FactSage thermodynamic software typically assumes that reactions reach equilibrium. In actual metallurgical processes, the reactions between slag and metal do not always achieve thermodynamic equilibrium, especially under conditions of rapid reactions or non-equilibrium states. As a result, the calculated activity value may not accurately reflect situation in actual process. Additionally, most models and methods involve assumed conditions and overlook the effects of interactions between components on activity, resulting in deviations between calculated and experimental values. Therefore, the thermodynamic model should be further developed and improved11.
With the rapid development of machine learning theory, Wu et al.12 established an activity prediction model for multiple slag systems based on a genetic algorithm (GA)—back propagation (BP) neural network algorithm and demonstrated a good agreement between this model’s calculated values and experimental values. However, the study validated the accuracy of the constructed model solely by comparing calculated values with literature values in graphical form, without conducting a comprehensive comparative analysis with known models and methods, and employed the traditional GA-BP neural network algorithm. Since the emergence of deep learning, deep neural network has been widely applied across diverse industries13. However, nearly all deep neural network algorithms currently require large datasets for effective training. In fact, challenges such as limited data availability and high data collection costs are common, so small sample datasets learning is particularly important. Categorical Boosting (CatBoost), an improved decision tree algorithm, was applied to the prediction of a(CaO) in multiple slag system with small sample datasets in this study. The CatBoost algorithm can achieve high predictive accuracy with small sample set14. Jin and Gu et al.15,16 validated the feasibility of CatBoost for small-sample prediction in the contexts of the blasting fragment large block percentage ratio (a regression study with 36 data samples) and fault diagnosis of photovoltaic array (a classification study with 55 training samples and 110 training samples), respectively.
Based on the aforementioned analysis, the data of a(CaO) obtained using the same activity measurement method was first collected in this study. Next, a correlation analysis was performed to assess the effect of various factors on a(CaO). Furthermore, the convergence factor of the standard whale optimization algorithm was improved to enhance the global search ability in the early stage and the local optimization speed in the later stage. Then, a prediction model of a(CaO) was established based on the improved whale optimization algorithm (IWOA)—CatBoost. Finally, various statistical evaluation metrics were employed to compare and assess the established model against existing models and methods (such as FactSage, IMCT, GA-BP), demonstrating the accuracy of the established model. Meanwhile, the modeling approach presented in this study can also be applied to predicting the activity of other slag components.
Data collection and data analysis
Slag is a multi-component melt mainly composed of different oxides and is a typical by-product in steelmaking. Refining slag plays an important role in the steelmaking process, as sulfur is primarily removed from the molten steel through interfacial chemical reaction of the steel-slag. Meanwhile, the activity of CaO in the slag also has a significant impact on the content of CaO in inclusions17,18. Therefore, studying the activity of slag components is of great significance. The following analysis, based on metallurgical mechanisms, examines the impact of various factors (temperature and slag composition, including CaO, SiO2, MgO, and Al2O3 content) on CaO activity, desulfurization reactions, and inclusion control.
In terms of the impact of different factors on CaO activity, as the temperature increases, the activity of CaO increases. The possible reason is that SiO2, as a reactant, always involves CaO in the reaction. According to Le Chatelier’s principle and IMCT, compounds such as CaSiO3 and Ca2SiO4 decompose to form Ca2+ and O2−, which leads to an increase in the activity of Ca2+ and O2−, thus raising the activity of CaO19. As the CaO content increases, the activity of CaO also increases. Meanwhile, a large amount of CaO exists in the slag in the form of Ca2+ and O2−, which causes the activity of other components in the slag to gradually decrease17. As the SiO2 content increases, the activity of CaO gradually decreases19. Under constant basicity, the activity of CaO first increases and then decreases as the MgO content increases20. As the Al2O3 content increases, the activity of CaO decreases. Under basic slag conditions, Al2O3 is acidic. The combination of Al2O3 and CaO forms the CaO·Al2O3 compound, which reduces the free CaO content in the slag. Meanwhile, some of the free O2- is consumed when forming aluminates, leading to a decrease in the CaO activity in the slag21.
In terms of the influence of various factors on desulfurization reactions and inclusion control, as the CaO content increases, more O2- is provided, the optical basicity increases, the sulfur capacity of the slag increases, and the sulfur distribution ratio between steel and slag increases, all of which facilitate desulfurization22. Under a fixed Al2O3 content, as the CaO content increases, the CaO activity increases while the Al2O3 activity decreases, which facilitates the slag’s adsorption of Al2O3 inclusions and enhances its deoxidation capacity. However, if the CaO content becomes excessively high, the increased CaO activity results in a higher CaO content in the inclusions23. Meanwhile, an excessively high CaO content leads to the precipitation of solid phase particles from the slag, which increases the viscosity, reduces fluidity, and deteriorates the desulfurization kinetics of slag24. Under basic slag conditions, SiO2 is a stronger acidic oxide than Al2O3, and an excess of SiO2 results in a decline in the desulfurization efficiency of the slag. Under specific conditions, SiO2 increases the viscosity of the slag and decreases its surface tension. To promote the infiltration, adsorption, and dissolution of inclusions, it is crucial to minimize the surface tension of the slag while ensuring that the viscosity remains stable19. MgO, being a basic oxide, can provide O2-, but its desulfurization capacity is slightly lower than that of CaO24. Under certain conditions, as the MgO content increases, the desulfurization capacity of the slag improves. However, when the MgO content surpasses a certain threshold, further increases lead to a decrease in desulfurization capacity. This is due to the high melting point of MgO (2800 °C), which, with increasing content, results in reduced slag fluidity and deteriorated desulfurization kinetics. Al2O3 itself lacks desulfurization capacity. In basic slags, Al2O3 acts as an acidic oxide. As the Al2O3 content increases, the effective CaO content in the slag decreases, leading to a reduced desulfurization capacity. This also hampers the removal of Al2O3 inclusions from the molten steel22. Increasing the Al2O3 content within a certain range can reduce the viscosity of the slag, improve its fluidity, and enhance the desulfurization kinetics25.
For the determination of slag component activity, commonly used experimental methods include the vapor pressure method, chemical equilibrium method, partition coefficient method, and electromotive force method. Metallurgical researchers usually use the chemical equilibrium method to measure slag component activity. Accordingly, 123 experimental data sets of a(CaO) measured by this method were collected for modeling research, as shown in Table 1
A scatter matrix diagram and a Pearson correlation coefficient were used to visualize the data sets and reflect the correlation between these data, as shown in Fig. 1. In Fig. 1, the histograms along the diagonal display the distribution of each individual variable, while the scatter plots in the lower triangles illustrate the relationships between pairs of variables. For example, the left-most plot in the bottom row of Fig. 1 shows the relationship between temperature and the a(CaO). The temperature values were 1823 K and 1873 K. The range of the w(CaO), w(SiO2), w(MgO), and w(Al2O3) was 3.9–55.6%, 7.1–87.6%, 0–48.2%, and 0–56.2%, respectively. Meanwhile, the effects of other variables on a(CaO) are listed in descending order of influence as follows: w(CaO), w(SiO2), temperature, w(MgO), and w(Al2O3). The a(CaO) increased with the increase of the temperature. A higher w(CaO) was beneficial to improve the a(CaO). The w(SiO2) has a negative effect on the a(CaO). Compared to w(CaO), w(SiO2), and temperature, w(MgO) and w(Al2O3) have a smaller impact on a(CaO).
Scatter matrix diagram of temperature (T), w(CaO), w(SiO2), w(MgO), and w(Al2O3). Note: w represents the weight percent of slag components; a(CaO) denotes the activity of CaO.
Establishment of a(CaO) prediction model
In this study, the data set was first randomly divided into a training data set (80%) and a testing data set (20%). Then, an a(CaO) calculation model based on IWOA-CatBoost was developed using the training data set. Subsequently, based on the same testing data set, the calculated values of a(CaO) were obtained using FactSage10, IMCT11, GA-BP neural network algorithm12, and IWOA-CatBoost. Finally, the accuracy of the established model was evaluated using R2, RMSE, MAE, and scatter plots. The modeling workflow is shown in Fig. 2.
Modeling workflow.
Figure 3 presents the IWOA-CatBoost modeling flowchart. The specific steps of the IWOA-CatBoost modeling are outlined as follows:
-
(1)
Collect CaO activity data under different factors from existing literature (a total of 123 data sets);
-
(2)
Divide the 123 experimental data sets into a training data set (80%) and a testing data set (20%), with factors such as temperature, w(CaO), w(SiO2), w(MgO), and w(Al2O3) as input variables, and CaO activity as the output variable of the model;
-
(3)
Set the whale population size n, maximum number of iterations max_iter, and define the value ranges for the CatBoost hyperparameters: learning_rate, depth, n_estimators, l2_leaf_reg, subsample, bagging_temperature, and colsample_bylevel;
-
(4)
Set the CatBoost hyperparameters to each whale individual and initialize the whale population;
-
(5)
Calculate the fitness value of each whale individual to determine the current best individual and the optimal value of the whale population;
-
(6)
The improved IWOA algorithm is used to update the positions of the population individuals, updating parameters a, D, A, and C, where a represents the improved nonlinear convergence factor, which coordinates the algorithm’s global search and local optimization;
-
(7)
Calculate the fitness values, and through the comparison of fitness values, update the optimal solution for each whale individual and the optimal solution for the whale population, thereby obtaining a new population;
-
(8)
Determine whether the algorithm meets the termination conditions (minimizing the prediction error of CaO activity). If satisfied, proceed to (9); otherwise, proceed to (6);
-
(9)
Obtain the optimal hyperparameters (learning_rate, depth, n_estimators, l2_leaf_reg, subsample, bagging_temperature, colsample_bylevel);
-
(10)
Establish the model using the optimal hyperparameter combination;
-
(11)
Evaluate the IWOA-CatBoost model using the testing set;
-
(12)
Output the optimal hyperparameter combination (learning_rate, Depth, n_estimators, l2_leaf_reg, subsample, bagging_temperature, colsample_bylevel) and the model evaluation metrics (R2, RMSE, MAE).
IWOA-CatBoost modeling flowchart.
Improved whale optimization algorithm
Whale optimization algorithm (WOA) is an intelligent optimization algorithm proposed by Mirjalili et al.28, which has the characteristics of simple algorithm principle, few parameter setting and strong global search ability. The whale optimization process is divided into three main stages: encircling prey, bubble-net attacking method, and search for prey. The synergy of these three stages makes the whale optimization algorithm an effective tool for obtaining optimal solutions in different scenarios29.
-
(1)
Encircling prey.
The search range of the WOA algorithm is the entire solution space. Since the location of the optimal solution is unknown, a candidate solution is assumed to be the target prey. Once the prey is determined, the other whales update their positions to the target prey. This behavioral model is shown in Eq. (1).
where t represents the iteration number; X*(t) represents the position of the current optimal solution; X(t) represents the position of the whale; X(t + 1) represents the position of the whale at the next moment; D represents the distance between the position of the whale and the current optimal solution. A and C represent the parameters for updating the position of the whales, as shown in Eq. (2).
where r1 and r2 represent the random number in the range of [0, 1]; tmax represents the maximum number of iterations; a represents the convergence factor that linearly decreases from 2 to 0 as t increases.
-
(2)
Bubble-net attacking method.
During the predation phase, the constriction encirclement and spiral ascent are performed simultaneously, bringing the prey close to the sea surface for hunting. This model of hunting behavior is shown in Eq. (3).
where, b is a constant for defining the shape of the logarithmic spiral; l is a random number in [− 1, 1].
-
(3)
Search for prey.
The decision to perform either a global search (when |A|≥ 1) or a local search (when |A|< 1) is based on the value of |A|. When performing a global search, an individual whale is randomly selected to ensure the balance between local optimization and global search. The model is shown in Eq. (4).
where Xrand represents the position of a randomly selected whale individual within the population.
The above introduction is the standard WOA, where the balance between local optimization and global search significantly influences optimization accuracy and convergence speed. This balance is controlled by the parameter A. The primary factor influencing A is the convergence factor a, which linearly decreases from 2 to 0 as t increases in the standard WOA. This approach may lead to inadequate exploration of feasible solutions in the early stage and slow convergence in the later stage30. For this problem, a piecewise nonlinear convergence factor was proposed to improve both the exploration capability in the early stage and the convergence speed in the later stage, as shown in Eq. (5). Based on this, the IWOA algorithm is used to ensure a global search within the feasible solution by maintaining a large convergence factor with a slow reduction rate in the early iterations. In the later stages, the convergence factor is small, and its reduction rate is rapid to enhance the speed of local optimization.
Categorical boosting (CatBoost) algorithm
CatBoost is an algorithm developed by the Russian company Yandex, based on oblivious trees as its base learners. In the boosting algorithms, CatBoost demonstrates higher computational accuracy and shorter training times compared to XGBoost. Furthermore, CatBoost effectively addresses the overfitting issue present in LightGBM through its ordered boosting method31. Therefore, a prediction model of a(CaO) was established based on CatBoost.
CatBoost has the following characteristics32: (1) It utilizes the Ordered Target-based Statistics method for feature label classification, employing a core principle of ranking to randomly permute the data through various methods, thereby generating different permutation sequences. Subsequently, for each permutation sequence, the average target value of samples belonging to the same category is calculated by estimating each sample. When handling the categorical features of each sample, the average target value of the previous categorical labels of that sample is utilized and presented in the form of numerical variables. This approach enhances the modeling capability of categorical features. (2)To solve the problem of gradient estimation bias, the step size of the gradient is improved in the first stage by utilizing unbiased estimates and employing the ordered boosting method for gradient calculation and estimation; In the second stage, the traditional gradient boosting decision tree algorithm is used for optimization. This method can effectively reduce the bias caused by gradient estimation, thereby solving the problem of prediction shift and improving the accuracy and generalization ability of the model.
Model evaluation
Coefficient of determination (R2), root mean square error (RMSE), and mean absolute error (MAE) are adopted as the performance evaluation criteria for different models33. Table 2 shows the performance evaluation criteria.
Results and discussion
Hyperparameter optimization of CatBoost
The Improved Whale Optimization Algorithm (IWOA) was used to explore and optimize specific hyperparameters of CatBoost, with the corresponding ranges and optimal values shown in Table 3. Learning rate (learning_rate): The learning rate is used to control the convergence speed of the algorithm. A smaller learning rate can make the model more stable but may require more training iterations to reach optimal performance. Depth of tree (depth): The depth of a tree refers to the maximum depth of each tree. Increasing the depth of the tree can improve the complexity of the model, thereby enhancing its performance. However, if the depth of the tree is too large, it may lead to overfitting. Number of tree (n_estimators): The number of trees refers to the number of trees in the model. Increasing the number of trees can enhance the model’s complexity, thereby improving its performance. However, if the number of trees is too large, it may lead to overfitting. L2 regularization coefficient (l2_leaf_reg): The L2 regularization coefficient is used to control the degree of regularization in the model. In general, smaller values of l2_leaf_reg tend to make the model more prone to overfitting, while larger values of l2_leaf_reg tend to make the model more prone to underfitting. Subsample ratio (subsample): The main purpose of the subsample parameter is to reduce overfitting by randomly selecting a portion of the data to simulate the diversity of the training set. Bagging temperature (bagging_temperature): The Bagging parameter is used to control the proportion of samples in each iteration step. A smaller Bagging parameter can reduce the variance of the model, thereby improving its stability. Feature sampling ratio (colsample_bylevel): The feature sampling ratio refers to the proportion of features considered for splitting at each node. A smaller feature sampling ratio can reduce the model’s variance, thereby improving the model’s stability34. Other hyperparameters were set to their default values as provided by Python’s CatBoost library. This approach aimed to improve model performance by focusing optimization efforts on key parameters, while allowing default settings for other parameters.
Comparison of IWOA-CatBoost model with other models and methods
Different performance evaluation metrics were used to compare the MLR, MLP, and KNN models with the optimal IWOA-CatBoost model, as shown in Table 4.
Table 4 show the performance of the various models and methods. In Table 4, the R2 value of the IWOA-CatBoost model were better than those of the FactSage, IMCT, and GA-BP model. The IWOA-CatBoost model achieved the highest R2 value of 0.9200, lowest RMSE of 0.0042, and lowest MAE of 0.0030 in predicting the a(CaO). Meanwhile, the R2 value of the IWOA-CatBoost model of the a(CaO) was 0.3804 higher than those of the FactSage, 0.4309 higher than those of the IMCT, and 0.1625 higher than those of the GA-BP model, respectively. The RMSE and MAE values of the IWOA-CatBoost model of the a(CaO) were 0.0088 and 0.0053 lower than those of the FactSage, 0.0053 and 0.0028 lower than those of the IMCT, and 0.0032 and 0.0020 lower than those of the GA-BP model, respectively. The possible analysis of the above results is as follows: (1) FactSage is a thermodynamic calculation software widely used in metallurgy for slag system analysis. It utilizes relevant databases to calculate the thermodynamic properties of slag35. However, it may not capture the complex non-linear interactions between slag components as effectively as data-driven models like CatBoost. Moreover, Factsage’s predictive accuracy is highly dependent on the quality and extent of the thermodynamic databases it uses36, which may limit its applicability and accuracy in predicting CaO activity compared to machine learning models that incorporate real-time data for modeling and prediction. (2) IMCT is a classical model used for predicting slag properties, based on a transfer function that models the relationship between slag composition and activity. While it provides reasonable estimates in many cases, it is limited by its linear relationships and least square method, which can lead to less accurate predictions in complex slag systems where interactions are more intricate. (3) GA-BP is a hybrid model combining genetic algorithms for optimization and a backpropagation neural network for prediction. While GA-BP performs well in capturing non-linear relationships, its performance heavily depends on the fine-tuning of the network’s hyperparameters. Additionally, the genetic algorithm optimization may lead to overfitting or convergence to local minima, which could affect the model’s predictive accuracy37. (4) The IWOA algorithm is used to ensure a global search within the feasible solution by maintaining a large convergence factor with a slow reduction rate in the early iterations. In the later stages, the convergence factor is small, and its reduction rate is rapid to enhance the speed of local optimization. CatBoost’s ability to handle feature interactions and non-linearities makes it more adaptable to complex slag systems32. IWOA optimizes the hyperparameters of CatBoost, which reduces the risk of overfitting compared to GA-BP and avoids the reliance on thermodynamic databases like Factsage.
In addition, the scatter plot, confidence interval, and absolute error plot were used to evaluate the performance of various models and methods. Figure 4 shows the comparison of the experimental and calculated a(CaO) on different models and methods using the same testing data set. The closer the scatter to the 45-degree diagonal line, the smaller the error between the calculated and experimental values. The coefficient of determination (R2) was used to evaluate the model’s goodness of fit, with the value closer to 1 indicating a stronger fitting ability. The confidence interval, shown as light blue shading and typically set at 95%, was used to reflect the uncertainty in the estimation results. A narrower confidence interval indicates greater stability in the model’s predictive performance38. In Fig. 4, the overall scatter plots of the IWOA-CatBoost prediction model of the a(CaO) was closer to the 45-degree diagonal dotted line than that of the FactSage, IMCT, and GA-BP model. Meanwhile, according to the R2 values, the IWOA-CatBoost model (R2 = 0.9200) demonstrates the strongest fitting ability, followed by the GA-BP model (R2 = 0.7575), whereas the FactSage and IMCT model display relatively weaker fitting performance. Additionally, as shown in Fig. 4, the IWOA-CatBoost model has the narrowest confidence interval, suggesting greater stability in its predictions. Based on various model evaluation metrics, the overall performance of the IWOA-CatBoost model is superior to that of the other models. Meanwhile, the regression line slope of the IWOA-CatBoost model is closer to 1 compared to other models, and the scatter distribution of predicted versus actual values is relatively concentrated, reflecting high prediction accuracy.
Comparison of the experimental and calculated a(CaO) on different models and methods using the same testing data set: (a) FactSage, (b) IMCT, (c) GA-BP, (d) IWOA-CatBoost.
Figure 5 presents a comparison of the absolute values of the errors for different models and methods. In comparison with the FactSage, IMCT, and GA-BP, the absolute error of the a(CaO) value calculated by the IWOA-CatBoost model is overall closer to the zero reference line. Through an analysis of scatter plots, confidence intervals, and absolute error plots of different models, the IWOA-CatBoost model demonstrates advantages over the FactSage, IMCT, and GA-BP models in terms of confidence interval width, stability, fitting ability, and generalization performance. In the future, a dedicated activity database can be developed and data-sharing can be realized to further optimize the hyperparameters of the model established in this study. Meanwhile, the approach and algorithm used to develop the a(CaO) prediction model can also be applied to predicting the activity of other slag components or other metallurgical applications (e.g., predicting molten steel temperature, steel composition, and alloy yield), with stronger applicability, higher calculation accuracy and stronger generalization ability.
Comparison of the absolute values of the errors for different models and methods.
Conclusions
Through the collection and analysis of experimental data of a(CaO), a feasible model was established to predict the a(CaO) using the IWOA-CatBoost. The following conclusions can be drawn.
-
(1)
Through correlation analysis, the effects of other variables on a(CaO) were listed in descending order of influence as follows: w(CaO), w(SiO2), temperature, w(MgO), and w(Al2O3). The optimal structures of the IWOA-CatBoost model had learning_rate of 0.3276, depth of 8, n_estimators of 684, l2_leaf_reg of 8.6729, subsample 0.6301, bagging_temperature of 0.5256, and colsample_bylevel of 0.8468.
-
(2)
The performance of the optimal IWOA-CatBoost model was evaluated and compared with that of existing models and methods. The IWOA-CatBoost model achieved the highest R2 value of 0.9200, lowest RMSE of 0.0042, and lowest MAE of 0.0030 in predicting the a(CaO), demonstrating superior stability, fitting accuracy, and generalization capability, thereby supporting its feasibility for calculating a(CaO). Meanwhile, the establishment method of a(CaO) prediction model can also be applied to the prediction of other slag components activity or other metallurgical applications.
Data availability
For data inquiries, please contact Zicheng Xin (sklxzc@163.com).
References
Yin, R. Y. Topic of times of metallurgy-Get through process, communicate different levels and open up a new theory. Iron Steel 56, 4–9 (2021).
Xin, Z. C. et al. Sulphide capacity prediction of CaO–SiO2–MgO–Al2O3 slag system by using regularized extreme learning machine. Ironmak. Steelmak. 48, 275–283 (2021).
Xin, Z. C. et al. Mathematical modelling and plant trial on slagging regime in a ladle furnace for high-efficiency desulphurization. Ironmak. Steelmak. 48, 1123–1132 (2021).
Yang, X. M., Shi, C. B., Zhang, M., Chai, G. M. & Wang, F. A thermodynamic model of sulfur distribution ratio between CaO–SiO2–MgO–FeO–MnO–Al2O3 slags and molten steel during LF refining process based on the ion and molecule coexistence theory. Metall. Mater. Trans. B 42, 1150–1180 (2011).
Tian, Y. W., Zhai, X. J. & Liu, K. R. Physical Chemistry of Metallurgy (Metallurgical Industry Press, 2007).
Prausntiz, J. M., Lichtenthaler, R. N. & Azevedo, E. G. Molecular Thermodynamics of Fluid-Phase Equilibria 193–370 (Prentice-Hall Inc., Englewood Cliffs, 1986).
Chou, K. C. New generation solution geometrical model and its further development. Acta Metall. Sin. 33, 126–130 (1997).
Zhang, J. Computational Thermodynamics of Metallurgical Melts (Metallurgical Industry Press, 1998).
Chang, Z. Y. et al. Effect of Al2O3 on viscosity of low alumina slags of Jiusteel and thermodynamic analysis. China Metall. 28, 6–9 (2018).
Tang, G. Z., Li, J. G., Zeng, Y. N. & Zhao, L. N. Thermodynamic activity of components in CaO–MgO–Al2O3–SiO2 refining slag system. Iron Steel Vanadium Titanium 37, 127–132 (2016).
Guo, Y. C., Zheng, H. Y., Hu, X. G. & Shen, F. M. Prediction model of Al2O3 activity in CaO–SiO2–Al2O3–MgO quaternary slag system. J. Northeast. Univ. (Nat. Sci.) 42, 652–657 (2021).
Wu, L., Jiang, Z. H., Gong, W. & Li, Y. GA-NN-based predicting model of activity of multiple slag system. J. Northeast. Univ. (Nat. Sci.) 29, 1725–1728 (2008).
Xin, Z. C. et al. Predicting temperature of molten steel in LF-refining process using IF-ZCA-DNN model. Metall. Mater. Trans. B 54, 1181–1194 (2023).
Guo, B. H. ECG Identification Based on Gradient Enhancement Machine Learning Algorithm (Jilin University, Changchun, 2020).
Jin, C. Y., Yu, J. Q., Wang, Q. & Chen, L. J. Prediction of blasting fragment large block percentage ratio based on ensemble learning CatBoost model. J. Northeast. Univ. (Nat. Sci.) 44, 1743–1750 (2023).
Gu, C. Y., Xu, X. Y., Wang, M. Y. & Yan, Z. CatBoost algorithm based fault diagnosis method for photovoltaic arrays. Autom. Electr. Power Syst. 47, 105–114 (2023).
Lu, N. N., Yu, J. K., Su, C. & Wang, H. Z. Activity calculation for the components in CaO–Al2O3 and CaO–SiO2–Al2O3 slags. J. Northeast. Univ. Nat. Sci. 34, 1743–1746 (2013).
Qu, Z. D., Xie, Y., Meng, X. L., Xu, J. F. & Wang, K. P. Evolution rules of inclusions in high quality bearing steel produced by BOF-LF-RH-CC and EAF-LF-VD-CC process. Steelmaking 36, 76–80 (2020).
Zhao, B., Qiao, T., Wu, W. & Zhi, J. G. Thermodynamic calculation of activity for CaO CaF2–SiO2–Al2O3 quaternary slag system in crankshaft steel. China Metall. 32, 49–57 (2022).
Huang, Z. Q., Yang, Z. P. & Di, L. M. Effect of MgO in blast furnace type slags containing TiO2 on the activity of CaO. J. Northeast. Univ. Technol. 4, 426–430 (1987).
Yu, J. Y. Activity Model and Its Applications of CaO–SiO2–MgO–Al2O3 Quaternary Slag System. (Northeastern University, 2016).
Xu, H., Wang, J., Wu, L. S. & Dong, Y. C. Experiment study on desulfurization of low-fluoride LF refining slag for aluminum killed steel. Metall. Eng. 2, 42–48 (2015).
Li, M. et al. Formation and controlling of Type-D inclusions in bearing steel. Chin. J. Eng. 40, 31–35 (2018).
Wu, K., Liang, Z. G., Zhang, E. H. & Li, H. M. Research on the slag-metal sulfur partition and the kinetics equation of desulfurization in LF refining process. ACTA Metall. Sin. 37, 1069–1072 (2001).
Li, D. J., Xu, M. C., Li, X. W., Liu, X. & Yu, F. Z. Study of affecting desulfurization factors in 170–190 t LF refining process. Special Steel 39, 32–35 (2018).
Kume, K., Morita, K., Miki, T. & Sano, N. Activity measurement of CaO–SiO2–AlO1.5–MgO slags equilibrated with molten silicon alloys. ISIJ Int. 40, 561–566 (2000).
Wen, Q. L. et al. Activity of CaO in CaO–SiO2–Al2O3–MgO slags. ISIJ Int. 58, 792–798 (2018).
Mirjalili, S. & Lewis, A. The whale optimization algorithm. Adv. Eng. Softw. 95, 51–67 (2016).
Dai, C. Y., Ma, L. J., Jiang, H. C. & Li, H. S. An improved whale optimization algorithmbased on multiple strategies. Comput. Eng. Sci. 46, 1635–1647 (2024).
Zhou, G. L. & Zhou, F. Sea surface temperature prediction method based on an IWOA optimized Res-BiGRU deep learning model. Mar. Environ. Sci. 43, 806–816 (2024).
Cai, Y., Yuan, Y. & Zhou, A. H. Predictive slope stability early warning model based on CatBoost. Sci. Rep. 14, 25727 (2024).
Dorogush, A. V., Ershov, V. & Gulin, A. CatBoost: Gradient boosting with categorical features support. preprint at arXiv, 181011363 (2018). https://doi.org/10.48550/arXiv.1810.11363
Xin, Z. C. et al. Explainable machine learning model for predicting molten steel temperature in the LF refining process. Int. J. Miner. Metall. Mater. 31, 2657–2669 (2024).
CatBoost, Parameter tuning. https://catboost.ai/docs/en/concepts/parameter-tuning (Accessed: 2025–01–20)
Li, S. et al. Construction and application of activity models for CaO–SiO2–Al2O3–TiO2 slag system. Chin. J. Rare Metals 44, 540–546 (2020).
Yu, Z. B., Zang, X. M., Yang, J., Li, S. S. & Kong, L. Z. Method for determining activity of components in slag. J. Univ. Sci. Technol. Liaoning 47, 7–15 (2024).
Li, S. L., Zeng, Q. S., Feng, D. Y. & Xia, G. J. Time-optimal trajectory optimization of collaborative manipulator based on improved genetic algorithm. Autom. Instrum. 39, 60–65 (2024).
Qin, F. M., Zhong, Y. H. & Chen, Z. Robust optimization study on active distribution network based on beta distribution minimum confidence interval. Guangxi Electr. Power 47, 8–15 (2024).
Acknowledgements
This project is funded by the National Natural Science Foundation of China, under Grant Number 52374321 and 51974023, the funding of State Key Laboratory of Advanced Metallurgy, University of Science and Technology Beijing, under Grant Number 41621005, and the Youth Science and Technology Innovation Fund of Jianlong Group-University of Science and Technology Beijing, under Grant Number 20231235.
Author information
Authors and Affiliations
Contributions
Zicheng Xin, Qing Liu: Conceptualization, Project Administration, Supervision, Methodology, Funding Acquisition; Jiangshan Zhang: Data Curation, Formal Analysis, Validation; Zicheng Xin, Qing Liu: Writing—original draft, Writing—review and editing. All authors have read and agreed to the published version of the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Xin, Zc., Zhang, Js. & Liu, Q. Predicting CaO activity in multiple slag system using improved whale optimization algorithm and categorical boosting. Sci Rep 15, 9533 (2025). https://doi.org/10.1038/s41598-025-93980-9
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-93980-9







