Abstract
The growing interest in sustainable construction materials has prompted the investigation of alternative resources and sophisticated predictive techniques to enhance material performance. Waste foundry sand (WFS), a secondary product resulting from the metal casting procedure, present a viable alternative to natural aggregates, while the cement strength class (CSC) plays a crucial role in determining the properties of mortar. Although considerable research has been conducted on these elements separately, their combined influence on the compressive strength of mortar has not been thoroughly examined. This study aims to explore the interactive effects of varying percentages of WFS and different CSCs on the compressive strength of cement mortar, utilizing Gene Expression Programming (GEP), a cutting-edge machine learning approach. Compared to Artificial Neural Network (ANN) and other Machine Learning (ML) models, GEP offers enhanced transparency and robust predictive accuracy, making it more suitable for data-driven decision-making in sustainable construction. A comprehensive experimental dataset was created by varying WFS percentages (0%, 10%, 20%, 30%, 40% and 50%) and CSCs (32.5, 42.5, 52.5 MPa). The mix designs were evaluated under two conditions: random and sorted data modes, both with and without CSC as an input variable. GEP models were constructed to forecast compressive strength, incorporating WFS percentage, sand/cement ratio (S/C), water/cement ratio (W/C), and CSC as primary inputs. The addition of CSC as an input variable significantly improved predictive accuracy, achieving a high correlation coefficient (R = 0.99) and a low root mean square error (RMSE = 2.3). The results underscore the necessity of considering both WFS and CSC in tandem within predictive models to effectively optimize mortar mix designs. By merging sustainable materials with advanced modeling methodologies, this research aids in resource conservation and the creation of high-performance, eco-friendly construction materials. The study provides a solid framework for engineers and researchers to advance material design and sustainability within the construction sector.
Introduction
The construction industry’s environmental impact has spurred research into sustainable practices, focusing on integrating industrial byproducts like waste foundry sand (WFS) into construction materials. While WFS shows promise as a natural sand alternative, its application in cement-based materials is hindered by composition variability concerns. Despite extensive research on WFS and cement strength class (CSC) individually, their combined effect on compressive strength remains underexplored1,2. Traditional empirical models often fail to capture the complex interactions among material components in predicting compressive strength. To overcome this limitation, the present study utilizes Gene Expression Programming (GEP), an advanced machine learning technique, to develop a predictive model based on varying levels of waste foundry sand (WFS) and cement strength classes (CSC). This approach supports the optimization of mortar mix designs and promotes sustainable construction through resource efficiency and waste valorization. The study also examines the life cycle of WFS, which originates from silica sand used in metal casting. After repeated use in molds, the sand loses its functional properties and becomes a byproduct, contributing to environmental challenges due to landfill disposal. Through washing, sieving, and chemical treatment, WFS can be processed for reuse. Once treated, it serves as an effective replacement for natural aggregates in cementitious materials, enhancing sustainability in construction applications.
The incorporation of WFS into structural applications promotes waste valorization, conserves natural resources, and aligns with circular economy principles by reducing reliance on virgin materials. Growing demand for sustainable, high-performance construction materials has driven research toward innovative mix design optimization and advanced predictive modeling methods3. Compressive strength of cement mortar4 is significantly influenced by key factors such as the strength class of cement (CSC)5 and alternative aggregate materials6, waste foundry sand (WFS)7, making them critical elements in the research of construction materials. While prior studies have looked at the individual impacts of CSC, WFS, and their predictive modeling through methods like gene expression programming (GEP)8, there has been no research that examines their combined effect on mortar compressive strength. This gap highlights the significance of the current study, which aims to explore the simultaneous influence of WFS and CSC on the strength of cement mortar using GEP. The objective of this research is to develop a more comprehensive predictive model, thereby enhancing material selection and performance optimization for sustainable construction practices.
In a study, Reis et al.9 studied the effect of different cements in concrete on the environment. The studies showed that different energy is consumed to produce cement with different grades, and the higher the grade of cement, the finer the grains and the more electrical energy is consumed. However, given that better hydration occurs and fewer polluting gases are produced, using cements with higher softness is more beneficial and economical for the environment. Kazemi et al.10 performed an investigation to evaluate the impact of CSC on the compressive strength of cement mortar. Their experimental framework incorporated six different W/C (0.5, 0.45, 0.4, 0.35, 0.3, and 0.25), three S/C (3, 2.75, and 2.5), and three distinct CSCs (52.5, 42.5, and 32.5 MPa). The findings revealed that the ideal mix design is contingent upon the CSC. For example, the 28 days compressive strength of specimens utilizing 32.5 MPa cement and a W/C ratio of 0.25 reached its maximum at an S/C ratio of 2.75, while for 52.5 MPa cement with the same W/C ratio, the peak strength was observed at an S/C ratio of 2.5. In a related study, Ghaemifard et al.11 explored the effects of freeze–thaw cycles (200, 150, 100, 50, and 0 cycles) on mortar samples composed of various CSCs. Their results demonstrated that samples containing 52.5 MPa cement experienced a slower rate of strength degradation during freeze–thaw cycles compared to those with 42.5 and 32.5 MPa cement. Specifically, after 50 and 100 freeze–thaw cycles, the strength reduction was markedly less for samples made with 52.5 MPa cement than for those with lower-strength classes. These investigations highlight the critical need to comprehend the interactions between CSC, mix design, and environmental factors to enhance the performance of cement mortar.
The influence of waste materials on the compressive strength of concrete and mortar has been extensively researched, with numerous scholars investigating their viability as sustainable substitutes for natural aggregates12,13,14,15,16. A research investigation carried out by Prabo et al.17, it was noted that substituting up to 20% of natural aggregate with waste fines did not significantly affect the strength of concrete; however, exceeding this threshold resulted in diminished performance. Likewise, Arumathi et al.18 explored the replacement of natural aggregate with waste aggregate and discovered that at W/C of 0.4, a 30% substitution of natural aggregate resulted in only a 1.9% a reduction in compressive strength compared to samples composed entirely of natural aggregate. At a W/C ratio of 0.5, this reduction increased to 6.1%. These findings suggest that waste aggregate can serve as an effective alternative to natural aggregate, thereby aiding in resource conservation. In a separate investigation, Çevik et al.19 An analysis of cement mortar with varying ratios of waste fine aggregate revealed that replacing more than 15% of natural aggregate significantly reduces compressive strength compared to mixes with only natural aggregate. These findings highlight the potential of incorporating waste materials into cementitious composites to improve sustainability, provided replacement levels are carefully managed to maintain structural performance.
Although many studies have been conducted focusing on manufacturing samples with different cements20,21, aggregates22,23,24, and additives25,26,27,28,29,30,31,32, studies have shown that the process of constructing and curing cement-based samples imposes a lot of cost and time on the projects33. Therefore, to solve these problems, researchers were looking for alternative solutions to achieve the desired results34. Since the results obtained from the production of mortar and concrete samples depend on several input parameters35,36, including the water/cement ratio37,38, sand/cement ratio39,40, type of cement41, type of aggregate42,43, and additives44,45,46, the solution provided must be nonlinear47,48, simplified, and practical49. Therefore, the presentation of meta-heuristic algorithms became important50,51.
In recent years, prediction methods52,53 such as fuzzy logic (FL)54,55, artificial neural networks (ANN)56,57, multiple linear regression (MLR)58,59, multi expression programming (MEP)60,61, genetic programming (GP)62,63,64,65,66,67 genetic algorithm (GA)68,69, gene expression programming (GEP)70,71 have been among the most widely used methods to reduce time and cost while maintaining the accuracy of results. GEP method for forecasting the compressive strength of cement-based materials has been investigated in numerous studies, showcasing its efficacy as a predictive instrument. Mahdinia et al.72 utilized GEP methodologies to model the compressive strength of cement mortar, emphasizing the critical influence of software parameters, such as the linking function and the number of inputs, on improving prediction accuracy. In a similar vein, Iqbal et al.73 carried out a research study focused on the prediction of the mechanical properties of green concrete that incorporates WFS, employing GEP. In this investigation, they applied the GEP technique to estimate the compressive strength of the concrete. Their research was based on a dataset comprising 234 concrete mix designs obtained from earlier studies. The GEP model incorporated four input variables: the (W/C), (%WFS), (%WFS/C), and the fineness modulus. The resulting predictive equation exhibited a high degree of accuracy, attaining a correlation coefficient (R) of 0.85 and a minimal root mean square error (RMSE) of 4. These results highlight the promise of GEP as a dependable modeling technique for the purpose of forecasting mechanical characteristics in the context of sustainable construction. In a study, Behnood et al.74 investigated the effect of WFS on concrete properties. In this study, 234 compressive strength data, 163 flexural strength data, and 85 elastic modulus data were collected from various articles. The parameters of (W/C) ratio, (%WFS), (%WFS/C), and modulus of elasticity of WFS were used as inputs of GEP, and the parameters of modulus of elasticity, compressive and flexural strength were the outputs of this model. The results show that the presented GEP model has a high R2, so it is an accurate model that can be used for future research.
Extensive research has been conducted on waste foundry sand (WFS) and cement strength class (CSC) separately; however, there is a notable lack of understanding regarding their joint impact on the compressive strength of cement mortar, particularly when utilizing advanced predictive methodologies such as gene expression programming (GEP). This study aims to explore the simultaneous effects of varying proportions of WFS and different CSCs on the compressive strength of cement mortar samples. The research involves the preparation of mix designs in two distinct approaches: random and sorted. The datasets are evaluated under two conditions—one that incorporates CSC as an input variable and another that omits it—to examine the influence of including CSC in the GEP model. By concurrently introducing WFS percentage and CSC as input parameters, the study seeks to ascertain their collective contribution to predictive accuracy. The anticipated results are expected to yield significant insights into the interaction between these variables, thereby facilitating the creation of more precise predictive models and informing sustainable and optimized mortar mix designs for practical use.
Materials, preparation of specimens and curing
In this study, 36 mixing designs comprising three (CSCs) with strengths of 52.5, 42.5, and 32.5 MPa, a single (S/C) proportion of 2.75, six different levels of (WFS) at 50, 40, 30, 20, 10, and 0 percent, and two (W/C) ratios of 0.5 and 0.4 were formulated as delineated in Table 1. (A total of 1,260 specimens were prepared for 36 mix designs. For each mix, five specimens were tested at seven different curing ages: 3, 7, 14, 21, 28, 56, and 91 days.)
The dosage of (HRWR) in the assorted mixtures was adequate to achieve a flow value of 110 ± 5 within 25 droplets on the flow table75. The compacted cubes measuring 160 \(\times\) 40 \(\times\) 40 mm undergo tamping in a two-layer process for each composition, following the guidelines of ASTM C348-0276.
The characteristics of the cement utilized in this investigation are detailed in Tables 2 and 3, encompassing both chemical and physical attributes.
In alignment with ASTM C-30577, the preparation of the cement paste commenced with the mixing of potable water, maintained at a temperature of 20 ± 2 °C, with different types of cementitious materials. Subsequently, this cement paste was combined with fine aggregates sourced from Mashhad, as illustrated in Fig. 1, along with waste foundry sand obtained from the Mubarake steel facility in Isfahan. The characteristics of these materials are detailed in Table 4.
The particle size distribution of aggregates is depicted in Fig. 2. Subsequently, the cementitious mortar was introduced into the molds having the conventional cubic dimensions of 160 \(\times\) 40 \(\times\) 40 mm and demolded post a duration of 24 h. Subsequently, all samples were placed in a water container set at a specified temperature for the process of curing.
The distribution of particle sizes in fine aggregates and waste foundry sand (ASTM C33)78.
The ultimate compressive strength of the samples was assessed after curing for 3, 7, 14, 21, 28, 56 and 91 days. In order to enhance both the compressive strength and workability, a high range water reducing (HRWR) agent, characterized by its distinct carboxylic ether polymer composition, was employed.
Figure 3 illustrates the process of sample manufacturing and testing, with each step clearly illustrated. Initially, the materials are combined to create the cement mortar, as depicted in the first step. Subsequently, in the second step, the prepared sample is positioned within the flow table test apparatus. The third step presents the workability results obtained from this test. The fourth step displays the molds utilized in the sample production, followed by the fifth step, where the samples are submerged in a water tank until they reach the appropriate age for testing. Steps six and seven illustrate the testing of the samples, and step eight presents the sample following the completion of the compressive strength test.
Figure 4 presents six samples derived from three distinct mixing designs, displayed both before and after cement mortar compressive strength testing. The designations of these samples are referenced in Table 1, corresponding to mixing designs 6, 18, and 30, which underwent testing at 28 days of age. The sole variable among the samples is the cement strength class parameter. Notably, mixing design 6, utilizing CSC325MPa, exhibits a lower compressive strength compared to the other samples. In contrast, mixing design 30, which incorporates CSC525MPa, demonstrates the highest compressive strength and exhibits minimal damage among the three samples.
The variables for both input and output of the experimental samples are presented in Table 5.
Figure 5 demonstrates how the compressive strength of concrete (Fc) is influenced by the percentage of waste foundry sand (WFS), the cementitious strength component (CSC) and the ratio of water to cement (W/C). The color gradient visually represents changes in Fc, where red regions indicate higher compressive strength values, while blue and green shades represent lower values. The trend suggests that increasing CSC leads to an increase in Fc, particularly at lower WFS values. However, at higher WFS percentages, Fc declines, indicating that excessive WFS may negatively impact strength. This observation highlights the importance of optimizing the WFS content to balance sustainability and mechanical performance in concrete mixtures.
The Pearson correlation matrix (Table 6) reveals key relationships among concrete mix components and compressive strength (Fc). The strongest positive correlation with Fc is curing age (r = 0.54579, p < 0.0001), confirming that longer curing improves strength, followed by cement class (CSC, r = 0.46572, p < 0.0001) and sand (S, r = 0.36025, p < 0.0001), indicating their contribution to strength. Conversely, HRWR (r = −0.50857, p < 0.0001), waste fine sand (WFS, r = −0.36025, p < 0.0001), and water-to-cement ratio (W/C, r = −0.22062, p = 0.000418) show negative correlations, suggesting excessive water content, HRWR dosage, and fine particles reduce strength. Water (W) and W/C exhibit a perfect inverse correlation (r = − 1), as expected in mix design, while sand (S) and WFS are also perfectly negatively correlated (r = − 1), confirming their trade-off in the mixture. The strong correlation between HRWR and WFS (r = 0.91791, p < 0.0001) suggests HRWR is often used to counteract workability issues caused by WFS. Overall, cement content, sand, and curing time enhance strength, while excessive admixtures and water content weaken it, aligning with concrete mix design principles.
Figure 6 illustrates the Pearson correlation matrix, highlighting the relationships among key mix components and their influence on the compressive strength of concrete, where positive and negative correlations are visually represented by red and blue hues, respectively. The heatmap visually represents the relationships among key parameters affecting compressive strength (Fc). Age (r = 0.54579) and cement content (CSC, r = 0.46572) exhibit strong positive correlations with Fc, confirming that longer curing and higher cement content enhance concrete strength. Sand (S, r = 0.36025) also contributes positively, indicating its role in structural integrity. Conversely, HRWR (r = − 0.50857), waste fine sand (WFS, r = − 0.36025), and water-to-cement ratio (W/C, r = − 0.22062) negatively impact Fc, suggesting that excessive admixtures, fine aggregates, and water content weaken the mixture. The inverse correlation between water (W) and W/C (r = − 1), as well as between sand (S) and WFS (r = − 1), highlights their trade-offs in mix design. Additionally, HRWR and WFS (r = 0.91791) show a strong positive correlation, indicating that HRWR is often added to counterbalance workability issues introduced by WFS. Overall, the heatmap underscores the importance of optimizing mix proportions to maximize compressive strength while minimizing detrimental effects from excessive additives and water content.
Figure 7 provides a comprehensive visualization of the relationships between different concrete mix parameters and their impact on compressive strength (Fc). The pair plot provides a comprehensive visualization of the relationships between different concrete mix parameters, including CSC, W, S, HRWR, W/C, WFS, Age and Compressive Strength (Fc). The diagonal elements of the plot display the distribution of each individual variable, revealing their spread and potential skewness. Off-diagonal scatter plots illustrate the pairwise relationships between variables, where trends and correlations can be visually assessed. For instance, a noticeable negative correlation is observed between HRWR and W/C, indicating that as the high-range water reducer increases, the water-to-cement ratio decreases. Similarly, there is a visible trend between Age and Fc, suggesting that concrete strength increases over time. Additionally, the plot helps identify potential non-linear relationships and clusters within the data, offering valuable insights for further statistical analysis and modeling.
The distribution of these variables is illustrated through a frequency histogram in Fig. 8. This figure presents histograms of key variables influencing mortar compressive strength (Fc), including CSC compressive strength (MPa), water/cement ratio (W/C), percentage of WFS, high-range water reducer (HRWR) dosage (ml), curing age (days), and Fc (MPa). Each subplot shows frequency (green bars) and cumulative percentage (orange line) distributions to illustrate the range and distribution of these parameters in the dataset. The discussion will highlight key observations, such as the narrow range of CSC (32.5–52.5 MPa) and W/C (0.4–0.5), indicating controlled mix design, while WFS (0–50%) and age (3–91 days) show wider variability, reflecting diverse experimental conditions. The Fc distribution (0–60 MPa) will be linked to these factors, suggesting their combined influence on strength development.
Evolutionary methodologies: GA, GP and GEP
Gene expression programming (GEP) represents a modern advancement of Genetic programming (GP) and was introduced by Ferreira79. Broadly speaking, the procedural stages of GEP bear resemblance to those of GP80. Both methodologies, incorporating genetic algorithm concepts, adhere to the principles of natural selection based on Darwinian Theory81. The fundamental procedures of the genetic methodology are articulated as outlined82.
-
(1)
The input variables and parameters of the genetic algorithm are essential components to consider.
-
(2)
The creation of an initial population of individuals as parents sets the foundation for the next generation.
-
(3)
Providing a fitness value to each initial individual is a vital phase in the procedure.
-
(4)
The application of genetic operations is necessary for the population of the current generation to evolve.
-
(a)
During the crossover operation, the genetic algorithm alters the strings of two parents, resulting in the generation of two offspring. This process involves genetically recombining substrings at a randomly selected crossover point. The offspring generated are then placed in the population based on the minimum fitness criterion.
-
(b)
The operation of mutation involves the random alteration of a string belonging to a pair of selected parents with an arbitrary string. Subsequently, the resultant new offspring is introduced into the population in place of one of the selected parents.
-
(a)
-
(5)
Conducting step 4 is imperative for a sufficient number of generations to meet the stipulated termination criteria.
Individuals within Genetic Programming (GP) are typically denoted by parse trees within List Processing (LisP) programs. In GEP, after the segment linked to parse tree encoding (Head), the vacant cells are filled in the process of generation (Tail). An illustration of a chromosome, along with its corresponding LisP code and mathematical representation, is depicted in Fig. 9. The decoding of chromosomes into expression trees (ETs) is accomplished through the utilization of rules that also play a role in the interaction among ETs. These rules constitute the two primary parameters for GEP languages, where the language of the gene and ETs are distinct languages employed within this framework. The interplay between the phenotype and the gene sequence contributes to the further proliferation of GEP within the context of the Karva language83. To produce offspring, one must select two parents based on the strength of their mortar and exchange their chosen branches. The genetic programming crossover operation, as demonstrated in the example, bears a striking resemblance to the techniques of pruning and grafting observed in arboriculture. Its structure exhibits conformity with LisP programs84.
An illustration of genetic programming crossover can be found in Fig. 10, depicting the identification of branches to be altered.
The mutate operator functions by selecting a node within the parse tree and modifying the element of said node either by introducing a different element accidentally or by substituting it with a randomly generated branch. Figure 11 provides an example of mutation, showcasing the random alteration of elements. In addition, the process of gene substitution in GEP is illustrated in Fig. 12.
Development of the prediction model
In this research, to obtain the appropriate model, 4 GEP models have been implemented. In all these models, the GEP settings are the same, including (Function set, Genetic operators, RNC, and Numerical Constant) and their linking function is the addition function (Table 7).
Both models of the four implemented models are different from each other. For example, GEP1 and GEP2 have 5 input parameters (WFS, W/C, S/C, HRWR, and age), and GEP3 and GEP4 are implemented with 6 input parameters (5 parameters in GEP1 and GEP2 plus the influential parameter of the CSC) (Table 8). It is observed that in GEP1 and GEP3, random data distribution (RDD) is used while in GEP2 and GEP4, sorted data distribution (SDD) is utilized. Also, the difference between models 1 and 2 is in the way of calling the inputs to the GEP. In GEP1, all 252 mixing designs are given to the GEP, and the GEP randomly selects 200 numbers as the train and 52 as the test. It is considered, but in GEP2, before the mixing designs are entered into the GEP, we separate the 200 mixing designs as a train and the remaining 52 mixing plans as a test.
Regarding calling the input data, GEP1 and GEP3 are similar to each other, as well as GEP2 and GEP4. In all models, 80% of the data is assigned to the train and the remaining 20% to the test. The stiffness functions used in both train and test phases are r-square \({(R}^{2})\), root mean squared error \((RMSE)\), mean absolute percentage error \((MAPE)\), relative absolute error \((RAE)\) performance index \((PI)\), relative root mean square error \((RRMSE)\) and correlation coefficient \((R)\) as following Eqs. (1–7). In these equations n is the total number of data Pi are predicted values and Ai are actual values for ith data of n (Fig. 13).
Analysis and insights
Experimental observations
Figures 14 and 15 shows the variation of compressive strength of cement mortar samples based on different percentages of WFS with two W/C ratios of 0.4 and 0.5, respectively. Each shape includes 3 parts: a) CSC 32.5 MPa b) CSC 42.5 MPa and c) CSC 52.5 MPa, in each part of each shape different ages of the samples are also shown. As expected, at a fixed WFS percentage, age, W/C ratio and CSC are the main factors affecting the decrease and increase of compressive strength of cement mortar. However, the optimal WFS ratio is different for different mix designs. For example, the optimal amount of WFS for mortar with CSC 32.5 is 20%, which is the same in different W/C ratio while the optimal amount of WFS for cement 52.5 MPa and 42.5 MPa occurs at 10% and 0%, respectively. This result means that in both Figs. 13 and 14, the trend of the graph is upward until reaching the optimal point, and the compressive strength of the samples at all ages increases until reaching the optimal point, and after the introduced optimal point, which is different in the sections of each graph, the trend of the graph is downward and all the compressive strength of the samples decreases. In general, the highest compressive strength value of cement mortar occurs at the age of 91 days with a strength class of 52.5 MPa and 0 WFS percentage and a W/C ratio of 0.4, while the lowest compressive strength value of cement mortar occurs at the age of 3 days with 32.5 MPa cement and percentage the WFS is 50% and the water/cement ratio is 0.5.
Figure 15 shows the graphs of the compressive strength ratio of samples with different cements at different percentages of WFS. This figure consists of two parts (a) and (b), where part (a) is for samples made with W/C = 0.4 and part (b) is for samples made with W/C = 0.5. Each of the graphs in part (a) and (b) consists of 3 curves, one of which shows the compressive strength ratio of samples made with CSC 425 MPa to samples made with CSC 325 MPa. The other curve is for the compressive strength ratio of samples made with CSC 525 MPa to samples made with CSC 325 MPa and similarly the last curve is for the compressive strength ratio of samples made with CSC 525 MPa to samples made with 425 MPa. These ratios shown for each point of the graph are incremental ratios. For example, in section a, at the point where the ratio of 0.2 is reported for the CSC 525 MPa to CSC 425 MPa curve, it means that WFS is 0%, the increase in compressive strength of the sample made with CSC 525 MPa is 1.2 times that of the same sample made with CSC 425 MPa. Similarly, in section (b), this figure is also for W/C = 0.5, and all the cases reported for section (a) are also valid for section (b).
Predictive modeling outcomes and analysis
Figures 16, 17, 18 and 19 shows the experimental and prediction results of all 252 cement mortar mixing designs. In each of the Figs, parts a, c and e include the results related to the train and parts b, d and f include the results related to the test. By examining parts a and b in Figs. 16 and 17, as well as parts a and b in Figs. 18 and 19, it shows the importance of the model of calling the input data to the GEP in the condition that the number of inputs and all the settings of the GEP are fixed, to give by comparing Figs. 16 and 18 as well as 17 and 19, it can be seen that the simultaneous effect of CSC and %WFS on the prediction of the compressive strength of cement mortar, that by examining these figures, the importance of the correct recall model as well as the effect of cement strength class in cement mortar samples containing waste foundry sand is determined. After comparing Figs. 16, 17, 18 and 19, we conclude that GEP4 with SDD call model and 6 input data including percentage of WFS, CSC (MPa), S/C ratio, W/C ratio, HRWR (ml) and age of sample (day) is the best model introduced with R2 = 0.98, which indicates the good performance of GEP4.
The correlation of the experimental and predicted Fc values for GEP2 model: (a) shows train R2, (b) shows test R2, (c) shows train prediction, (d) shows test prediction, (e) shows train ratio of Fc, (f) shows test ratio of Fc (g) train values of residual versus predicted Fc and (h) test values of residual versus predicted Fc.
The correlation of the experimental and predicted Fc values for GEP2 model: (a) shows train R2, (b) shows test R2, (c) shows train prediction, (d) shows test prediction, (e) shows train ratio of Fc, (f) shows test ratio of Fc (g) train values of residual versus predicted Fc and (h) test values of residual versus predicted Fc.
The correlation of the experimental and predicted Fc values for GEP3 model: (a) shows train R2, (b) shows test R2, (c) shows train prediction, (d) shows test prediction, (e) shows train ratio of Fc, (f) shows test ratio of Fc (g) train values of residual versus predicted Fc and (h) test values of residual versus predicted Fc.
The correlation of the experimental and predicted Fc values for GEP4 model: (a) shows train R2, (b) shows test R2, (c) shows train prediction, (d) shows test prediction, (e) shows train ratio of Fc, (f) shows test ratio of Fc, (g) train values of residual versus predicted Fc and (h) test values of residual versus predicted Fc.
The comparison of \({R}^{2}\), \(RMSE, MAPE\), \(RAE\), \(PI\), \(RRMSE\) and \(R\) for all GEP modelling are shown in Table 9 and for GEP4, an expression tree is also presented in Fig. 20 and Eq. 9.
Figure 21 presents the box plot analysis of the residuals for four Gene Expression Programming (GEP) models, comparing their performance on (a) training data and (b) test data. The vertical axis represents the residual values (difference between predicted and actual values), while the horizontal axis lists the different GEP models. The red diamonds indicate the mean residual along with ± 1 standard deviation (SD), while the blue vertical bars represent the mean ± 1.96 SD, capturing approximately 95% of the data distribution. For the training data (Fig. 21a), GEP models exhibit different levels of dispersion in residuals. GEP1 and GEP2 show larger variability compared to GEP3 and GEP4, indicating potential overfitting or underfitting issues. GEP3 and GEP4 display more compact residual distributions, suggesting relatively stable predictions. For the test data (Fig. 21b), a similar pattern is observed. The variability in residuals is more pronounced in GEP1 and GEP2, whereas GEP3 and GEP4 maintain tighter distributions, implying better generalization performance. A comparison between training and test residuals suggests that GEP3 and GEP4 might be better candidates for predictive modeling due to their lower residual spread, while GEP1 and GEP2 may require further refinement to improve their accuracy and robustness.
The error distribution of the four Gene Expression Programming (GEP) models is analyzed using histograms, as illustrated in Fig. 22. This figure presents the residuals (error values) for both the training and test datasets across different models. Subfigures (a)–(d) show individual histograms for each GEP model, while subfigure (e) provides a combined view of all models, allowing for direct comparison. The histograms in Fig. 22 reveal key characteristics of the model errors. The error distributions for both training and test datasets generally resemble a normal distribution centered around zero, though slight asymmetries and long tails suggest the presence of outliers or systematic bias in some models. Across all models, the training errors (blue bars) are more concentrated around zero compared to the test errors (red bars), indicating better performance on training data, while the wider spread of test errors suggests some degree of overfitting. Variations in error values show that GEP3 and GEP4 exhibit narrower distributions, implying better generalization and lower error variance, whereas GEP1 and GEP2 have broader distributions with larger residual values, indicating potential challenges in model accuracy. The combined histogram in subfigure (e) provides an overview of all models, confirming that most errors are concentrated near zero, though differences in spread highlight variations in model robustness.
Overall, models with smaller variance and a symmetric error distribution are generally preferred for predictive stability, making GEP3 and GEP4 the most reliable due to their balanced error distribution and lower spread, while GEP1 and GEP2 may require further optimization for improved predictive performance on test data. The best model presented is model GEP4, which has 4 genes. This model includes 6 input parameters (CSC, %WFS, W/C, S/C, age and HRWR), which are present in different genes to predict the compressive strength of cement mortar and are shown in the expression tree of Fig. 21 and Eq. (8). As is clear in Eq. (8), due to the importance of %WFS, CSC and age, these parameters are repeated more often in the genes, and the HRWR, which has less effect, is repeated less often in the expression tree and the presented equation. In Eq. (8), there are 4 functions that are connected to each other with a plus sign, these 4 functions represent 4 genes and the plus sign represents the same linking function. In the above equation, if the CSC, WFS, W/C, S/C, age and HRWR parameter are respectively entered in MPa, %, ratios, ratios, days, and ml the output of this equation will be the compressive strength of the cement mortar in MPa.
Figure 23a shows the frequency of compressive strength of different mixing designs after prediction in 6 intervals (0–10, 11–20, 21–30, 31–40, 41–50 and 51–60) and in Fig. 23b the comparison of the frequency of compressive strength of different mixing designs before and after prediction has been done. By examining part b and considering the very high data overlap, concluded that the presented model has a very high accuracy.
In Fig. 24, the Probability Density Functions (PDFs) illustrate the error distributions for the four Gene Expression Programming (GEP) models, distinguishing between training and test datasets.
The red curves represent either kernel density estimation (KDE) or normal distribution fits, providing insight into the spread and concentration of errors. GEP3 and GEP4 exhibit a more symmetric, narrow error distribution in both training and test datasets, indicating a more reliable predictive performance. In contrast, GEP1 and GEP2 display wider error spreads, particularly in the test sets, suggesting greater variability and potential overfitting. The higher standard deviation in test errors compared to training errors for GEP1 and GEP2 also implies a degradation in generalization capability. The analysis suggests that while all models capture trends within the dataset, GEP3 and GEP4 may offer superior generalization due to their more stable and centralized error distributions. Figure 25 illustrates the cumulative probability distribution of prediction errors for the four GEP models, comparing their performance on training and test datasets. In the training data (Fig. 25a), the error distributions of all models show a relatively smooth transition, with GEP3 (red) and GEP4 (green) having the steepest slopes, indicating lower error variance and more concentrated errors around zero. Conversely, in the test data (Fig. 25b), the distributions appear more spread out, particularly for GEP2 (blue) and GEP4 (green), suggesting that these models exhibit a wider range of error values and potential overfitting to the training data.
The shift in the curves between training and test sets highlights the generalization capability of each model, where GEP1 (black) and GEP3 (red) maintain relatively consistent distributions across both datasets, indicating better robustness. The deviations observed in GEP2 and GEP4 suggest higher sensitivity to unseen data, potentially leading to reduced predictive accuracy. This analysis confirms that while all models exhibit reasonable error distributions, GEP1 and GEP3 demonstrate better generalization performance, making them more reliable choices for predictive modeling. Figure 26 illustrates a Taylor Diagram, which evaluates the performance of four different GEP models (Train & Test) by comparing their standard deviation (STD), correlation coefficient (R), and RMSE against the experimental data (cyan point). The angular position represents the correlation coefficient, where points closer to the 0° axis indicate higher correlation with the reference data. The radial distance corresponds to the standard deviation, meaning models closer to the experimental data have a similar spread.
The distance from the experimental data point reflects the RMSE, with smaller distances indicating better predictive accuracy. Observing the diagram, GEP models exhibit varying levels of agreement with the reference data, where models positioned closer to the cyan point demonstrate superior performance. The test data points (red, blue, pink, and purple) are generally farther from the reference compared to train data points (black, brown, yellow, and cyan), indicating that some models may experience overfitting, leading to reduced generalization performance. Among the models, GEP1 and GEP3 (Train data) appear to have the closest agreement with the experimental data, suggesting a more reliable predictive capability.
Synergistic influence of CSC and WFS on strength prediction
This study examines the combined influence of various CSC and the incorporation of WFS on the predictive modeling of compressive strength in cement mortar. The analysis will utilize the initial three data sets from prior research conducted by Gi Ryu et al. (including 16 mix designs)85, BoYeol et al. (including 24 mix designs86 and Young Moon et al. (including 36 mix designs)87 collected and after that, the collected data was initially implemented by the model proposed by Iqbal with the five input parameters (WFS, W/C, S/C, HRWR and Age) and with all the functional and general settings related to this model (Fig. 27), and then it was implemented by the GEP4 model presented in this research (Fig. 28).
Figure 29 shows the prediction diagram of the compressive strength of concrete samples using ANN method, to compare the superiority of the over the ANN in Fig. 30, the data related to Fig. 29 has been using the model GEP4 presented in this prediction research. The comparison of Figs. 27 and 28 shows the better performance of GEP4 compared to the model presented by Iqbal, which shows the importance of the simultaneous effect of the CSC and WFS as an input parameter in the prediction model.
The correlation of the experimental and predicted Fc values for Silva collection 88 by GEP4 model: part (a) shows train R2, part (b) shows test R2, part (c) shows train prediction, part (d) shows test prediction, part (e) shows train ratio of Fc and part (f) shows test ratio of Fc.
Also, comparison of Fig. 30, in addition to showing the effect of the CSC parameter as a main input parameter, the superiority of the GEP method over the ANN is clearly seen. Since cement is one of the main materials used in designing cement mortar, not introducing it as an effective input parameter in the modeling can have a great negative effect on the prediction results, according to the results obtained from the graphs Above, it is recommended that in the research related to waste foundry sand, the simultaneous effect of the CSC and WFS should be used in the predictions.
The comparison of Figs. 28, 29, and 30 highlights the progressive improvement in predictive modeling for the compressive strength (Fc) of cement mortar when incorporating advanced AI-based models. Figure 28, representing the Iqbal model, shows the weakest performance, with train R2 = 0.7744 and test R2 = 0.7162, indicating moderate but unreliable predictions. The significant deviations between experimental and predicted values, along with large fluctuations in the ratio analysis, reveal high instability and poor generalization of this model. In contrast, Fig. 30, based on the Artificial Neural Network (ANN) model, improves prediction accuracy (R2 = 0.8802) and shows a stronger correlation between predicted and experimental values. However, some discrepancies remain, suggesting that ANN alone does not fully capture the complex interactions between input parameters. Finally, Fig. 30, showcasing the GEP4 model, achieves the highest prediction accuracy with train R2 = 0.9759 and test R2 = 0.9756, demonstrating excellent generalization, minimal error, and stable predictions. The consistent alignment between predicted and experimental values, along with a nearly constant Fc ratio, confirms the superiority of the GEP4 model over both ANN and Iqbal models. These findings emphasize the importance of incorporating comprehensive input parameters (CSC and WFS) and advanced hybrid AI models (GEP4) for accurate and reliable predictive modeling in cement mortar research.
Table 10 presents a numerical comparison of the predictive performance of the Iqbal empirical model, the Artificial Neural Network (ANN), and the optimized GEP4 model. As shown, the GEP4 model demonstrably excels in performance compared to the other models, achieving the highest coefficient of determination (R2 = 0.981) and the lowest root mean square error (RMSE = 1.827). These results demonstrate the superior generalization ability and accuracy of the GEP4 model in capturing the complex, nonlinear relationships between input parameters and compressive strength. In contrast, the Iqbal and ANN models exhibit lower predictive power, confirming the advantages of symbolic regression approaches like GEP in this application.
Conclusion
This study explored the influence of waste foundry sand (WFS) and cement strength class (CSC) on the compressive strength of cement mortar using Gene Expression Programming (GEP) as a predictive tool. The experimental results confirmed that both WFS content and CSC significantly affect mortar strength, with specific combinations yielding optimal performance. Incorporating CSC as a model input improved predictive accuracy, demonstrating the importance of material quality in data-driven mix design optimization. The integration of WFS, a sustainable industrial by-product, into mortar mixtures presents a viable path toward eco-efficient construction without compromising mechanical performance. Overall, the study offers a robust framework that combines experimental insight with machine learning to support sustainable material usage and enhance mortar mix design.
-
Model GEP4, considering the simultaneous effect of CSC and WFS as input parameters and random selection of data as software settings, provides the highest prediction performance.
-
If all input parameters and GEP setting are constant, a model whose input data is entered randomly has better performance.
-
Comparing the results of the model presented in this study with the proposed Silva ANN model shows the powerful role of the combined effect of the GEP model and the CSC input parameter.
-
Comparing the results of Iqbal’s proposed method with the method presented in this study shows the significant impact of GEP settings and selection of appropriate input parameters.
-
The effect of CSC in the present study, as well as in comparison with other prediction methods such as neural networks and data collected from other studies and changes made to GEP settings, indicates the direct effect of this parameter on the output results, and it seems that this input parameter should be considered in all research that depends on cement.
This study showed that WFS and CSC significantly affect mortar strength. Using GEP, it identified optimal mixes and improved prediction accuracy, supporting sustainable, high-performance mortar design. Future investigations could broaden this methodology to include other waste materials and examine their applicability across various structural and environmental contexts.
Data availability
The datasets used and/or analyzed during the current study available from the corresponding author on reasonable request.
References
Javed, M. F. et al. Comparative analysis of various machine learning algorithms to predict strength properties of sustainable green concrete containing waste foundry sand. Sci. Rep. 14(1), 14617 (2024).
Koçyiğit, Ş. Performance evaluation of geopolymer mortars containing waste ferrochrome slag and fly ash for sustainable green building. Sci. Rep. 14(1), 14606 (2024).
Sun, Q., Gupta, R., Zhu, Z., Sharma, A. & Qiang, S. A new mathematical model for elastic modulus prediction in mortar and concrete. J. Build. Eng. 111761 (2024).
Mahdinia, S., Eskandari-Naddaf, H. & Shadnia, R. Effect of main factors on fracture mode of mortar, a graphical study. Civil Eng. J. 3(10), 897–903 (2017).
Joshua, O. et al. Assessment of the utilization of different strength classes of cement in building constructions in Lagos, Nigeria. Int. J. Civil Eng. Technol. 8(9), 1221–1233 (2017).
Kumar, G. S. & Deoliya, R. Recycled cement and recycled fine aggregates as alternative resources of raw materials for sustainable cellular light weight flowable material. Constr. Build. Mater. 326, 126878 (2022).
Ashraf, M. et al. Exploring the rheological and mechanical properties of alkali activated mortar incorporating waste foundry sand: A comprehensive experimental and machine learning investigation. Results Eng. 24, 102973 (2024).
Alyousef, R. et al. Forecasting the strength characteristics of concrete incorporating waste foundry sand using advance machine algorithms including deep learning. Case Stud. Constr. Mater. 19, e02459 (2023).
Reis, D. C., Abrão, P. C., Sui, T. & John, V. M. Influence of cement strength class on environmental impact of concrete. Resour. Conserv. Recycl. 163, 105075 (2020).
Eskandari-Naddaf, H. & Kazemi, R. ANN prediction of cement mortar compressive strength, influence of cement strength class. Constr. Build. Mater. 138, 1–11 (2017).
Ghaemi‐Fard, M., Eskandari‐Naddaf, H. & Ebrahimi, G. R. Genetic prediction of cement mortar mechanical properties with different cement strength class after freezing and thawing cycles. Struct. Concr.
Ahmed, D. N., Naji, L. A., Faisal, A. A. H., Al-Ansari, N. & Naushad, M. Waste foundry sand/MgFe-layered double hydroxides composite material for efficient removal of Congo red dye from aqueous solution. Sci. Rep. 10(1), 2042 (2020).
Mydin, M. A. O. et al. Study on fresh and hardened state properties of eco-friendly foamed concrete incorporating waste soda-lime glass. Sci. Rep. 14(1), 18733 (2024).
Sathvik, S. et al. Analyzing the influence of manufactured sand and fly ash on concrete strength through experimental and machine learning methods. Sci. Rep. 15(1), 4978 (2025).
Hematabadi, H. et al. Structural performance of additive manufactured wood-sodium silicate composite beams for sustainable construction. Sustain. Struct. 4(3), 000054 (2024).
Mohamed, O. A., Zuaiter, H. A. & Jawa, M. M. Carbonation and chloride penetration resistance of sustainable structural concrete with alkali-activated and ordinary Portland cement binders: A critical. Sustain. Struct. 5(2), 000075 (2025).
Prabhu, G. G., Hyun, J. H. & Kim, Y. Y. Effects of foundry sand as a fine aggregate in concrete production. Constr. Build. Mater. 70, 514–521 (2014).
Sarumathi, K., Elavenil, S. & Vinoth, A. Use of waste foundry sand with multiscale modeling in concrete. Asian J. Civil Eng. 20, 163–170 (2019).
Çevik, S., Mutuk, T., Oktay, B. M. & Demirbaş, A. K. Mechanical and microstructural characterization of cement mortars prepared by waste foundry sand (WFS). J. Aust. Ceram. Soc. 53, 829–837 (2017).
Ghorbani, S., Naghizadeh, R., Ghasemi, E. & Rezaie, H. Calcium aluminate cement: A study on the effect of additives for dental applications. Adv. Cement Res. 1–32 (2024).
Shin, D., Hammond, J., Hennessey, H., Kumar, A., Cavanagh, M., Kerr, M. C., Papadopoulos, G., & Atoui, R. Intracardiac cement embolism following percutaneous vertebroplasty. Ann. Thoracic Surg. Short Rep. (2025).
Xu, Y., Ye, F., Xiong, B. & Demartino, C. Mortar with natural light-weight expanded vermiculite aggregate: Physical and mechanical properties. Constr. Build. Mater. 440, 137226 (2024).
Gowdhamramkarthik, P. & Kumar, G. A. Effects on structural light weight aggregate concrete cast using controlled permeable formwork liner. Constr. Build. Mater. 404, 133310 (2023).
Dabbaghi, F. et al. Experimental and numerical investigation on post-fire seismic performance of light weight aggregate reinforced concrete beams. Eng. Struct. 268, 114791 (2022).
Onuaguluchi, O., Mohamed, B., Adwan, A., Li, L. & Banthia, N. Sludge-derived biochar as an additive in cement mortar: Mechanical strength and life cycle assessment (LCA). Constr. Build. Mater. 425, 135959 (2024).
Dmitrieva, M., Puzatova, A., Leitsin, V., Kogai, A., Sokolnikova, S. & Kogai, V. The effect of thermal modified peat additive on cement mortar. Mater. Today Proc. (2023).
Kumpueng, P., Phutthimethakul, L. & Supakata, N. Production of Cement mortars from glass powder and municipal incinerated bottom ash. Sci. Rep. 14(1), 1569 (2024).
Tan, Y., Awang, H. & Kaus, N. M. Integration of fly ash and ground granulated blast furnace slag into palm oil fuel ash based geopolymer concrete: A review. Sustain. Struct. 4(2), 000050 (2024).
Hafez, R., Ftah, R. & Abdelsamie, K. The influence of nucleus dates waste and ceramic wastes in sustainable concrete. Sustain. Struct. (2024).
Chowdhurya, J. A., Islama, M. S., Islama, M. A., Al Bari, M. A. & Debnatha, A. K. Analysis of mechanical properties of fly ash and boiler slag integrated geopolymer composites. Sustain. Struct. 5(2), 000073 (2025).
Asif, U. & Javed, M. F. Optimizing plastic waste inclusion in paver blocks: Balancing performance, environmental impact, and cost through LCA and economic analysis. J. Clean. Prod. 478, 143901 (2024).
Asif, U., Memon, S. A., Javed, M. F. & Kim, J. Predictive modeling and experimental validation for assessing the mechanical properties of cementitious composites made with silica fume and ground granulated blast furnace slag. Buildings 14(4), 1091 (2024).
Korouzhdeh, T., Eskandari-Naddaf, H. & Gharouni-Nik, M. An improved ant colony model for cost optimization of composite beams. Appl. Artif. Intell. 31(1), 44–63 (2017).
Mottaghi, H., Masoodi, A. R. & Gandomi, A. H. Multiscale analysis of carbon nanotube-reinforced curved beams: A finite element approach coupled with multilayer perceptron neural network. Results Eng. 23, 102585 (2024).
Ghorbani, S. et al. Simultaneous effect of granite waste dust as partial replacement of cement and magnetized water on the properties of concrete exposed to NaCl and H2SO4 solutions. Constr. Build. Mater. 288, 123064 (2021).
Zhang, H., Li, H., Guo, H., Li, Y. & Wei, L. Mechanical properties of alkali activated geopolymer cement mortar for non vibratory compacted trench backfilling. Sci. Rep. 15(1), 12347 (2025).
Kim, S. et al. Thermoelectric cement-based composites containing carbon nanotubes (CNTs): Effects of water-to-cement ratio and CNT dosage. Case Stud. Constr. Mater. 21, e03861 (2024).
Moradi, M., Tavana, M., Habibi, M. & Amiri, M. Effect of water to cement ratio on mechanical properties of FRC subjected to elevated temperatures: Experimental and soft computing approaches. Heliyon 10(21), (2024).
Shanmugasundaram, N. & Praveenkumar, S. Influence of manufactured sand gradation and water cement ratios on compressive strength of engineered cementitious composites. Mater. Today Proc. (2023).
Heidari, A. & Shourabi, F. N. Mechanical properties of ultra-high performance concrete based on reactive powder concrete: Effect of sand-to-cement ratio, adding glass fiber and calcium carbonate. Constr. Build. Mater. 368, 130108 (2023).
Düzgün, A., Güneş, F., Kocacıklı, M. & Yaluğ, Ö. S. Effect of cementation technique and cement type on the amount of excess cement in implant-supported cement-retained crown restorations: An in vitro study. J. Prosthetic Dentistry (2024).
Vintimilla, C. & Etxeberria, M. Limiting the maximum fine and coarse recycled aggregates-Type B used in structural concrete. Constr. Build. Mater. 459, 139791 (2025).
Ghorbani, M., Biklaryan, M., Beygi, M. H. & Lotfi-Omran, O. Effects of nanosilica and aggregate type on the mechanical, fracture and shielding features of heavyweight concrete. Nucl. Eng. Des. 431, 113713 (2025).
Kalderis, D., Anastasiou, E., Petrakis, E. & Konopisi, S. Utilization of biochar from olive tree pruning as additive to cement mortars. J. Clean. Prod. 469, 143137 (2024).
He, S., Li, Y., Zhou, Y., Zhang, H. & Yu, P. Optimal analysis of the proportion between modified lime mud and active mineral additive in mortar: Insights from particle size distribution and packing density modeling. J. Build. Eng. 91, 109500 (2024).
Boumaza, A., Khouadjia, M. L. K., Isleem, H. F., Hamdi, O. M. & Khishe, M. Effect of blast furnace slag on the fresh and hardened properties of volcanic tuff-based geopolymer mortars. Sci. Rep. 15(1), 13651 (2025).
Aggelis, D., Momoki, S. & Shiotani, T. Experimental study of nonlinear wave parameters in mortar. Constr. Build. Mater. 47, 1409–1413 (2013).
Nguyen, M. H., Phan, H. N., Do, V. H. & Huynh, P. N. An experiment-based nonlinear model of shear force-slip relationship for perfobond strips in an unreinforced narrow joint with high-strength steel fiber mortar. Case Stud. Constr. Mater. 18, e02092 (2023).
Mohammed, A. et al. ANN, M5P-tree and nonlinear regression approaches with statistical evaluations to predict the compressive strength of cement-based mortar modified with fly ash. J. Market. Res. 9(6), 12416–12427 (2020).
Sevim, U. K., Bilgic, H. H., Cansiz, O. F., Ozturk, M. & Atis, C. D. Compressive strength prediction models for cementitious composites with fly ash using machine learning techniques. Constr. Build. Mater. 271, 121584 (2021).
Pourabbas Bilondi, M. et al. Experimental studies on mix design and properties of ceramic-glass geopolymer mortars using response surface methodology. Sci. Rep. 15(1), 282 (2025).
Wani, S. R. & Suthar, M. Using machine learning approaches for predicting the compressive strength of ultra-high-performance concrete with SHAP analysis. Asian J. Civil Eng. 26(1), 373–388 (2025).
Parhi, S. K. & Patro, S. K. Prediction of compressive strength of geopolymer concrete using a hybrid ensemble of grey wolf optimized machine learning estimators. J. Build. Eng. 71, 106521 (2023).
Gutiérrez-García, F.-J., Alayón-Miranda, S. & González-Díaz, E. Fuzzy model for predicting the strength of mortars made with Pozzalani cement and volcanic sand from electrical resistivity. J. Build. Eng. 79, 107840 (2023).
Liu, L. et al. Novel modified ANFIS based fuzzy logic model for performance prediction of FRCM-to-concrete bond strength. Adv. Eng. Softw. 182, 103474 (2023).
Wani, S. R. & Suthar, M. A comparative analysis of the Predictive performance of tree-based and artificial neural network approaches for compressive strength of concrete utilising waste. Int. J. Pavement Res. Technol. 1–22 (2024).
Wani, S. R. & Suthar, M. Using soft computing to forecast the strength of concrete utilized with sustainable natural fiber reinforced polymer composites. Asian J. Civil Eng. 25(8), 5847–5863 (2024).
Singh, S., Patro, S. K. & Parhi, S. K. Evolutionary optimization of machine learning algorithm hyperparameters for strength prediction of high-performance concrete. Asian J. Civil Eng. 24(8), 3121–3143 (2023).
Parhi, S. K. & Patro, S. K. Compressive strength prediction of PET fiber-reinforced concrete using Dolphin echolocation optimized decision tree-based machine learning algorithms. Asian J. Civil Eng. 25(1), 977–996 (2024).
Arabshahi, A., Gharaei-Moghaddam, N. & Tavakkolizadeh, M. Development of applicable design models for concrete columns confined with aramid fiber reinforced polymer using Multi-Expression Programming. in Structures. (Elsevier, 2020).
Mahdinia, S. & Tavakkolizadeh, M. Proposed a model for compressive strength of cement mortar by using multi expression programing. In 12th International Congress on Civil Engineering (2021).
Rojo-López, G., González-Fonteboa, B., Pérez-Ordóñez, J. L. & Martínez-Abella, F. Parametric analysis in sustainable self-compacting mortars using genetic programming. Constr. Build. Mater. 404, 133189 (2023).
Rojo-López, G., González-Fonteboa, B., Pérez-Ordóñez, J. L. & Martínez-Abella, F. Genetic programming to understand the influence of new sustainable powder materials in the fresh performance of cement pastes. J. Build. Eng. 88, 109186 (2024).
Rauf, A., Asif, U., Onyelowe, K., Javed, M. F. & Alabduljabbar, H. Experimental analysis and gene expression programming optimization of sustainable concrete containing mineral fillers. Sci. Rep. 14(1), 29280 (2024).
Asif, U. et al. Predicting the mechanical properties of plastic concrete: An optimization method by using genetic programming and ensemble learners. Case Stud. Constr. Mater. 20, e03135 (2024).
Asif, U. & Memon, S. A. Interpretable predictive modeling, sustainability assessment, and cost analysis of cement-based composite containing secondary raw materials. Constr. Build. Mater. 473, 140924 (2025).
Asif, U. Comparative analysis of evolutionary computational methods for predicting mechanical properties of fiber-reinforced 3D printed concrete. Innov. Infrastruct. Solut. 10(6), 259 (2025).
Badakhshan, E., Veylon, G., Peyras, L. & Vaunat, J. A simplified method for predicting overflow-induced crack propagation in gravity dams using genetic algorithm and material-based model. Int. J. Rock Mech. Min. Sci. 181, 105842 (2024).
Shi, Q. et al. Trajectory optimization of wall-building robots using response surface and non-dominated sorting genetic algorithm III. Autom. Constr. 155, 105035 (2023).
Shahmansouri, A. A., Bengar, H. A. & Ghanbari, S. Compressive strength prediction of eco-efficient GGBS-based geopolymer concrete using GEP method. J. Build. Eng. 31, 101326 (2020).
Abid, M., Waqar, G. Q., Mao, J., Javed, M. F. & Almujibah, H. Mechanical properties, microstructure and GEP-based modeling of basalt fiber reinforced lightweight high-strength concrete containing SCMs. J. Build. Eng. 96, 110378 (2024).
Mahdinia, S., Eskandari-Naddaf, H. & Shadnia, R. Effect of cement strength class on the prediction of compressive strength of cement mortar using GEP method. Constr. Build. Mater. 198, 27–41 (2019).
Iqbal, M. F. et al. Prediction of mechanical properties of green concrete incorporating waste foundry sand based on gene expression programming. J. Hazard. Mater. 384, 121322 (2020).
Behnood, A. & Golafshani, E. M. Machine learning study of the mechanical properties of concretes containing waste foundry sand. Constr. Build. Mater. 243, 118152 (2020).
ASTM C-230. Standard Specification for Flow Table for Use in Tests of Hydraulic Cement (ASTM International, 2008).
ASTM C-348. Standard Test Method for Flexural Strength of Hydraulic-cement Mortars (American Society for Testing and Materials, 2002).
ASTM C-303. Standard Practice for Mechanical Mixing of Hydraulic Cement Pastes and Mortars of Plastic Consistency (ASTM International, 1999).
ASTM C-33. Standard Specification for Concrete Aggregates (ASTM International, 2009).
Ferreira, C. & Gepsoft, U., What is Gene Expression Programming. (2008).
Holland, J. H. Genetic algorithms. Sci. Am. 267(1), 66–73 (1992).
Koza, J. R. Genetic programming III: Darwinian invention and problem solving (Morgan Kaufmann, 1999).
Willis, M.-J., Hiden, H. G., Marenbach, P., McKay, B. & Montague, G. A. Genetic programming: An introduction and survey of applications. In Genetic Algorithms in Engineering Systems: Innovations and Applications, 1997. GALESIA 97. Second International Conference On (Conf. Publ. No. 446). (IET, 1997).
Ferreira, C. Gene Expression Programming: Mathematical Modeling by an Artificial Intelligence (Springer, 2006).
Ferreira, C. Gene expression programming in problem solving. In Soft Computing and Industry 635–653 (Springer, 2002).
Ryu, H.-G. & Kwon, Y.-J. An analysis of the properties of mortar according to the change of the replacement rate of waste foundry sands. J. Korean Recycled Constr. Resour. Inst. 4(4), 99–104 (2009).
Park Bo-Yeol, J. J.-H. & Hyun-Ki, J. An experimental study on the mortar using waste foundry sand as fine aggregate. 71–75. (2006).
Han-Young, M., Yong-Kyu, S. & Jung-Kyu, J. Fundamental properties of Mortar and Concrete using waste foundry sand. J. Korea Concrete Inst. 17(1), 141–147 (2005).
Silva, F. A. et al. Use of nondestructive testing of ultrasound and artificial neural networks to estimate compressive strength of concrete. Buildings 11(2), 44 (2021).
Author information
Authors and Affiliations
Contributions
S.M., M.R.T. and A.R.M. conceptualized the study. S.M. developed the methodology, performed the formal analysis, implemented the software, created the visualizations, and wrote the original draft. M.R.T. provided supervision and resources and contributed to reviewing and editing the manuscript. A.R.M. contributed to supervision, investigation, and resources, and was involved in both the original drafting and review and editing of the manuscript. All authors reviewed the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Mahdinia, S., Tavakkolizadeh, M. & Masoodi, A.R. Experimental and GEP-based evaluation of compressive strength in eco-friendly mortars with waste foundry sand and varying cement grades. Sci Rep 15, 38690 (2025). https://doi.org/10.1038/s41598-025-22514-0
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-22514-0



































