Abstract
Lightweight high-strength concrete (LWHSC) is increasingly used as a sustainable material that reduces structural weight while maintaining performance. Promoting sustainability involves using less high-carbon cement and more supplementary cementitious materials (SCMs) like fly ash, silica fume, and slag. However, testing LWHSC’s mechanical behavior is costly and time-consuming, highlighting the need for reliable prediction tools. To address this, this study employs machine learning models multi-expression programming (MEP) and random forest (RF) to forecast the mechanical properties of LWHSC containing SCMs using a large dataset with eight key parameters, including water-to-binder ratio, cement, fly ash, slag, silica fume, aggregate, lightweight aggregate, and basalt fiber. Performance was evaluated with R2, MAE, RMSE, and MSE. Both models captured strength trends, but MEP was more accurate, especially for compressive strength (R2 = 0.98–0.99) versus RF (0.87–0.91), and similarly for tensile and flexural strengths. Errors mostly stayed below 3 MPa for CS, 0.5 MPa for TS, and 2 MPa for FS. Taylor diagrams confirmed MEP predictions closely matched experimental data. Additionally, SHapely Additive ExPlanations (SHAP) investigation demonstrated that the water-to-binder ratio and lightweight aggregate had a favorable impact on the mechanical properties of the LWHSC. The study highlights MEP as a robust, dependable tool for designing sustainable LWHSC by effectively combining SCMs with machine learning.
Introduction
Concrete is the most widely produced product utilized worldwide1. Lightweight concrete (LWC) was originally developed to lower the density by including voids into conventional concrete in order to minimize the self-weight of modern structures, therefore decreasing dead loads in design while preserving acceptable strength and durability. This was accomplished by adding voids to the aggregates with lightweight aggregates (LWA), producing voids in the cement paste with cellular or foamed concrete, eliminating fine aggregates, and combining these techniques2. Researchers tend to favor using LWA over the other approaches. Significant efforts have been undertaken in recent decades to improve the characteristics of concrete for a variety of applications, including enhancing its strength and durability while minimizing the environmental impact of its manufacture. Pumice3, expanded slate4, expanded clay5, expanded polystyrene beads6, perlite7, and cenosphere8 are among the LWAs that have been the subject of intensive research over time to produce LWC for several applications9. LWC has a lower density because there are more empty spaces there. Because of this, it is a better option for thermal insulation and has lower production cost and greater fire resistance than normal-weight concrete (NWC)10. As a result, it has been used in both structural and nonstructural applications, including offshore and floating structures, high-rise skyscrapers, and long-span bridges11,12,13. The Romans employed natural lightweight pumice materials to make LWC two thousand years ago to build the 44-meter-diameter dome of the Pantheon14. Historically, LWA was mostly composed of raw natural elements such as tuff, pumice, and scoria. Artificial LWA, such as expanded clay/perlite/shale/slate and sintered fly ash (FA), have recently been developed on a massive scale to fulfill the growing worldwide energy demand and the shortage of natural LWA15.
Because of voids in the material, LWC usually has a lower compressive strength than NWC while having a lower density14. As such, the use of LWC is limited to applications where the concrete needs to have greater strength and improved durability. In light of this, researchers have spent over a decade working to maintain the unique characteristics of LWC while enhancing its mechanical properties, most notably its compressive strength. This effort resulted in the invention of lightweight high-strength concrete (LWHSC). The advantages of high-strength concrete (HSC) and LWC are synergistically integrated by LWHSC. HSC is defined by ACI21316 as concrete whose compressive strength is more than 40 MPa. Although Euro code 217 does not define HSC specifically, it is commonly considered to refer to concrete having a compressive strength of more than 40–50 MPa. The majority of researchers believe that LWHSC has a strength of more than 40 MPa and a density of less than 2000 kg/m318,19. Because LWHSC is more affordable than ordinary concrete, it is the favored material for high-rise buildings, modular construction, precast concrete structures, and marine constructions. Because LWHSC’s main goal is to reduce the self-weight of modular elements while preserving their high strength, it can be applied to modular constructions. Because LWHSC has a lower self-weight than other materials, it can be used to cast precast concrete members for roof framing and long-span bridge girders without requiring the handling of large concrete elements or resolving production-related challenges on-site14,20.
Cement, which accounts for 8% of global greenhouse gas emissions, is being replaced by sustainable byproducts known as supplemental cementitious materials (SCMs) in the construction industry21,22. Economically speaking, these materials are often more affordable than cement, which lowers the total cost of construction. Additionally, using them improves the mechanical properties and resilience of concrete, which may save maintenance costs over the course of a structure’s lifetime. From an environmental standpoint, using these by-products in place of cement reduces the requirement for clinker, the primary component of cement that is associated with high carbon dioxide emissions23,24. By lowering greenhouse gas emissions, this replacement supports international efforts to combat climate change25,26. Utilizing these industrial byproducts also diverts them from landfills, reducing pollution to the environment and promoting environmentally friendly waste management practices25,27. According to several investigations28,29, high-strength lightweight concrete (HSLWC) produced with silica fume (SF) exhibited strong bond strength and chloride permeability comparable to NWC HSLWC was created by Klic et al.30 using SF and FA to lower density and boost strength. According to earlier research, 25-year-old LWC with pozzolanic materials such as SF, FA, and blast furnace slag (BS) had less chloride ion penetration than reference concrete without SCMs31,32. Wilson et al.27 found that adding 5–10% SF to cement by weight increased HSLWC’s resistance to freezing and thawing. The mechanical properties and durability of HSLWC were improved by the use of SF and metakaoline (MK)33, with even greater advantages noted at a lower water-to-binder ratio of 0.2534. By decreasing permeability and shrinkage, the addition of MK and BS improved the early-age strength and durability of HSLWC35,36. Bleeding and up-floating of LWA are successfully prevented in HSLWC by using FA-BS-SF37. To counteract the negative effects of LWA, nano-silica was added to HSLWC in addition to micro-size SCMs38. Results indicated a significant improvement in strength and durability with the addition of these byproducts39.
Despite tremendous advancements, earlier research had substantial shortcomings that support the necessity of machine learning (ML) techniques. The intricate, nonlinear interactions between various SCM combinations, LWA types, and mix parameters are frequently difficult for traditional empirical models to capture. Moreover, a lack of generalizability across material systems, inconsistent testing circumstances, and small dataset sizes plague many prior experimental investigations. Recent studies have shown that machine learning (ML) techniques, including artificial neural networks (ANN), random forests (RF), and gradient boosting, can perform better than traditional regression techniques in strength prediction and sustainability assessment40,41,42.
So far, only a few research have studied the prediction of mechanical properties for LWHSC using sophisticated ML approaches, and even fewer have assessed the role of recycled concrete powder (RCP), SCMs, or recycled aggregates in a sustainability context43,44. Experimental work can be laborious and costly; machine learning methodologies such as Multi-Expression Programming and Random Forest (RF) models provide an efficient alternative by utilizing existing data to forecast the mechanical properties of LWHSC that incorporates supplementary cementitious materials.
Machine learning (ML) models are surpassing conventional methods and transforming constitutive model creation. Their vast capacity for processing information, capacity to identify interactions at several scales, and versatility in handling different datasets all help to improve accuracy and efficiency45. An analysis of Multi-Expression Programming (MEP) and RF in the context of concrete prediction and analysis demonstrates important developments in predictive modeling. MEP is an evolutionary algorithm that builds mathematical models from data, which makes it perfect for complex material behavior predictions. MEP has been used in several research to forecast the mechanical characteristics of concrete. As an example, to predict the split tensile strength (TS) and elastic modulus (E) of waste-foundry-sand concrete (WFSC), Chen et al.46 created Multi-Expression Programming (MEP) models. They were able to achieve strong correlations of R = 0.892 for TS and R = 0.996 for E. By using MEP to predict the 28-day compressive strength of fiber-reinforced self-compacting concrete (FR-SCC), Inqiad et al.47 were able to demonstrate the predictive reliability of the method with an objective function (OF) value of 0.031. MEP was also used by M. Khan et al.48 to predict the slump, compressive strength, and elastic modulus of bentonite plastic concrete (BPC). They found remarkable correlations for slump, compressive strength, and elastic modulus: R = 0.9999, R = 0.9831, and R = 0.9300, respectively. Apart from concrete, Jalal et al.49 used MEP to model the compaction properties of expansive soils, including maximum dry density and optimum moisture content, using a dataset of 195 cases. They showed dependable predictive accuracy that was confirmed by several statistical indicators (MAE, RMSE, NSE, R). Furthermore, Farooq et al.50 used the RF approach to estimate the compressive strength of high-strength concrete (HSC) and outperformed traditional algorithms, obtaining a coefficient of determination R2 = 0.96 with low prediction errors. Although MEP and RF have shown promise in modeling concrete and geotechnical qualities, more research is needed to determine how effectively they work synergistically to predict the mechanical properties of LWHSC that contains SCMs. To address this gap, the current study combines MEP and RF models to provide strong predictive frameworks for LWHSC with SCMs, improving the interpretability and accuracy of predictions for mix design optimization.
Although a lot of research has been done on LWHSC, comprehensive studies that apply advanced machine learning algorithms to forecast its mechanical properties are still lacking. Specifically, not enough studies have been conducted to investigate the mechanical behavior of LWHSC by adding supplemental cementitious materials (SCMs) through machine learning. The inability of traditional empirical and statistical models to accurately capture the intricate connections between material composition and mechanical characteristics impedes the advancement of more effective formulations for advanced structural applications. To address these gaps, this study collected an extensive dataset from the existing literature to develop constitutive models for the prediction of the mechanical properties of LWHSC. The findings of this study have the potential to considerably advance sustainable construction practices by providing practical solutions for the construction industry’s transition to greener materials and practices. Table 1 contains a quick summary of major studies to provide a more complete picture of recent machine learning applications in sustainable concrete.
Research methodology
Multi expression programming
MEP, a robust linear-based technique within Genetic Programming (GP), utilizes linear chromosomes to store solutions effectively. Operating similarly to GEP, MEP encodes multiple solutions within a single chromosome, enhancing efficiency55. The process involves generating solutions based on the fittest chromosome identified through fitness function comparison. Parent selection occurs via a binary environment, leading to recombination and the creation of two distinct offspring56. Subsequent mutation and iteration refine the offspring until the optimal program is identified, triggering termination based on predefined criteria, illustrated in Fig. 1, while Fig. 2 shows the architecture of MEP. MEP model calibration necessitates adjustment of key parameters, including code length, subpopulation size, subpopulation number, function set, and crossover probability57. Notably, increasing the number of subpopulations extends processing time, while the complexity of MEP formulation is strongly influenced by code length.
MEP provides various advantages over other forms of genetic techniques, such as genetic programming (GP). GP uses a tree crossover evolutionary process to build a large number of parse trees, which increases processing time and storage needs58. Furthermore, because GP is both a phenotype and a genotype, it is difficult to provide a simple formulation for the intended job. MEP allows a broad variety of expressions, including implicit parallelism. MEP also has the ability to store many solutions to an issue on a single chromosome56,59. MEP can discriminate between symptoms and genotypes due to linear progressions60. MEP is projected to be more effective than other machine learning algorithms due to its ability to encode many responses on a single chromosome. This unique feature enables MEP to look for a more viable solution. Unlike previous GP algorithms, MEP provides clear decoding operations and pays special attention to cases where the intended expression requirements are ambiguous59. MEP may deal with a variety of difficulties, including division by zero and improper expressions61. Furthermore, multi-gene genetic programming (MGGP) and MEP are expansions of traditional GP designed to address difficult optimization problems. While their methodologies are similar, they differ significantly in how solutions are presented and evolved. In MGGP, an individual is represented by a collection of genes, each of which may encode a distinct subcomponent or module of the solution48,62. These genes could be trees or other structures relevant to the problem domain. In MEP, an individual is represented as a set of numerous expressions, which are often linear or matrix-based63. Each expression contributes to the overall result and can be examined independently. Furthermore, MGGP typically uses genetic operators like as crossover and mutation at the gene level64. It means that crossover and mutation processes can take place inside individual genes, allowing the solution’s subcomponents to be swapped or changed. MEP, on the other hand, commonly incorporates expression-level mutation operators, which modify individual expressions or portions of expressions to generate new candidate solutions. Crossover procedures in MEP may entail mixing complete expressions from multiple individuals56.
Random forest regression
Breiman proposed RF regression in 200165, which is considered an enhanced classification regression method. The fundamental characteristics of RF include its speed and flexibility in establishing a link between input and output functions. Furthermore, RF outperforms other machine learning algorithms when dealing with huge datasets. RF creates numerous decision trees during the training phase and combines their predictions to achieve more accurate and dependable predictions, as shown in Fig. 366. RF has been employed in a variety of industries, including banking to forecast client responses67, stock market price direction68, medicine/pharmaceutical business69, e-commerce70, and so on. RF is formed by combining many tree predictors, with a random vector influencing each tree’s choice. This vector is sampled separately and distributed uniformly across the forest65. Each tree in the forest is trained on a unique training set, which may provide somewhat different results71. During the training phase, each tree is constructed from a randomly chosen subset of the training data, known as bootstrapped samples, which implies that certain data points may be repeated while others are discarded. Furthermore, RF employs feature randomization, which selects a random collection of features for each tree node split. This technique reduces overfitting and increases tree variety. RF evaluates its training performance using the Out-of-Bag (OOB) error72. This entails evaluating each tree’s predictions for data points not included in the bootstrap sample. The OOB error provides an accurate approximation of the model’s generalization error. The final output is produced by averaging the various tree forecasts for regression tasks73,74.
The RF approach is perfect for regression tasks because of a few benefits. For instance, RF is renowned for its inherent resistance to overfitting and great precision. It requires little tweaking and performs effectively on a range of datasets75. Additionally, since individual trees may be made separately, RF can be parallelized. Because of this, RF is suitable for training in remote computing environments and managing large datasets76.
Datasets used in modeling
The novel MEP-based and RF machine learning methods were used in the current study. To train our model based on the worldwide sample and produce MEP and RF models, a comprehensive database was built from the experimental work conducted in the literature. 170 CS data points, expressed in megapascals (MPa), are included in the training database18,77,78,79,80,81,82,83,84,85,86,87, 161 FS19,79,80,81,83,86,87,88,89,90,91,92,93,94,95, and 186 data points for TS19,33,78,79,81,82,83,84,86,87,89,92,95,96,97. The supplementary material contains information on the dataset that was utilized for the modelling. For modelling purposes, water to cement ratio (W/B), cement (C), fly ash (FA), blast furnace slag (BS), silica fume (SF), natural fine aggregate (NFA), lightweight aggregate (LWA), and basalt fiber (BF) content are the eight most important input components that were taken into account. Additionally, Table 2 presents a variety of statistical values for the data to offer additional insights into the database. This table shows that CS, derived from 170 observations, exhibits values between 19.65 MPa and 86.10 MPa, with a mean of 46.79 MPa, reflecting substantial variability in structural performance. FS based on 161 data points, ranges from 0.73 MPa to 13.35 MPa, with an average of 5.57 MPa, while tensile strength, calculated from 186 observations, spans 0.46 MPa to 12.82 MPa, with a mean of 3.72 MPa. The skewness values for all examined variables fall within the acceptable interval of − 3 to + 3, indicating appropriate distributional symmetry98, whereas the kurtosis values lie within the recommended range of − 10 to + 10, signifying an absence of pronounced outliers or extreme deviations from normality. Collectively, these descriptive statistics underscore the diversity and balanced nature of the dataset, establishing its suitability for subsequent statistical modeling and in‑depth analysis99,100.
Furthermore, Pearson correlation heat maps were generated to assess the interrelationships among input variables for CS, TS, and FS, as shown in Fig. 4. Understanding these correlations is essential, as strong associations between predictors can lead to multicollinearity, which complicates the interpretation of regression and machine learning models101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118. The analysis indicated that most pairwise correlation coefficients (r-values) remained below the recommended threshold of 0.8, suggesting a low risk of multicollinearity and ensuring more reliable model estimation49,119,120. These results confirm that the dataset maintains an acceptable level of independence among its variables, thereby supporting the development of robust and interpretable predictive models.
The prediction model is known to be influenced by the pattern and distribution of the input data. Figure 5 illustrates the frequency distribution of the input data. This chart illustrates that the input data frequencies are much greater, and the distribution is not homogeneous, indicating that the models apply to a wider range of data.
The influence of input parameters on the mechanical properties of LWHSC was investigated using Hex contour plots for CS, FS, and TS, as shown in Figs. 6, 7 and 8. The darker regions in these figures indicate areas with a higher concentration of input variables. For CS (Fig. 6), the W/B ratio (Fig. 6a) was most densely populated between two regions, from 0.23 to 0.30 and from 0.37 to 0.42, indicating a preference for lower ratios in effective mixtures. Cement content (Fig. 6b) commonly ranged from 390 to 510 kg/m3, while FA (Fig. 6c) and slag (Fig. 6d) were concentrated between 25 and 50 kg/m3 and 20–50 kg/m3, respectively. SF (Fig. 6e) was primarily clustered between 4% and 10%. NFA and LWA (Fig. 6f, g) showed concentrations in the ranges of 300–1300 kg/m3 and 200–800 kg/m3, respectively. BF content (Fig. 6h) was most commonly used between 0.5% and 1.0%. A similar pattern was observed for FS (Fig. 7), where the W/B ratio (Fig. 7a) was concentrated between 0.26 and 0.42, and cement content (Fig. 7b) ranged from 300 to 520 kg/m3. FA and slag (Fig. 7c, d) were frequently used in the 40–80 kg/m3 and 10–40 kg/m3 ranges, respectively, while silica fume (Fig. 7e) remained between 6% and 60%. NFA and LWA (Fig. 7f, g) showed similar distributions to those in CS, and BF (Fig. 7h) was densely populated between 0.8% and 1.0%. For TS (Fig. 8), the W/B ratio (Fig. 8a) was mainly between 0.20 and 0.43, cement content (Fig. 8b) was found between 380 and 500 kg/m3, and both FA and slag (Fig. 8c and d) showed concentrations in the 40–100 kg/m3 and 20–40 kg/m3 intervals, respectively. SF (Fig. 8e) was again centered around 25–80%, with NFA and LWA aggregates (Fig. 8f, g) following previous patterns. BF (Fig. 8h) remained most frequent between 0.8% and 1.0%. With these input parameters, the values of CS generally ranged from 25 to 70 MPa, with the majority concentrated between 35 and 50 MPa. FS values ranged approximately from 2.5 to 7.5 MPa, predominantly clustering around 5 to 6 MPa, while TS values fell between 2 and 5 MPa, with a dense concentration near 4 MPa. These trends suggest that optimized input ranges consistently support enhanced mechanical performance across all strength categories.
MEP and RF models development
Prior to creating a model, it is critical to carefully select the input elements that have a major impact on the features of LWHSC77,81. The most influential factors from the literature were studied for LWHSC. The eight main factors selected were Water to cement ratio (W/B), Cement (C), Fly ash (FA), Blast Furnace Slag (BS), Silica fume (SF), Natural Fine Aggregate (NFA), Lightweight Aggregate (LWA), and Basalt fiber (BF) content. Thus, the mechanical strength predicted models were created based on the characteristics provided in Eq. (1)
Figure 9 shows the methodology utilized in this investigation. Several MEP setup factors must be determined before developing a valid and adaptive model. The setup variables are determined based on earlier recommendations and a trial-and-error method121. The population size determines how many programs are generated. A large-scale population model may be more complex, but it is more precise and reliable and it takes longer to achieve convergence. However, if the size goes above a specific range, the model may overfit. Table 3 displays the setup variables used for the model built in this study. The function uses simple mathematical operators (ln, exp, -, ×, ÷, +) to simplify the final formulations. The number of generations reflects how accurate the procedure was before being terminated. A multi-generation run will create the most error-free simulation model. Various variable combinations were utilized to improve the model, and the best combination was chosen to produce the model with the fewest errors. The primary difficulty with ML prediction simulation is overfitting the prediction model. When used with original data, the model performs admirably; however, when given unknown data, the model performs much worse. To avoid overfitting, it has been proposed that the model be tested on previously unknown data121,122. As a result, the data is proportionally divided into two categories. After training, the model is tested on a dataset that was not used to train the model. The database was split into two subsets: 30% for testing and 70% for training. The created models perform admirably on all datasets. The MEPX tool (version 2023.3.5) was used in this work to perform MEP modeling.
Initially, the modeling process produces the best answers for the population. The technique is repeated, with each iteration getting closer to the final result. The fitness of each subsequent generation is determined. MEP modeling procedure continues until the fitness value remains unchanged. If the results are not exact, the operation is repeated by gradually increasing the population size and tweaking additional hyperparameters. After evaluating the fitness function of each model, the model with the lowest fitness is picked.
The Random Forest (RF) model was developed using Python’s Scikit-Learn library, with hyperparameters optimized through an iterative tuning process to enhance predictive accuracy. The number of estimators, each representing a decision tree, was balanced between performance improvements and computational cost. This aligns with findings from Liu et al., where RF was successfully employed to predict the compressive strength of high-performance concrete, yielding robust results123. Moreover, RF was shown to outperform other regression techniques in predicting compressive strength of self-compacting concrete, demonstrating its effectiveness in modeling complex mix designs124. In eco-friendly concrete incorporating rice husk ash, RF achieved strong predictive capabilities (with accuracy often surpassing Gaussian Process Regression and Decision Tree models)125. Additionally, RF demonstrated high reliability in estimating the compressive strength of sustainable self-consolidating concrete containing SCMs like fly ash and GGBS126. These studies reinforce RF’s suitability and that of its tuned hyperparameters for predicting mechanical behaviors in SCM-based concrete systems. Table 3 summarizes the final optimized hyperparameters used in this study.
MEP and RF models evaluation
Various measures have been used to analyze a model’s viability and performance. Each indicator has its own method of estimating the performance of these models. The developed model was assessed using three statistical techniques: mean absolute error (MAE), mean squared error (MSE), and root mean square error (RMSE)46,127. It is worth noting that the RMSE, MSE, and MAE are three statistical metrics used extensively in machine learning to evaluate error levels128. A lower RMSE, MSE, or MAE value suggests that the model is more precise and accurate. Equations (2)–(4) represent the aforementioned statistical checks. In addition, the coefficient of determination R2 was calculated for each model.
The symbols “mi” and “ei” represent the model and experimental values, respectively, whereas the variable “n” denotes the total quantity of samples. The R2 factor was calculated for both the actual and predicted outcomes due to its fair estimation results and enhanced performance. R2 values approaching 1 signify more efficiency in the developed model129. RMSE effectively addresses the higher magnitudes of error. A lower root mean square error (RMSE) score denotes less error impact and enhanced model performance130. On the other hand, it was found that MAE is more significant when applied to smooth and continuous data131.
Interpretation of models with SHAP analysis
Lundberg and Lee132 introduced a method known as SHAP (Shapely Additive Explanations) for evaluating machine learning models. The capacity of ML models to learn from seen data and anticipate outcomes in fresh data has captured all attempts to create robust estimating tools. However, most ML modeling approaches suffer from excessive complexity and limited interpretability. The SHAP technique improves the explainability of prediction models. In the present study, SHAP is utilized to see the input parameters’ importance and influence on the response parameter. Even though much ML research in construction materials has attained incredible precision in forecasting their targets, insufficient consideration is made to the interpretation of the ML models. Numerous research computes characteristic significance in ML models using the decision path, and intuitive techniques133. To evaluate every input variable’s proportional importance to the output, as well as to check whether input variables improve favorably or adversely the output, a SHAP based technique was developed. References134,135 provide a full discussion of the SHAP technique. Shapley value indicates the significance of each independent variable to the response properties. This approach is the same as parametric analysis in which one variable is altered while other parameters are held fixed to evaluate how changes in one input parameter are affecting the output. The SHAP is used in this work to provide both parameter importance and influence interpretations of each input variable. Additionally, it facilitates an examination of how input variables positively or negatively affect the model’s output132,136,137,138,139,140,141,142,143. Based on cooperative game theory, SHAP represents the average proportional contribution of an attribute value across all potential combinations. The SHAP number of a character is defined as the average predicted value of a variable with this feature without the mean forecasted value of observations. The outcome of an ML model is assumed as a linear sum of its input characteristics scaled by matching SHAP values to enable interpretability.
Results and discussion
Outcomes of MEP modeling
Regression analysis
Figures 10 and 11 compare experimental and predicted CS of LWHSC using the MEP model, showing strong agreement with R2 values of 0.98378 (training) and 0.98383 (testing) and Pearson’s r of 0.99186 and 0.99188, respectively. Similarly, Figs. 12 and 13 present TS results, with R2, adjusted R2, and Pearson’s r of 0.99307, 0.99302, and 0.99653 for training, and 0.99212, 0.99197, and 0.99605 for testing, indicating excellent predictive accuracy. While Figs. 14 and 15 show FS comparisons, where R2, adjusted R2, and Pearson’s r were 0.94331, 0.9428, and 0.97124 for training, and 0.95681, 0.95587, and 0.97817 for testing, confirming strong model reliability.
These findings are consistent with prior studies using machine learning methods to predict concrete characteristics. Inqiad et al.47 found that MEP was highly accurate in forecasting the 28-day compressive strength of fiber-reinforced self-compacting concrete, with R2 values exceeding 0.97. Similarly, Khan et al.48 demonstrated the ability of MEP to forecast the workability and mechanical properties of bentonite plastic concrete with R values more than 0.98, highlighting its robustness in capturing nonlinear connections. Furthermore, predicted experiments on expansive soils employing MEP49 verified its high dependability and narrow error margins, which are consistent with the accuracy found in the current study for LWHSC. These comparisons confirm that MEP not only makes correct predictions in our work, but also consistently outperforms other cementitious materials, bolstering its usefulness in real-world engineering applications.
Error analysis
This section shows the absolute error that exists between the actual and model values for the given datasets. These plots provide a glimpse of the maximum error occurrence in the MEP model. Figures 16 and 17 illustrate the 3D error distributions for MEP-predicted CS, TS, and FS of LWHSC during the training and testing phases, respectively. In the training phase (Fig. 16), CS errors range from 0.002 to 7.8 MPa, averaging 1.2 MPa, with most values under 3 MPa. TS errors fall between 0.0005 and 0.68 MPa (average 0.25 MPa), while FS errors range from 0.001 to 2.4 MPa (average 0.8 MPa). In the testing phase (Fig. 17), CS errors vary from 0.25 to 6.0 MPa, generally staying below 3.0 MPa. TS errors range from 0.00 to 0.50 MPa, mostly under 0.30 MPa, and FS errors lie between 0.00 and 2.0 MPa, with most values below 1.0 MPa. These results demonstrate that prediction errors remain consistently low in both phases, highlighting the MEP model’s high accuracy and reliable generalization across all mechanical properties.
Outcomes of RF model
Regression analysis
Figures 18 and 19 illustrate the comparison between experimental and predicted CS of lightweight high-strength concrete (LWHSC) using the RF model, showing strong agreement with R2 values of 0.9065 (training) and 0.87558 (testing) and Pearson’s r of 0.9521 and 0.93572, respectively. Figures 20 and 21 present the TS results, where the model achieved R2, adjusted R2, and Pearson’s r values of 0.85115, 0.8500, and 0.92258 for training, and 0.80237, 0.79864, and 0.89575 for testing, indicating substantial predictive accuracy. Figures 22 and 23 compare predicted and experimental FS, with R2, adjusted R2, and Pearson’s r values of 0.92445, 0.92377, and 0.96148 for training, and 0.94441, 0.9432, and 0.97181 for testing, confirming the model’s strong reliability in estimating LWHSC properties.
Notably, the performance of our RF model is supported by recent literature, which has demonstrated that RF can provide good prediction accuracy for concrete strength estimation. Xu et al.144 employed RF to estimate compressive strength of high-performance concrete and found R2 = 0.93, indicating reliable prediction capacity. Khan et al.145 found R2 = 0.9922 when using RF to estimate compressive strength in recycled coarse aggregate concrete, indicating strong alignment with experimental results in similar circumstances. Zhang et al.146 employed RF and other ML models with hyperparameter tuning and found robust generalization for recycled aggregate concrete, which is consistent with the significant prediction success seen in our investigation. These findings collectively reinforce that, while RF has slightly poorer precision than MEP in our results, its performance is nevertheless comparable to top-tier RF applications in the existing literature.
Error analysis
Figures 24 and 25 present the error distributions of the RF model for predicting the compressive strength (CS), split tensile strength (TS), and flexural strength (FS) of LWHSC in both training and testing phases. During training (Fig. 24), the RF model showed high accuracy, with CS errors ranging from 0.00 to around 5.2 MPa, most of which were below 2.0 MPa. TS errors were between 0.00 and 0.40 MPa, while FS errors fell within 0.00 to 2.0 MPa. The predictions closely matched the actual values, indicating a precise fit. In the testing phase (Fig. 25), the model continued to perform well, with CS errors ranging from 0.00 to 6.0 MPa and generally staying under 3.0 MPa. TS errors varied between 0.00 and 0.45 MPa, and FS errors ranged from 0.00 to 2.2 MPa. Although testing errors showed a slightly wider distribution, especially for FS, most values still remained near the actual results. Overall, the RF model delivered strong predictive performance in both phases, maintaining low errors and demonstrating its reliability in estimating all key strength properties.
Performance evaluation of MEP and RF models
The number of data points needed to build a model is important since it influences the model’s validity. A satisfactory model requires a data set proportion of 3 to the number of inputs, with a ratio of 5 preferred147,148,149. This work maintains a ratio of 21 for CS, 23 for TS, and 20 for FS. As previously stated, the performance of all models is evaluated using multiple statistical measures (R2, MAE, MSE, and RMSE). Statistical tests on the training and testing datasets were performed to measure the effectiveness and competency of the created models. The values of all error measurements for MEP and RF algorithms are provided in Table 4 and illustrated by radar graphs shown in Figs. 26 and 27, respectively. The table provides a good correlation between the model-estimated and actual values, as R2 values are closer to 1 (ideal condition). The MAE, MSE, and RMSE values for all datasets and both algorithms are notably lower, which indicates the good precision and generalization ability of MEP and RF models. All the statistical checks were within the limits specified by the earlier studies87.
Comparative analysis of the performance of established models
Figure 28 depicts the Taylor diagram, which is useful for a more in-depth comparison of the established models’ performance. Taylor150 invented the Taylor diagram, which is a visual representation that aids in determining the accuracy of several models by demonstrating which model is most realistic and best aligns with the actual data. In a Taylor diagram, different models or datasets are shown on the same graph. Their relative performance can be measured based on how well they align with the reference data (actual dataset)151. Furthermore, this diagram provides a comprehensive method for comparing multiple models based on their correlation, variance, and RMSE from reference data. It provides a thorough analysis of the models’ correlation and standard deviation with the observed data152,153. Figure 28 illustrates each model as a point, with its performance represented by the distance from the reference point. Higher correlation and fewer errors indicate that a model is closer to the reference point, and hence more accurate in predicting outcomes154,155.
Figure 28 presents Taylor diagrams comparing the MEP and RF models for CS, TS, and FS. In each case, the MEP model demonstrates a stronger match with the experimental data. For CS, MEP achieves a high correlation coefficient of approximately 0.99 and a standard deviation of about 13.95, which closely matches the experimental standard deviation (14). In contrast, the RF model shows a lower correlation of around 0.92 and a smaller standard deviation of 13.1, indicating weaker alignment with the data. A similar trend is observed in TS prediction, where MEP attains a correlation of 0.985 and a standard deviation of 1.525, both very close to the actual values, while RF lags slightly with a correlation of 0.96 and a standard deviation of 1.475. For FS, the MEP model again performs better, with a correlation near 0.99 and a standard deviation of 2.625, closely matching the experimental 2.64, whereas RF records a lower correlation of 0.94 and a standard deviation of 2.57. These results confirm that MEP consistently provides higher accuracy, better variance matching, and improved predictive reliability. Therefore, it can be concluded that the MEP model significantly outperforms the RF model across all strength parameters, making it the more robust and dependable choice for modeling the mechanical properties of LWHSC.
Comparing MEP and RF models with related work
The present work MEP and RF models are compared with similar models previously developed in the literature for estimating the properties of LWHSC, as presented in Table 5. Sifan et al.156 predicted the compressive strength of LWHSC by utilizing Gradient Boosting Regression (GBR) ML technique. The results from his work showed an R2 value of 0.95 which is found to be lower than the present study. Similarly, Kumar et al.157 forecasted the CS of LWHSC by using Gaussian Progress Regression (GPR), Ensemble Learning (EL), Support Vector Machine Regression (SVMR), and optimized GPR, SVMR, and EL. Findings from their study showed lower performance as compared to our present study. Furthermore, Yaser et al.158 developed a deep learning model that identified important variables such the water-to-cement ratio, aggregate ratios, and superplasticizer content in order to predict the compressive strength (UCS) of lightweight pumice concrete. The model outperformed the Multilayer Perceptron (MLP), Support Vector Machine (SVM), and Decision Tree (DT) models with R2 = 0.914, accuracy = 0.97, and AUC = 0.971. Our MEP model, in contrast, demonstrated greater accuracy, particularly for compressive strength (R2 = 0.98–0.99). Additionally, using 2,568 samples, NS Alghrairi et al.159 developed nine models to forecast the compressive strength (CS) of lightweight concrete (LWC) with and without nanomaterials. Density, water content, cement, water-to-binder ratio, and nanomaterial content were among the input factors. Gradient-Boosted Trees (GBT), Random Forest (RF), Tree Ensemble (TE), Extreme Gradient Boosting (XGB), Keras Neural Network (KNN), Simple Regression (SR), Probabilistic Neural Network (PNN), Multilayer Perceptron (MLP), and Linear Regression (LR) were the models that performed the best (R2 = 0.90, RMSE = 5.286). On the other hand, our MEP model demonstrated better prediction performance for LWC strength, achieving higher accuracy (R2 = 0.98–0.99). In a comparable fashion, Fazal Hussain et al.160 used 420 data points from 43 investigations to build a machine learning (ML)-based method for mix design optimization of lightweight aggregate concrete (LWAC). Eleven input parameters pertaining to mix and aggregate features were used to train five machine learning algorithms: Support Vector Machine (SVM), Artificial Neural Network (ANN), Decision Tree (DT), Gaussian Process Regression (GPR), and Extreme Gradient Boosting (XGBoost). The outcome was compressive strength. The GPR model outperformed the others (R2 = 0.99, RMSE = 1.34, MSE = 1.79, MAE = 0.69). By contrast, our MEP model demonstrated higher predictive effectiveness for LWAC strength, with similar high accuracy (R2 = 0.98–0.99) with enhanced trend catching.
In addition to LWC and LWHSC, several researchers have used different machine learning methods to forecast the mechanical characteristics of other kinds of concrete, like rubberized concrete and high-strength concrete (HSC). Here is a summary of these studies for comparison as well.
Using inputs including cement, aggregate ratio, water, and superplasticizer, Farooq et al.50 used supervised machine learning (ML) techniques, such as Random Forest (RF) and Gene Expression Programming (GEP), to estimate the compressive strength of high-strength concrete (HSC). While GEP also generated good predictions with empirical relations between real and estimated values, the RF model demonstrated strong performance with R2 = 0.96. Their models showed good accuracy, but they were not as predictive as the MEP and RF models created in this study. Similarly, David Sinkhonde and Destine Mashava et al.161 estimated the compressive strength of rubberized concrete incorporating brick powder using the ANN model with 6 input parameters. The performance of the ANN model was found to be significantly poorer with R2 value of 0.83 in comparison to the performance achieved in the present study. Senthil Vadivel et al.162 utilized XG boost, CatBoost, Extra trees regressor, and Bagging regressor algorithms for predicting the 28 days compressive, split tensile, and flexural strength of rubberized concrete in replacement of fine aggregate with fine rubber (FR), and Coarse Aggregate with Coarse Rubber (CR). From ML models of 5 input parameters, the best R2 value of 0.98 was achieved. Furthermore, Musa Adamu et al.163 employed the ANN algorithm to forecast the CS, STS, FS, and E properties of rubberized concrete incorporating fly ash and nano-silica, and achieved an R2 value of 0.99. Additionally, Jingkui Zhang et al.164 developed ELM based predictive model using 6 input parameters. The performance of the ELM model was found to be poorer in comparison to the performance achieved in the present study. Furthermore, the MEP model in this study outperformed all models in the literature in predicting the compressive, flexural and split tensile strength of LWHSC with more input variables. More specifically, the MEP model exhibited better statistical error values (MAE, RMSE, and MSE) as compared to previous literature ML algorithms. Hence, it can be concluded that MEP and RF machine learning algorithms can be used with confidence for the prediction of various mechanical properties of LWHSC.
Analysis of the influence of input parameters on compressive strength
To understand how various mixture components, affect concrete strength, SHAP (SHapley Additive exPlanations) study is crucial. It provides a thorough understanding of the average contribution, variability, and direction of influence of each input parameter, enabling enhanced mix design optimization to improve mechanical performance92,168. In this work, average SHAP values were used to assess the importance of input variables in predicting compressive strength (CS).
The mean SHAP values for features obtained from the MEP and RF models are displayed in Fig. 29. The parameters are the W/B ratio, LWA, NFA, Cement, SF, FA, BF and Slag in decreasing order of importance. While LWA and NFA also show moderate effects (≈ 4.3 and above), the W/B ratio has the greatest mean SHAP value (≈ 8), suggesting a major influence on model predictions. Due to inadequate data in the dataset, BF and Slag contribute very little, while Cement, SF, and FA have smaller contributions (1–2.5).
The SHAP summary plot (Fig. 30) further demonstrates how each feature influences predicted CS. Each dot represents a feature’s SHAP value for a data point, with color denoting magnitude (red = high, blue = low). Dots on the right indicate an increase in CS, while those on the left indicate a decrease. The W/B ratio exhibits a strong positive relationship, confirming its major role in strength prediction; LWA typically exerts a smaller influence, whereas NFA has a mostly neutral but occasionally variable effect; Cement, SF, and FA show both positive and negative effects, depending on their proportions; BF tends to increase strength, while Slag shows negative SHAP values, suggesting a possible decrease in CS at higher contents.
Analysis of the influence of input parameters on tensile strength
Figure 31 illustrates the mean SHAP values for various mix design parameters affecting the tensile strength (TS) of lightweight high strength concrete. The results, derived from the MEP and RF models, reveal that cement has the highest mean SHAP value, confirming its dominant role in determining TS. This aligns with established knowledge of cement’s critical contribution to strength development. The W/B ratio ranks second, emphasizing its substantial effect on the mechanical behavior of the mixture. FA also demonstrates a considerable positive influence, though less pronounced than cement and W/B. LWA and SF show moderate impacts, while NFA, BF, and Slag exhibit minimal mean SHAP values, suggesting relatively limited but non-negligible contributions to the model’s TS predictions.
The SHAP bee swarm plot (Fig. 32) further explains how input parameters influence TS predictions. Cement displays a wide spread of both positive and negative SHAP values, indicating strong variability in its impact across samples. The W/B ratio and FA show mixed effects centered near the zero axis, reflecting their context-dependent influence on tensile strength. LWA also demonstrates both positive and negative SHAP distributions, implying that its effect varies depending on the combination of other mix parameters. In contrast, SF shows predominantly negative SHAP values, suggesting a reduction in tensile strength beyond optimal content levels consistent with experimental observations. NFA, BF, and Slag cluster closely around zero, indicating largely neutral or minor effects on TS overall.
Analysis of the influence of input parameters on flexural strength
Similar to compressive strength, SHAP analysis was employed to interpret the influence of each input parameter on the flexural strength (FS) of lightweight high strength concrete. Figure 33 presents the mean SHAP values, showing the average contribution of each variable to the model output. Among all parameters, LWA exhibits the highest mean SHAP value, confirming its dominant influence on FS prediction. The W/B ratio ranks second, exerting a significant yet smaller effect. NFA and Cement have moderate impacts, whereas SF, FA, and Slag contribute minimally, indicating limited average influence on flexural strength estimation.
The SHAP bee swarm plot (Fig. 34) further illustrates the relative effects of input parameters on FS. Positive SHAP values for LWA, W/B ratio, and NFA indicate that increasing these parameters enhances flexural strength, with the W/B ratio and NFA showing consistently favorable impacts across the dataset. In contrast, SF exhibits predominantly negative SHAP values, suggesting that higher SF content tends to reduce FS. FA, Slag, and BF show near-zero SHAP distributions, implying generally neutral effects, though some BF data points display positive contributions under specific conditions. Cement shows a mild positive trend, reinforcing its known but limited role in improving strength in LWA-based concrete.
Conclusion
This study developed machine learning models based on multi-expression programming (MEP) and random forest (RF) to predict the compressive strength (CS), split tensile strength (TS), and flexural strength (FS) of lightweight high-strength concrete (LWHSC) incorporating supplementary cementitious materials (SCMs). The following conclusions can be drawn:
-
Performance of model Both MEP and RF models successfully predicted the mechanical properties of LWHSC, with MEP consistently outperforming RF. The MEP model achieved very high correlation coefficients (R2 = 0.98–0.99 for CS, = 0.99 for TS, and 0.94–0.96 for FS), whereas RF showed comparatively lower but still acceptable performance (R2 = 0.88–0.91 for CS, 0.80–0.85 for TS, and 0.92–0.94 for FS). These findings suggest that MEP’s symbolic regression feature helps it better represent the intricate nonlinear relationships between SCM content and strength development in LWHSC.
-
Statistical evaluation Statistical indicators (MSE, MAE, RMSE) evaluation confirmed the superiority of MEP, which produced much lower errors) across all strength parameters compared to RF. While RF occasionally showed reasonable accuracy, its error margins were several times higher than those of MEP, reinforcing the reliability of MEP in predicting concrete strength.
-
Error evaluation Error analysis demonstrated that most prediction deviations remained below 3 MPa for CS, 0.5 MPa for TS, and 2 MPa for FS in both training and testing phases, underscoring the robustness and reliability of the developed models.
-
Comparative validation Taylor diagram comparisons confirmed that MEP predictions closely matched experimental variance and distribution, establishing it as the more accurate and dependable approach compared to RF. In contrast to earlier research employing GEP, ANN, SVM, DT, KNN, PNN, MLP, GPR or hybrid tree-based models, the current MEP and RF techniques especially MEP provided equivalent or superior accuracy.
-
Engineering and practical implications MEP’s exceptional performance demonstrates its capacity to comprehend LWHSC material behavior while retaining high prediction precision, which is especially useful for optimizing sustainable design. By measuring the impact of SCM combinations on mechanical performance, the developed RF and especially MEP models can assist mix proportioning decisions and minimize cement use and experimental effort.
-
SHAP Analysis The SHAP analysis offered insightful information about how various mixture components affect LWHSC mechanical performance. The most important factors influencing strength development were found to be cement, LWA, and the W/B ratio. Slag and silica fume (SF) contributed relatively little, but fly ash (FA) had a substantial impact, especially on tensile strength. Notably, under some circumstances, basalt fiber (BF) showed a favorable impact on strength.
This study is novel in its comprehensive application of MEP and RF to model LWHSC incorporating SCMs, offering a systematic alternative to time-consuming and resource-intensive laboratory testing. The predictive framework developed herein can be directly applied in practice to optimize mix design, contributing to the development of sustainable, high-performance concretes that align with global efforts to reduce carbon emissions in the construction industry.
Overall, this study shows that MEP offers a reliable, comprehensible, and effective framework for modeling LWHSC properties utilizing SCMs. Beyond its prediction accuracy, the suggested method advances sustainable concrete technology by supporting carbon reduction objectives in the construction sector, enabling data-driven mix design, and reducing the need for intensive laboratory testing. For more thorough sustainability evaluations, future studies may expand this framework to include durability characteristics and hybrid ML–mechanistic models.
Study limitations and potential future research directions
The current work developed and evaluated machine learning models (MEP and RF) for predicting CS, TS, and FS of LWHSC with cementitious materials. While the results indicate that both models have high predictive potential, several limitations should be considered.
-
Dataset size and variability The dataset was compiled from previously published studies, resulting in variances in experimental circumstances. Although sufficient for model training, adding bigger and more diverse experimental records, particularly from controlled laboratory testing, would improve the models’ reliability and generalizability.
-
Input variables Only eight input variables were considered W/B, cement, FA, BS, SF, NFA, LWA, and BF. Other elements that may influence mechanical performance include the curing regime, admixtures, temperature, and humidity, which should be investigated in future studies.
-
Modeling approach The study used standalone models as its modeling approach. Future research could include hybrid and ensemble methods, such as Genetic Algorithm Particle Swarm Optimization (GA-PSO) and deep learning frameworks, which have the potential to improve predicted accuracy and stability.
-
Interpretability Although the present study demonstrated strong predictive reliability through multiple statistical indicators and comparative analyses, future work should employ k-fold cross-validation or repeated resampling techniques to further verify model robustness and ensure generalizability across diverse datasets and experimental conditions.
-
Sustainability assessment To better illustrate the sustainability effects of the optimized mix designs, future studies should include thorough analyses of CO2 savings, embodied-carbon reductions, and cement-replacement benefits using life-cycle assessment or standardized emission factors.
-
Research scope This research focuses on mechanical properties. Extending the modeling framework to include LWHSC’s durability, long-term behavior, and microstructural performance would provide a more complete knowledge of its usefulness in sustainable construction.
Data availability
All data generated or analysed during this study are included in this published article [and its supplementary information files].
Abbreviations
- LWHSC:
-
Light weight high-strength concrete
- NWC:
-
Normal weight concrete
- LWA:
-
Light weight aggregates
- LWC:
-
Light weight concrete
- HSC:
-
High strength concrete
- MEP:
-
Multi-expression programming
- RF:
-
Random forest
- GP:
-
Genetic programming
- SCMs:
-
Supplemental cementitious materials
- CS:
-
Compressive strength
- TS:
-
Tensile strength
- FS:
-
Flexural strength
- ML:
-
Machine learning
- MGGP:
-
Multi-gene genetic programming
- NFA:
-
Natural fine aggregate
- MAE:
-
Mean absolute error
- MSE:
-
Mean squared error
- RMSE:
-
Root mean squared error
- PDP:
-
Partial dependence plots
- SHAP:
-
SHapley additive explanations
References
Tayebani, B., Said, A. & Memari, A. Less carbon producing sustainable concrete from environmental and performance perspectives: A review. Constr. Build. Mater. 404, 133234 (2023).
Fořt, J. et al. A review of the role of lightweight aggregates in the development of mechanical strength of concrete. J. Build. Eng. 2024, 109312 (2024).
Zeyad, A. M., Amin, M. & Agwa, I. S. Effect of air entraining and pumice on properties of ultra-high performance lightweight concrete. Archives Civil Mech. Eng. 24(1), 11 (2023).
Sadek, M. M., Ismail, M. K. & Hassan, A. A. Stability of lightweight self-consolidating concrete containing coarse and fine expanded slate aggregates. ACI Mater. J. 117(3), 133–143 (2020).
Bogas, J. A., de Brito, J. & Figueiredo, J. M. Mechanical characterization of concrete produced with recycled lightweight expanded clay aggregate concrete. J. Clean. Prod. 89, 187–195 (2015).
Alhnifat, R. S. & Al-Dala’ien, R. N. Behavior of lightweight concrete incorporating Pozzolana aggregate and expanded polystyrene beads. Eng. Sci. 25, 934 (2023).
Benjeddou, O., Ravindran, G. & Abdelzaher, M. A. Thermal and acoustic features of lightweight concrete based on marble wastes and expanded perlite aggregate. Buildings 13(4), 992 (2023).
Arunachalam, K. P. et al. Experimental study on the mechanical properties and microstructures of cenosphere concrete. Materials 16(9), 3518 (2023).
Sifan, M. et al. Development and prospectives of lightweight high strength concrete using lightweight aggregates. Constr. Build. Mater. 362, 129628 (2023).
Kumar, R. & Srivastava, A. Influence of lightweight aggregates and supplementary cementitious materials on the properties of lightweight aggregate concretes. Iran. J. Sci. Technol. Trans. Civil Eng. 47(2), 663–689 (2023).
Cho, S. & Kim, M. O. Effect of aggregate type on the shear behavior of reinforced lightweight concrete beams. Appl. Sci. 14(14), 5992 (2024).
Yang, Y., Lin, B. & Zhang, W. Experimental and numerical investigation of an arch–beam joint for an arch Bridge. Archives Civil Mech. Eng. 23(2), 101 (2023).
Huang, H. et al. Experimental study on Cyclic performance of steel-hollow core partially encased composite spliced frame beam. Soil Dyn. Earthq. Eng. 163, 107499 (2022).
Lu, J. X. Recent advances in high strength lightweight concrete: from development strategies to practical applications. Constr. Build. Mater. 400, 132905 (2023).
Chandra, S. & Berntsson, L. Lightweight Aggregate Concrete(Elsevier, 2002).
Guide for structural lightweight-aggregate concrete (American Concrete Institute, 2014).
Institution, B. S. Eurocode 2: Design of Concrete Structures: Part 1–1: General Rules and Rules for Buildings(British Standards Institution, 2004).
Xiong, G. et al. Preparation of high strength lightweight aggregate concrete with the vibration mixing process. Constr. Build. Mater. 229, 116936 (2019).
Liu, H. et al. High strength flowable lightweight concrete incorporating low C3A cement, silica fume, stalite and macro-polyfelin polymer fibres. Constr. Build. Mater. 281, 122410 (2021).
Huang, H. et al. Experimental study of predamaged columns strengthened by HPFL and BSP under combined load cases. Struct. Infrastruct. Eng. 17(9), 1210–1227 (2021).
Sakir, S. et al. Utilization of by-products and wastes as supplementary cementitious materials in structural mortar for sustainable construction. Sustainability 12(9), 3888 (2020).
Abd-Elrahman, M. H. et al. Effect of utilizing peanut husk Ash on the properties of ultra-high strength concrete. Constr. Build. Mater. 384, 131398 (2023).
Ghanim, A. A. J. et al. Effect of modified nano-titanium and fly Ash on ultra‐high‐performance concrete properties. Struct. Concrete. 24(5), 6815–6832 (2023).
Hakeem, I. Y. et al. Using a combination of industrial and agricultural wastes to manufacture sustainable ultra-high-performance concrete. Case Stud. Constr. Mater. 19, e02323 (2023).
He, H. et al. Deciphering size-induced influence of carbon Dots on mechanical performance of cement composites. Constr. Build. Mater. 425, 136030 (2024).
Shan, H. et al. Effect of carbon Dots with different sizes on chloride binding of cement. Constr. Build. Mater. 425, 136103 (2024).
Wilson, H. & Malhotra, V. Development of high strength lightweight concrete for structural applications. Int. J. Cem. Compos. Lightweight Concrete. 10(2), 79–90 (1988).
Chia, K. S. & Zhang, M. H. Water permeability and chloride penetrability of high-strength lightweight aggregate concrete. Cem. Concr. Res. 32(4), 639–645 (2002).
Mo, K. H., Alengaram, U. J. & Jumaat, M. Z. Bond properties of lightweight concrete–a review. Constr. Build. Mater. 112, 478–496 (2016).
Kılıç, A. et al. High-strength lightweight concrete made with scoria aggregate containing mineral admixtures. Cem. Concr. Res. 33(10), 1595–1599 (2003).
Moffatt, E. & Thomas, M. Performance of 25-year-old silica fume and fly Ash lightweight concrete blocks in a harsh marine environment. Cem. Concr. Res. 113, 65–73 (2018).
Thomas, M. & Bremner, T. Performance of lightweight aggregate concrete containing slag after 25 years in a harsh marine environment. Cem. Concr. Res. 42(2), 358–364 (2012).
Youm, K. S. et al. Experimental study on strength and durability of lightweight aggregate concrete containing silica fume. Constr. Build. Mater. 114, 517–527 (2016).
Nadesan, M. S. & Dinakar, P. Influence of type of binder on high-performance sintered fly Ash lightweight aggregate concrete. Constr. Build. Mater. 176, 665–675 (2018).
Jin, Z. et al. Internal superhydrophobic marine concrete: interface modification based on slag microstructure regulation. J. Building Eng. 86, 108769 (2024).
Cheng, S. et al. Effects of seawater and supplementary cementitious materials on the durability and microstructure of lightweight aggregate concrete. Constr. Build. Mater. 190, 1081–1090 (2018).
Chen, B. & Liu, J. Experimental application of mineral admixtures in lightweight concrete with high strength and workability. Constr. Build. Mater. 22(6), 1108–1113 (2008).
Atmaca, N., Abbas, M. L. & Atmaca, A. Effects of nano-silica on the gas permeability, durability and mechanical properties of high-strength lightweight concrete. Constr. Build. Mater. 147, 17–26 (2017).
Wei, J. et al. Seismic performance of concrete-filled steel tubular composite columns with ultra high performance concrete plates. Eng. Struct. 278, 115500 (2023).
Manan, A. et al. Machine learning prediction of recycled concrete powder with experimental validation and life cycle assessment study. Case Stud. Constr. Mater. 21, e04053 (2024).
Manan, A. et al. Multi-targeted strength properties of recycled aggregate concrete through a machine learning approach. Eng. Comput. 42(1), 388–430 (2025).
Manan, A. et al. Machine learning prediction model integrating experimental study for compressive strength of carbon-nanotubes composites. J. Eng. Res. (2024).
Manan, A. et al. Sustainable optimization of concrete strength properties using artificial neural networks: a focus on mechanical performance. Mater. Res. Express. 12(2), 025504 (2025).
Manan, A. et al. Environmental and human health impact of recycle concrete powder: an emergy-based LCA approach. Front. Environ. Sci. 12, 1505312 (2025).
Chaabene, W. B., Flah, M. & Nehdi, M. L. Machine learning prediction of mechanical properties of concrete: critical review. Constr. Build. Mater. 260, 119889 (2020).
Chen, L. et al. Development of predictive models for sustainable concrete via genetic programming-based algorithms. J. Mater. Res. Technol. 24, 6391–6410 (2023).
Inqiad, W. B. et al. Predicting 28-day compressive strength of fibre-reinforced self-compacting concrete (FR-SCC) using MEP and GEP. Sci. Rep. 14(1), 17293 (2024).
Khan, M. et al. Computational prediction of workability and mechanical properties of bentonite plastic concrete using multi-expression programming. Sci. Rep. 14(1), 6105 (2024).
Jalal, F. E. et al. Predicting the compaction characteristics of expansive soils using two genetic programming-based algorithms. Transp. Geotechnics. 30, 100608 (2021).
Farooq, F. et al. A comparative study of random forest and genetic engineering programming for the prediction of compressive strength of high strength concrete (HSC). Appl. Sci. 10(20), 7330 (2020).
Yang, H., Li, H. & Jiang, J. Predictive modeling of compressive strength of geopolymer concrete before and after high temperature applying machine learning algorithms. Struct. Concrete. 26(2), 1699–1732 (2025).
Manan, A. et al. Prediction of flexural strength in FRP bar reinforced concrete beams through a machine learning approach. Anti-Corros. Methods Mater. 71(5), 562–579 (2024).
Manan, A. et al. AI-based constitutive model simulator for predicting the axial load-deflection behavior of recycled concrete powder and steel fiber reinforced concrete column. Constr. Build. Mater. 470, 140628 (2025).
Manan, A. et al. Utilizing waste materials in concrete: a review on mechanical and sustainable performance. Green Mater. 1–18 (2025).
Yousif, A. A. & Mohamed, I. Prediction of compaction parameters from soil index properties case study: dam complex of upper Atbara project. Am. J. Pure Appl. Sci. 4(1), 01–09 (2022).
Oltean, M. & Grosan, C. A comparison of several linear genetic programming techniques. Complex. Syst. 14(4), 285–314 (2003).
Fallahpour, A., Olugu, E. U. & Musa, S. N. A hybrid model for supplier selection: integration of AHP and multi expression programming (MEP). Neural Comput. Appl. 28, 499–504 (2017).
Koza, J. R. P. R, Genetic programming. In Search Methodology 127–164 (Springer US), 1994).
Oltean, M. & Groşan, C. Evolving evolutionary algorithms using multi expression programming. in European conference on artificial life(Springer, 2003).
Gandomi, A. H. et al. New design equations for elastic modulus of concrete using multi expression programming. J. Civil Eng. Manage. 21(6), 761–774 (2015).
Sharifi, S., Abrishami, S. & Gandomi, A. H. Consolidation assessment using multi expression programming. Appl. Soft Comput. 86, 105842 (2020).
Mohammadi Bayazidi, A. et al. Multigene genetic programming for Estimation of elastic modulus of concrete. Math. Probl. Eng. 2014(1), 474289 (2014).
Khan, M. et al. Forecasting the strength of graphene nanoparticles-reinforced cementitious composites using ensemble learning algorithms. Results Eng. 21, 101837 (2024).
Mohammadzadeh, S. Deriving an intelligent model for soil compression index utilizing multi-gene genetic programming. Environ. Earth Sci. 75, 1–11 (2016).
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Alyami, M. et al. Application of metaheuristic optimization algorithms in predicting the compressive strength of 3D-printed fiber-reinforced concrete. Developments Built Environ. 17, 100307 (2024).
Svetnik, V. et al. Random forest: a classification and regression tool for compound classification and QSAR modeling. J. Chem. Inf. Comput. Sci. 43(6), 1947–1958 (2003).
Patel, J. et al. Predicting stock market index using fusion of machine learning techniques. Expert Syst. Appl. 42(4), 2162–2172 (2015).
Jiang, H. et al. Joint analysis of two microarray gene-expression data sets to select lung adenocarcinoma marker genes. BMC Bioinform. 5, 1–12 (2004).
Prasad, A. M., Iverson, L. R. & Liaw, A. Newer classification and regression tree techniques: bagging and random forests for ecological prediction. Ecosystems 9, 181–199 (2006).
Cutler, A., Cutler, D. R. & Stevens, J. R. Random forests. Ensemble machine learning: Methods and applications 157–175 (2012).
Nazar, S. et al. Formulation of Estimation models for the compressive strength of concrete mixed with Nanosilica and carbon nanotubes. Developments Built Environ. 13, 100113 (2023).
Yang, D. et al. Compressive strength prediction of concrete blended with carbon nanotubes using gene expression programming and random forest: hyper-tuning and optimization. J. Mater. Res. Technol. 24, 7198–7218 (2023).
Nafees, A. et al. Forecasting the mechanical properties of plastic concrete employing experimental data using machine learning algorithms: DT, MLPNN, SVM, and RF. Polymers 14(8), 1583 (2022).
Iftikhar, B. et al. Predictive modeling of compressive strength of sustainable rice husk Ash concrete: ensemble learner optimization and comparison. J. Clean. Prod. 348, 131285 (2022).
Shah, M. I. et al. Machine learning modeling integrating experimental analysis for predicting the properties of sugarcane Bagasse Ash concrete. Constr. Build. Mater. 314, 125634 (2022).
Chen, T. T., Wang, W. C. & Wang, H. Y. Mechanical properties and ultrasonic velocity of lightweight aggregate concrete containing mineral powder materials. Constr. Build. Mater. 258, 119550 (2020).
Evangelista, A. C. J. & Tam, V. W. Properties of high-strength lightweight concrete using manufactured aggregate. Proc. Inst. Civ. Eng. 173(4), 157–169 (2020).
Hong, X. et al. Effect of graphene oxide on the mechanical properties and durability of High-Strength lightweight concrete containing shale ceramsite. Materials 16(7), 2756 (2023).
Saradar, A. et al. Prediction of mechanical properties of lightweight basalt fiber reinforced concrete containing silica fume and fly ash: experimental and numerical assessment. J. Building Eng. 32, 101732 (2020).
Wu, T. et al. Mechanical properties and microstructure of lightweight aggregate concrete with and without fibers. Constr. Build. Mater. 199, 526–539 (2019).
Yang, J. et al. Effects of aggregate types and softening effect on multiaxial mechanical properties of lightweight aggregate concrete. Constr. Build. Mater. 353, 128832 (2022).
Zeng, Y. et al. Mechanical properties of chopped basalt fiber-reinforced lightweight aggregate concrete and chopped polyacrylonitrile fiber reinforced lightweight aggregate concrete. Materials 13(7), 1715 (2020).
Zhao, H. et al. Effects of porous shale waste brick lightweight aggregate on mechanical properties and autogenous deformation of early-age concrete. Constr. Build. Mater. 261, 120450 (2020).
Floyd, R. W., Hale, W. M. & Bymaster, J. C. Effect of aggregate and cementitious material on properties of lightweight self-consolidating concrete for prestressed members. Constr. Build. Mater. 85, 91–99 (2015).
Hassan, A. A., Ismail, M. K. & Mayo, J. Mechanical properties of self-consolidating concrete containing lightweight recycled aggregate in different mixture compositions. J. Building Eng. 4, 113–126 (2015).
Abid, M. et al. Mechanical properties, microstructure and GEP-based modeling of basalt fiber reinforced lightweight high-strength concrete containing SCMs. J. Building Eng. 96, 110378 (2024).
Abadel, A. A. Physical, mechanical, and microstructure characteristics of ultra-high-performance concrete containing lightweight aggregates. Materials 16(13), 4883 (2023).
Chindaprasirt, P. et al. Mechanical and thermal properties of recycling lightweight pervious concrete. Arab. J. Sci. Eng. 40, 443–450 (2015).
Iqbal, S. et al. Mechanical properties of steel fiber reinforced high strength lightweight self-compacting concrete (SHLSCC). Constr. Build. Mater. 98, 325–333 (2015).
Iqbal, S. et al. Effect of fly Ash on properties of self-compacting high strength lightweight concrete. Periodica Polytech. Civil Eng. 61(1), 81–87 (2017).
Katkhuda, H. & Hanayneh, B. Influence of Silica Fume on High Strength Lighweight Concrete.(2009).
Kayali, O., Haque, M. & Zhu, B. Some characteristics of high strength fiber reinforced lightweight aggregate concrete. Cem. Concr. Compos. 25(2), 207–213 (2003).
Liu, X. et al. Properties of self-compacting lightweight concrete reinforced with steel and polypropylene fibers. Constr. Build. Mater. 226, 388–398 (2019).
Sajedi, F. & Shafigh, P. High-strength lightweight concrete using leca, silica fume, and limestone. Arab. J. Sci. Eng. 37, 1885–1893 (2012).
Tanyildizi, H. & Çevik, A. Modeling mechanical performance of lightweight concrete containing silica fume exposed to high temperature using genetic programming. Constr. Build. Mater. 24(12), 2612–2618 (2010).
Babu, K. G. & Babu, D. S. Behaviour of lightweight expanded polystyrene concrete containing silica fume. Cem. Concr. Res. 33(5), 755–762 (2003).
Li, Z. et al. Polymorphisms of CHRNA5-CHRNA3-CHRNB4 gene cluster and NSCLC risk in Chinese population. Translational Oncol. 5(6), 448–452 (2012).
Khan, M. A. et al. Simulation of depth of wear of eco-friendly concrete using machine learning based computational approaches. Materials 15(1), 58 (2021).
Khan, S. et al. Predicting the ultimate axial capacity of uniaxially loaded Cfst columns using multiphysics artificial intelligence. Materials 15(1), 39 (2021).
Jadhav, S., Tande, S. & Dubal, A. Beneficial reuse of waste foundry sand in concrete. Int. J. Sci. Res. Publ. 7(3), 74–95 (2017).
Pathariya Saraswati, C. et al. Application of waste foundry sand for evolution of low-cost concrete. Int. J. Eng. Trends Technol. 4(10), 4281–4286 (2013).
Khatib, J. et al. Foundry sand utilisation in concrete production. in Second International conference on sustainable construction materials and technologies(Università Politecnica delle Marche Home Ancona, 2010).
de Barros Martins, M. A. et al. Study on waste foundry exhaust sand, WFES, as a partial substitute of fine aggregates in conventional concrete. Sustainable Cities Soc. 45, 187–196 (2019).
Nithya, M. et al. Properties of concrete containing waste foundry sand for partial replacement of fine aggregate in concrete. Indian J. Eng. Mater. Sci. 24(2), 162–166 (2017).
Torres, A., Bartlett, L. & Pilgrim, C. Effect of foundry waste on the mechanical properties of Portland cement concrete. Constr. Build. Mater. 135, 674–681 (2017).
Gurumoorthy, N. & Arunachalam, K. Durability studies on concrete containing treated used foundry sand. Constr. Build. Mater. 201, 651–661 (2019).
Sowmya, M. & Kumar, J. C. Mixing of waste foundry sand in concrete. Int. J. Eng. Res. Sci. Technol. 4(4), 322–335 (2015).
Prabhu, G. G., Hyun, J. H. & Kim, Y. Y. Effects of foundry sand as a fine aggregate in concrete production. Constr. Build. Mater. 70, 514–521 (2014).
Siddique, R., De Schutter, G. & Noumowe, A. Effect of used-foundry sand on the mechanical properties of concrete. Constr. Build. Mater. 23(2), 976–980 (2009).
Naik, T. R. et al. Utilization of used foundry sand in concrete. J. Mater. Civ. Eng. 6(2), 254–263 (1994).
Monosi, S. et al. Effect of two different sources and washing treatment on the properties of UFS by-products for mortar and concrete production. Constr. Build. Mater. 44, 260–266 (2013).
Singh, G. & Siddique, R. G. Strength and durability studies of concrete containing waste foundry sand(2013).
Gurumoorthy, N. & Arunachalam, K. Micro and mechanical behaviour of treated used foundry sand concrete. Constr. Build. Mater. 123, 184–190 (2016).
Kumar, A. & Rani, D. Performance of concrete using paper sludge Ash and foundry sand. Int. J. Innov. Res. Sci. Eng. Technol. 5(9), 171–176 (2016).
Salokhe, E. & Desai, D. Application of foundry waste sand in manufacture of concrete. IOSR J. Mech. Civ. Eng. 1684–2278 (2014).
Siddique, R. & Kadri, E. H. Effect of Metakaolin and foundry sand on the near surface characteristics of concrete. Constr. Build. Mater. 25(8), 3257–3266 (2011).
Khatib, J., Herki, B. & Kenai, S. Capillarity of concrete incorporating waste foundry sand. Constr. Build. Mater. 47, 867–871 (2013).
Kaur, G., Siddique, R. & Rajor, A. Micro-structural and metal leachate analysis of concrete made with fungal treated waste foundry sand. Constr. Build. Mater. 38, 94–100 (2013).
Smith, G. N. Probability and Statistics in Civil Engineering 244 (Collins professional and technical books, 1986).
Pyo, J. et al. Estimation of heavy metals using deep neural network with visible and infrared spectroscopy of soil. Sci. Total Environ. 741, 140162 (2020).
Azim, I. et al. Prediction model for compressive arch action capacity of RC frame structures under column removal scenario using gene expression programming. in Structures(Elsevier, 2020).
Pengcheng, L. et al. Prediction of compressive strength of high-performance concrete by random forest algorithm. in IOP conference series: earth and environmental science(IOP Publishing, 2020).
Rajakarunakaran, S. A. et al. Prediction of strength and analysis in self-compacting concrete using machine learning based regression techniques. Adv. Eng. Softw. 173, 103267 (2022).
Roy, T. et al. Prediction of mechanical properties of eco-friendly concrete using machine learning algorithms and partial dependence plot analysis. Smart Constr. Sustainable Cities. 3(1), 2 (2025).
Mohamed, O. A., Ati, M. & Najm, O. F. Predicting compressive strength of sustainable self-consolidating concrete using random forest. Key Eng. Mater. 744, 141–145 (2017).
Kisi, O., Shiri, J. & Tombul, M. Modeling rainfall-runoff Process Using Soft Computing Techniques Vol. 51, 108–117 (Computers & Geosciences, 2013).
Nguyen, H. et al. Efficient machine learning models for prediction of concrete strengths. Constr. Build. Mater. 266, 120950 (2021).
Zhang, W. et al. State-of-the-art review of soft computing applications in underground excavations. Geosci. Front. 11(4), 1095–1106 (2020).
Alade, I. O., Abd Rahman, M. A. & Saleh, T. A. Modeling and Prediction of the Specific Heat Capacity of Al2 O3/water Nanofluids Using Hybrid Genetic algorithm/support Vector Regression Model Vol. 17, 103–111 (Nano-Structures & Nano-Objects, 2019).
Shahin, M. A. Use of evolutionary computing for modelling some complex problems in geotechnical engineering. Geomech. Geoeng. 10(2), 109–125 (2015).
Lundberg, S. A unified approach to interpreting model predictions. arXiv preprint arXiv:1705.07874 (2017).
Ribeiro, M. T., Singh, S. & Guestrin, C. Why should i trust you? Explaining the predictions of any classifier. in Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (2016).
Bakouregui, A. S. et al. Explainable extreme gradient boosting tree-based prediction of load-carrying capacity of FRP-RC columns. Eng. Struct. 245, 112836 (2021).
Mangalathu, S., Hwang, S. H. & Jeon, J. S. Failure mode and effects analysis of RC members based on machine-learning-based SHapley additive explanations (SHAP) approach. Eng. Struct. 219, 110927 (2020).
Dantas, A. T. A., Leite, M. B. & de Nagahama, K. Prediction of compressive strength of concrete containing construction and demolition waste using artificial neural networks. Constr. Build. Mater. 38, 717–722 (2013).
Eskandari-Naddaf, H. & Kazemi, R. ANN prediction of cement mortar compressive strength, influence of cement strength class. Constr. Build. Mater. 138, 1–11 (2017).
González-Taboada, I. et al. Prediction of the mechanical properties of structural recycled concrete using multivariable regression and genetic programming. Constr. Build. Mater. 106, 480–499 (2016).
Lim, J. C., Karakus, M. & Ozbakkaloglu, T. Evaluation of ultimate conditions of FRP-confined concrete columns using genetic programming. Comput. Struct. 162, 28–37 (2016).
Sadowski, Ł. et al. Hybrid ultrasonic-neural prediction of the compressive strength of environmentally friendly concrete screeds with high volume of waste quartz mineral dust. J. Clean. Prod. 212, 727–740 (2019).
Sebaaly, H., Varma, S. & Maina, J. W. Optimizing asphalt mix design process using artificial neural network and genetic algorithm. Constr. Build. Mater. 168, 660–670 (2018).
Velay-Lizancos, M. et al. Analytical and genetic programming model of compressive strength of eco concretes by NDT according to curing temperature. Constr. Build. Mater. 144, 195–206 (2017).
Xie, Q., Ni, J. & Su, Z. A prediction model of ammonia emission from a fattening pig room based on the indoor concentration using adaptive neuro fuzzy inference system. J. Hazard. Mater. 325, 301–309 (2017).
Xu, Y. et al. Computation of high-performance concrete compressive strength using standalone and ensembled machine learning techniques. Materials 14(22), 7034 (2021).
Khan, M. A. et al. Machine learning predictions of high-strength RCA concrete utilizing chemically activated fly Ash and nano-silica. Sci. Rep. 15(1), 10255 (2025).
Zhang, X. et al. Prediction of compressive strength of recycled aggregate concrete using machine learning and bayesian optimization methods. Front. Earth Sci. 11, 1112105 (2023).
Gholampour, A., Gandomi, A. H. & Ozbakkaloglu, T. New formulations for mechanical properties of recycled aggregate concrete using gene expression programming. Constr. Build. Mater. 130, 122–145 (2017).
Frank, I. E. T. & Te, R. Data Analysis Handbook (1994).
Gandomi, A. H. & Roke, D. A. Assessment of artificial neural network and genetic programming as predictive tools. Adv. Eng. Softw. 88, 63–72 (2015).
Taylor, K. E. Summarizing multiple aspects of model performance in a single diagram. J. Geophys. Research: Atmos. 106(D7), 7183–7192 (2001).
Taylor, K. E. Summarizing multiple aspects of model performance in a single diagram. J. Geophys. Res. Atmos. 106(D7) (2001).
Ibrahim, S. M., Ansari, S. S. & Hasan, S. D. Towards white box modeling of compressive strength of sustainable ternary cement concrete using explainable artificial intelligence (XAI). Appl. Soft Comput. 149, 110997 (2023).
Kumar, R., Rai, B. & Samui, P. A comparative study of prediction of compressive strength of ultra-high performance concrete using soft computing technique. Struct. Concrete. 24(4), 5538–5555 (2023).
Gupta, S. & Sihag, P. Prediction of the compressive strength of concrete using various predictive modeling techniques. Neural Comput. Appl. 34(8), 6535–6545 (2022).
Kioumarsi, M. et al. Compressive strength of concrete containing furnace blast slag; optimized machine learning-based models. Clean. Eng. Technol. 13, 100604 (2023).
Sifan, M. et al. Efficient mix design method for lightweight high strength concrete: A machine learning approach. in Structures (Elsevier, 2023).
Kumar, A. et al. Compressive strength prediction of lightweight concrete: machine learning models. Sustainability 14(4), 2404 (2022).
Nanehkaran, Y. A. et al. Deep learning method for compressive strength prediction for lightweight concrete. Computers Concrete. 32(3), 327–337 (2023).
Alghrairi, N. S. et al. Machine learning-based compressive strength Estimation in nanomaterial-modified lightweight concrete. Open. Eng. 14(1), 20220604 (2024).
Hussain, F. et al. Machine learning-based predictive modeling of sustainable lightweight aggregate concrete. Sustainability 15(1), 641 (2022).
Sinkhonde, D. et al. Predicting the Compressive Strength of Rubberized Concrete Incorporating Brick Powder Based on MLP and RBF Neural Networks (Waste Management Bulletin, 2025).
Vadivel, T. S. et al. Experimental investigation and machine learning prediction of mechanical properties of rubberized concrete for sustainable construction. Sci. Rep. 14(1), 22725 (2024).
Adamu, M. et al. Prediction of mechanical properties of rubberized concrete incorporating fly Ash and nano silica by artificial neural network technique. Axioms 12(1), 81 (2023).
Zhang, J. et al. Prediction of rubber fiber concrete strength using extreme learning machine. Front. Mater. 7, 582635 (2021).
Prayogo, D. et al. Prediction of concrete properties using ensemble machine learning methods. in Journal of Physics: Conference Series (IOP Publishing, 2020).
Gregori, A., Castoro, C. & Venkiteela, G. Predicting the compressive strength of rubberized concrete using artificial intelligence methods. Sustainability 13(14), 7729 (2021).
Sobuz, M. H. R. et al. Optimization of recycled rubber self-compacting concrete: experimental findings and machine learning-based evaluation. Heliyon 10(6) (2024).
Roth, A. E. Introduction To the Shapley Value 1 (The Shapley value, 1988).
Acknowledgements
The authors extend their appreciation to the Deanship of Research and Graduate Studies at King Khalid University for funding this work through large group project under grant number (RGP2/33/46).
Funding
This research work was funded by King Khalid University, Saudi Arabia, through the grant number RGP2/33/46.
Author information
Authors and Affiliations
Contributions
Muhammad Sheraz: Conceptualization, Visualization, Methodology, Validation, Writing - original draft, Writing - review & editing, Project administration. Muhammad Abid: Supervision, Resources, Software, Data curation, Formal analysis. Muhammad Alam: Data curation, Software. Muhammad Faisal Javed: Writing - review & editing. Mohammad Alsharef and Ayed Eid Alluqmani: Visualization and formal analysis Mohammed Jameel and Furqan Ahmad: Funding, Data curation, Data curation.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Sheraz, M., Abid, M., Alam, M. et al. Modeling the mechanical properties of lightweight high-strength concrete incorporating supplementary cementitious materials using multi-expression programming and random forest. Sci Rep 16, 3678 (2026). https://doi.org/10.1038/s41598-025-30161-8
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-30161-8


































