Abstract
Roller compacted concrete (RCC) has gained prominence in the construction industry due to its durability, cost-effectiveness, and environmental benefits, particularly with the incorporation of high-volume fly ash (HVFA). However, traditional experimental approaches to evaluating RCC’s mechanical properties, such as compressive strength (CS) and splitting tensile strength (STS), are resource-intensive and time-consuming. To address these challenges, this study explores the application of artificial intelligence (AI), specifically artificial neural networks (ANN) and a hybrid ANN-Biogeography-Based Optimization (ANN-BBO) model, to predict the CS and STS of RCC. A dataset comprising 168 RCC mixtures, incorporating various material and process parameters, was analyzed. The ANN-BBO model demonstrated superior predictive accuracy compared to a standalone ANN, with R2 values exceeding 0.98 for both CS and STS, significantly reducing error margins. The findings highlight the effectiveness of AI-driven modeling in optimizing RCC mix designs, minimizing experimental costs, and enhancing the sustainability of concrete production. This research underscores the potential of integrating AI with optimization techniques to refine RCC performance assessment, which enables and facilitates more efficient and sustainable infrastructure development.
Similar content being viewed by others
Introduction
Roller compacted concrete (RCC) has emerged as a significant type of concrete in the construction industry due to its high durability and cost-effectiveness compared to conventional concrete1,2. RCC presents a multitude of benefits that significantly enhance both its production methodologies and the various applications to which it can be subjected. Among these numerous advantages are the notable elimination of the necessity for steel reinforcement, which not only simplifies the construction process but also leads to reduced material expenditures when compared to conventional concrete options, as well as an exceptional resistance to the extreme fluctuations in temperature associated with both scorching hot and frigid cold weather conditions, alongside an impressive capacity for rapid production. Furthermore, the overall production process associated with RCC is remarkably swifter and more efficient than that of conventional concrete, thereby offering a considerable competitive edge, particularly in the context of large-scale construction projects that demand timely and effective execution3,4. In summary, the unique characteristics and properties of RCC render it an increasingly appealing choice for engineers and contractors seeking innovative solutions in the field of modern construction practices.
The mechanical properties of RCC, particularly compressive strength and splitting tensile strength, are critical indicators of its performance in structural applications. These properties influence the design and safety of structures while also playing a crucial role in overall service life and maintenance costs5. In addition to the strength performance of concrete, the carbon footprint is recognized as a key parameter6,7. Particularly following the signing of the Paris Climate Agreement in 2015, issues such as the use of alternative materials to replace those that generate significant greenhouse gas emissions during production, as well as energy efficiency, have gained prominence8.
Environmental sustainability has emerged as a major global priority in today’s world. In this context, the construction industry plays a crucial role in developing innovative solutions aimed at reducing waste generation and enhancing the performance of materials used in infrastructure development. These advancements seek to minimize the carbon footprint of construction activities while also promoting the adoption of environmentally friendly practices that contribute to a more sustainable future.
In this regard, the use of pozzolans as a partial replacement for cement has been shown to reduce the carbon footprint of concrete. Fly ash, a byproduct of coal combustion, is recognized as a valuable supplementary material that enhances both the durability and strength of concrete while simultaneously mitigating its environmental impact. Incorporating fly ash into concrete mixtures not only improves the structural performance of concrete but also encourages the recycling of industrial waste. This approach aligns with circular economy principles and supports a more efficient resource management strategy9,10,11,12. However, in concretes with high fly ash content, potential effects on workability and setting times must be carefully considered. This necessitates appropriate adjustments in mix design to achieve optimal performance without compromising quality13,14.
A laboratory-based approach can provide definitive results when evaluating the compressive strength (CS) or splitting tensile strength (STS) of high-volume fly ash roller compacted concrete (HVFA-RCC)15,16. However, this approach may present various challenges. These challenges include time and natural resource consumption, the need for retesting due to varying material properties, the limited number of prepared and tested mixtures, and the necessity for extensive data analysis to derive meaningful conclusions17,18. For these reasons, manually conducting trials optimize RCC mixtures for specific properties (e.g., strength, durability) is labor-intensive and lacks the responsiveness of data-driven methods like AI-based models.
To overcome these challenges, innovative approaches that enhance testing processes, improve material efficiency, and promote sustainability in the construction industry are required. Advanced modeling and simulations enable predictive analyses that refine mix design, reduce the need for physical testing, and optimize performance. Machine learning and deep learning techniques are rapidly emerging as powerful tools capable of analyzing complex datasets, identifying patterns, and predicting the behavior of various material combinations under diverse conditions, thereby optimizing the development process. These technologies not only improve prediction accuracy but also support a more sustainable approach by minimizing waste and resource consumption throughout the construction lifecycle. In these methodologies, various algorithms are employed to evaluate the impact of different variables on construction outcomes, ultimately enhancing decision-making and increasing efficiency in project management. By leveraging the potential of machine learning, construction professionals can significantly reduce project timelines and costs while ensuring superior quality standards in materials and designs19.
Machine learning and deep learning, particularly artificial neural networks (ANN), play a crucial role in predicting concrete’s mechanical properties with high accuracy. Studies show that ANN can achieve correlation coefficients over 0.95 when applied to datasets with input parameters related to concrete composition and curing conditions18. Artificial intelligence (AI) techniques not only speed up the design process but also enable engineers to explore a wider range of material combinations, overcoming the limits of traditional methods. This shift supports sustainability in construction by optimizing resource use and minimizing waste20. As AI tools advance, they improve efficiency, reduce material waste, and lead to more customized concrete formulations for specific project needs21.
A review of the literature reveals a growing volume of research on modelling RCC. These studies emphasize the importance of understanding the mechanical properties of RCC through advanced modeling techniques. Table 1 lists AI modeling studies conducted on roller-compacted concrete and Fig. 1 presents commonly used AI techniues in civil engineering researches.
To date, numerous AI-based prediction methods have been developed, and these methods continue to evolve in terms of accuracy, efficiency, and adaptability to various applications. Among these methods, artificial neural network (ANN) stand out as a powerful tool for modeling and predicting complex relationships within data. ANN have been widely used by researchers, particularly in assessing the mechanical properties of concrete. The ability of ANN to learn from data and improve prediction accuracy over time makes them a valuable method in civil engineering applications32,33,34. In this context, ANNs are employed to predict the properties of eco-friendly concretes. Moreover, they enable engineers to minimize the environmental impact of concrete while allowing for the development of optimal formulations that maintain structural integrity and performance parameters35,36,37.
The limitations of existing AI techniques are often linked to their dependence on large datasets for training, which can be difficult to obtain in specialized areas of civil engineering, potentially leading to inaccuracies and biases in predictions. Furthermore, the scarcity of data in rare engineering scenarios, along with concerns over data privacy and security, adds complexity to their application. Many AI tools, particularly deep learning neural networks, operate as “black boxes” with limited interpretability, which presents challenges in engineering analysis and design, where transparency in decision-making is critical. Ongoing research is focused on improving model interpretability to provide clearer insights into their reasoning. The effective application of AI in engineering necessitates expertise in both AI and civil engineering, as the development and validation of AI tools require specialized knowledge. Engineers must also be adequately trained to apply these tools appropriately and interpret the results with precision38,39.
To overcome these challenges, researchers are exploring innovative approaches to enhance data collection and model training processes, such as the use of synthetic data generation and transfer learning techniques. Additionally, binary AI models are being employed to streamline decision-making by simplifying complex data interpretations, allowing engineers to focus on the critical elements of their projects without being overwhelmed by intricate algorithms. These advancements aim to strengthen the robustness of AI models, enabling them to perform more reliably in scenarios where real-world data is limited or challenging to obtain. As a result, the integration of these techniques not only improves model performance but also expands the applicability of AI solutions across a wide range of civil engineering projects38,40.
In RCC modeling studies, most of the proposed models are independent of each other, and hybrid models integrated with meta-heuristic optimization techniques have not yet been fully evaluated. Moreover, a review of RCC modeling studies reveals that the number of studies focusing on splitting tensile strength, one of the most critical parameters of RCC, is relatively limited27,41. These gaps present a significant opportunity for new research that, through the integration of advanced optimization methods, can enhance the predictive accuracy and reliability of ANN models in RCC applications. Additionally, the integration of ANN with Biogeography-Based Optimization (BBO) emerges as an innovative approach to improving prediction accuracy by optimizing the parameters of the neural network model42,43. Thus, the lack of ANN-BBO artificial intelligence model, which has not been applied in RCC studies before, will be overcome and the analysis of the splitting tensile strength will contribute to the limited number of splitting tensile strength modeling in the literature.
This study aims to develop models for predicting the compressive strength and splitting tensile strength of RCC using the frameworks of ANN model and ANN-BBO, a model that has not been used in RCC studies before. Data obtained from a study on the formation of interlayer cold joint in RCC having high volume fly ash, aiming aims to assess mechanical, permeability, and freeze–thaw properties to prevent cold joint44, and were modeled in this research. Additionally, this study seeks to evaluate the performance of the developed models and examine their implications for future engineering applications. The findings are expected to contribute to the existing body of knowledge and provide valuable insights for future research on optimizing concrete properties. Ultimately, the results are anticipated to highlight the effectiveness of combining AI with optimization techniques, leading to more efficient and sustainable applications in civil engineering.
Data definition
In this study, a dataset comprising 168 different RCC mixtures was analyzed to predict the compressive strength (CS) and splitting tensile strength (STS) of RCC44. The dataset was collected from an extensive experimental study focused on preventing cold joint formation in high-volume fly ash roller compacted concrete (HVFA-RCC). The data acquisition process involved laboratory testing of RCC specimens with varying material compositions and curing conditions. The data collection process was conducted in multiple stages to ensure accuracy and reliability. The following steps outline the methodology used:
Material selection and preparation
The materials incorporated in RCC mixtures comprised cement, fly ash, fine aggregates (0–5 mm), coarse aggregates (5–15 mm and 15–25 mm), water, set retardant admixture, adherence-enhancing additives, and interlayer mortar application, with their selection being driven by factors such as their widespread availability and prevalent utilization in typical RCC applications.
Mix design and proportioning
A total of 168 distinct RCC mix designs were formulated, each incorporating varying proportions of cementitious materials, aggregates, and additives. These mix designs were developed to assess the influence of material variations on compressive strength (CS) and split tensile strength (STS), with the goal of optimizing sustainability through elevated levels of fly ash replacement.
Specimen casting and curing
The concrete mixtures were prepared in accordance with standardized procedures to maintain consistency. To replicate field conditions, the fresh RCC mixtures were compacted using a vibrating hammer. Subsequently, the specimens were cured under controlled temperature and humidity conditions, simulating real-world environmental exposure.
Strength testing
After the curing period, the mechanical properties of the RCC were evaluated. Compressive strength (CS) was measured using cubic specimens subjected to compression following EN 12390-3 standards, while splitting tensile strength (STS) was evaluated using EN 12390-6 procedures. These tests yielded essential data for modeling the strength characteristics of RCC.
Data recording and preprocessing
The results from mechanical testing were meticulously documented, with outliers being identified and excluded to ensure the reliability of the data. Key parameters, including cement (X1), fly ash (X2), aggregate proportions (X3–X5), water content (X6), maximum dry unit weight (X7), Vebe time (X8), admixture dosages (X9–X10), interlayer mortar application (X11), and waiting time between layers (X12), were considered as input variables. Compressive strength (CS) and splitting tensile strength (STS) values were designated as the dependent output variables.
Normalization and statistical analysis
To enable effective AI modeling, all input parameters were normalized using the Z-score method, ensuring that discrepancies in scale or units did not introduce bias into the model’s performance. Descriptive statistics, including range, mean, standard deviation, skewness, and kurtosis, were calculated to evaluate the data distribution, as detailed in Fig. 2 and Table 2.
The comprehensive and systematic approach to data collection ensured the robustness and precision of the dataset, which establishes a reliable foundation for predictive modeling through artificial intelligence techniques. Moreover, the integration of machine learning methodologies in analyzing RCC properties facilitates a more efficient exploration of the relationships between material characteristics and mechanical performance.
Figure 3 shows the correlation matrix for 12 inputs to highlight the distribution of pairwise correlation coefficients. In this regard, a linear correlation between inputs (X1 − X12) and outputs (CS (Y1) and STS (Y2)) has been performed. The coefficient values range from − 1 to 1, with a value of 1 indicating a strong positive relationship and a value of − 1 indicating a strong negative relationship. The highest correlation coefficient between the parameters and both CS and STS outputs is for the cement parameter (X1), which is less than 0.9, and the reason for this is the undeniable role of cement as the main parameter in concrete materials. For both outputs, the absolute values of the correlation coefficients are not close to 1, indicating that multicollinearity is not a concern in this analysis.
Overview of AI techniques used
Artificial neural network (ANN)
ANNs are computational models inspired by the way signals flow through the structure of a nerve cell (neuron) in the human brain. Indeed, they simulate the interaction between incoming signals and resulting output. Typically, an ANN structure includes three main components: (i) an input layer that receives input signals and data from external sources, (ii) one or more hidden layers that are located between the input and output layers and process the information internally, and (iii) an output layer that provides the results of the model processing. The number of neurons in these layers can differ and is influenced by various factors, with the neurons in each layer remaining unconnected to one another45. The input and output variables determine the number of input and output neurons required for the model. The number of hidden layer neurons can vary, and it is very important to consider their optimal number to achieve accurate results46,47. In summary, a schematic representation of the model structure and its formulation can be found in Fig. 4 and Eqs. (1) and (2), respectively.
where neti: Net input signal, Xi (i = 1,2,…,n): Set of input parameters, wij: Connection weights, bj: Bias term, hj: Neuron’s output, f: Activation function.
The adaptability of the ANN enables it to adjust its structure during the learning process based on the data it encounters. By doing so, it identifies and models intricate connections between input and output variables, enabling it to solve complex problems48.
Biogeography-based optimization (BBO)
BBO draws inspiration from the concept of biogeography, which describes how species are distributed across ecosystems, as well as their relocation and extinction49. In biogeography, islands are small ecosystems consisting of populations of diverse species that are isolated from other habitats. Islands capable of supporting a diverse array of species are classified as having a habitat suitability index (HSI). The HSI of an island is influenced by factors such as climate, soil fertility, and vegetation. These environmental factors are referred to as suitability index variables (SIVs) that contribute to determining the island’s HSI. Habitat areas with high HSI (representing favorable conditions, i.e., a good solution) tend to host larger populations, whereas habitats with low HSI (indicating poor conditions, i.e., an inadequate solution) can experience higher migration rates and reduced inhabitants. Thus, good solutions are more inclined to share SIV with those having a low HSI and vice versa. The two main operators in the optimization process are migration and mutation (Eqs. 3–5). The migration operation enables the discovery of new areas of the search space by exchanging solutions between habitats using emigration and immigration rates. The mutation operation plays a crucial role because an abrupt change in the species impacts the solution coefficients. This avoids the problem of getting trapped in a local optimum50. Figure 5 illustrates a schematic representation of the biogeographically inspired BBO model.
where h: No. of current habitants, hmax: Maximum of No. of habitants that the habitat can support, Ph: Mutation probability of the hth habitat, Pmax: argmax(Ph).
Research methodology
This section seeks to provide a thorough description of the research process undertaken to propose an AI framework for modeling the CS and STS properties of RCC. To achieve this, two AI model frameworks, single ANN and ANN integrated with BBO, are proposed. Comparing these models offers valuable insights into the effectiveness of the hybrid approach versus the single model. The BBO metaheuristic optimization technique is applied to identify optimal variables, ensuring that the prediction model enhances accuracy while minimizing errors. At a glance, Fig. 6 illustrates the workflow in this study.
Data and model preparation
Data preprocessing is considered one of the initial and essential steps in modeling because it significantly increases data quality by removing data biases caused by scale or unit differences among different parameters, thereby improving the effectiveness of model development. In this regard, we normalized all parameter data with the Z-score approach according to Eq. (6).
where X: Measured value of each parameter, μ: Mean value of the data for each parameter, σ: Standard deviation of the data for each parameter.
When assessing AI models, a key question is whether the proposed model is the best one within its hypothesis space, particularly in terms of generalization to new, unseen data. The answer to this question is closely tied to data partitioning, which is a critical step in the modeling process. In order for our model to cover this generalizability, we use the three-way hold-out method to partition the data. For this purpose, in both the CS and STS categories, all data (168 data records) are randomly split into three sets: train, validate, and test, with proportions of 70% (118), 15% (25), and 15% (25), respectively. Each of these sets plays a specific role: Training the model and learning the key patterns in the data by training set, preventing the overfitting by validation set, and evaluating the generalizability performance by testing set. Given the superior learning performance of the Levenberg–Marquardt algorithm reported in the studies51,52, this algorithm is considered the learning algorithm. Furthermore, the hyperbolic tangent function, as described in the most cited book by Haykin53, is utilized as the transfer function. To determine the number of hidden layers, a highly cited study54 pointed out that it is not always necessary to have two or more layers for real-valued functions. For continuous functions, one hidden layer is sufficient. This is ensured by the universal approximation theorem55. Any continuous function can be approximated to arbitrary accuracy by a network with a single hidden layer, for sufficiently many neurons in the hidden layer. Further, according to the type of activation function used in this study (i.e. hyperbolic tangent), the performance of networks with many hidden layers can be sensitive to the initialisation of the weights. In addition, vanishing- or exploding-gradient problem aggravated the problem when the network has multiple hidden layers54. Based on these considerations, this study employs an architecture with only one hidden layer. With 12 input parameters (X1–X12) and one output parameter (either Y1 or Y2), the model follows a 12-Hiddenneuron − 1 arrangement. A thorough analysis is necessary to define the final model architecture (i.e., Hiddenneuron), which will be discussed in the initial part of the results section.
Model performance evaluation
This subsection aims to present the statistical measures employed to examine the performance of the developed models. A detailed list of these measures is provided in Table 3. Calculating and comparing these measures helps to identify an effective model.
Results and discussion
Finalizing the model’s architecture
The goal of this subsection is to achieve the final architecture of the model, i.e., the appropriate number of Hiddenneuron. To accomplish this, the models are evaluated with different arrangements, ranging from 10 to 20 neurons. For consistency in comparison, all models are implemented using the same dataset. The statistical measures (R2 and RMSE) are calculated for each model and ranked based on the quality of the responses. Finally, the model with the highest total assigned ranks is considered the best model. Table 4 lists the results of this assessment. Based on these results, the model with 14 Hiddenneuron has the highest ranking score, representing the superior performance of the model compared to other neuron architectures. Accordingly, the 12-14-1 architecture is selected for the model.
Table 5 summarizes the settings of selected parameters to present the information more effectively.
This analysis emphasizes the critical role of optimal parameter selection in maximizing the predictive performance of statistical models in civil engineering while simultaneously underscoring the necessity of rigorous validation procedures to guarantee their reliability in informing decision-making and influencing project outcomes in practical applications, as effective parameter tuning not only refines model precision but also enhances overall efficiency and sustainability in engineering projects, ultimately fostering superior resource management, minimizing material wastage, and driving cost-effective solutions.
Regression plots
The correlation between the measured and predicted values of CS and STS in three datasets—train, validation, and test—is depicted in Figs. 7 and 8, respectively. As can be seen from Fig. 7, the ANN model can estimate the recorded CS data in the train, validate, and test datasets with R2 values of 0.9424, 0.9563, and 0.9463, while the corresponding values for the ANN-BBO model are 0.9864, 0.9925, and 0.9969, respectively. The ANN-BBO model exhibits a stronger correlation, meaning its data points align more closely with the best-fitting regression line (i.e., y = x) compared to those of the single ANN model. Additionally, in modeling the STS (Fig. 8), the hybrid model demonstrated its ability to accurately match most of the predicted responses with the actual values. This is evidenced by the concentration of data points in the results provided by the hybrid model (ANN-BBO) within the area between the black dotted lines in the plots, which indicate a ± 10% deviation from the y = x line. Accordingly, it can be concluded from the comparison of the measured vs. predicted plots that the ANN-BBO model demonstrates outstanding performance in estimating the CS and STS of RCC.
This performance underscores the transformative potential of hybrid models in enhancing predictive accuracy and reliability within civil engineering applications, fostering more efficient design methodologies and refined structural assessments. At the same time, these models streamline the design phase, reduce costs, and optimize resource allocation in engineering projects, highlighting the importance of integrating cutting-edge computational techniques with conventional engineering practices to drive safer and more sustainable infrastructure development. The growing adoption of hybrid models in civil engineering can lead to a paradigm shift, encouraging continued research and innovation aimed at optimizing materials and construction methodologies for future advancements.
Error histogram plots
The histogram plots in Figs. 9 and 10 illustrate the performance error for each dataset, along with its occurrence percentage, in modeling CS and STS, respectively. This would allow for a more detailed examination of the distribution and range of errors in the performance of the proposed models. An evaluation of the CS prediction error histogram for the ANN-BBO model (Fig. 9b) reveals that over 90% of the prediction errors—specifically, 159 out of 168 data records (approximately 94%)—lie within the range of [− 2 MPa, 2 MPa], whereas the errors for the ANN model (Fig. 9a) span a broader range of [− 4 MPa, 4 MPa]. A similar trend is observed in the STS results (Fig. 10), where the ANN-BBO model estimated over 90% of the data within the interval [− 0.2 MPa, 0.2 MPa], demonstrating substantially greater accuracy than the ANN model with a broader range of [− 0.4 MPa, 0.4 MPa]. This indicates that the ANN-BBO model is capable of providing more accurate predictions with a narrower range of errors compared to the ANN model. The results inferred from the error histograms are consistent with those presented in the previous subsection, confirming the more reliable performance of the ANN-BBO model in predicting the strength properties of RCC compared to the single ANN model.
This enhanced predictive capability arises from the hybrid approach that combines ANNs with a BBO algorithm, which effectively fine-tunes model parameters, thereby improving overall accuracy. In the context of civil engineering applications, this advancement holds considerable potential for optimizing design processes and ensuring the structural integrity of reinforced concrete components under diverse load conditions. By leveraging this hybrid model, engineers can achieve more reliable and precise predictions, ultimately enhancing the safety and efficiency of infrastructure projects.
Statistical results
To further assess the performance of the proposed models, a thorough analysis was carried out using various statistical measures, and the results are listed in Table 6. By scrutinizing the performance metrics across all datasets, it is evident that the hybrid ANN-BBO model has significantly improved in reducing errors and, as a result, increasing accuracy compared to the single ANN model. More specifically, the error-related metrics—MAE, RMSE, PI, and OBJ—demonstrate notable reductions in error by 20%, 56%, 57%, and 57%, respectively, when comparing the hybrid model’s (ANN-BBO) performance to the single model (ANN) on all compressive strength data. Likewise, a comparable trend is experienced in the errors of the predicted STS data. Besides, statistical metrics such as NSE, VAF, and A10-index indicate a strong goodness-of-fit and high efficiency of the ANN-BBO model. For example, regarding the A10-index, the ANN-BBO model achieves A10-index values of 0.988 for CS and 1 for STS across all datasets, while the corresponding values in the single ANN model are 0.929 and 0.940, respectively. Consequently, the statistical results indicate that the performance of the ANN-BBO model could surpass what the single ANN model offers, highlighting the performance improvement of the proposed hybrid model.
Statistical results reveal that the model exhibits a high degree of accuracy across various metrics, demonstrating its effectiveness in predicting compressive and splitting-tensile strengths with minimal error. These findings highlight the robustness of the ANN when combined with the biogeography-based optimization (BBO) algorithm, positioning this hybrid approach as a reliable tool for structural strength prediction in engineering applications. The model’s superior performance underscores its potential to enhance the precision and efficiency of material strength assessments in civil engineering.
Role of input parameters in modeling
To assess the contribution of each input parameter in modeling the CS and STS of RCC, a relevancy factor (RF) analysis is conducted. The absolute RF value for each parameter reflects its extent of importance in determining the strength properties of the RCC, while its sign indicates whether its effect is positive or negative on the model’s outcome. The RF equation is defined as follows:
where Xi: ith input parameter (i = 1, …, 12, based on the number of input parameters defined), Xi,N: Value of Xi for the Nth data point, \(\overline{{X }_{i}}\): Average of the values of the input parameter with index i, YN,p: Predicted value of the Nth data point, \(\overline{{Y }_{p}}\): Average of the values predicted.
Figure 11 displays the results of this analysis for both CS and STS using the outputs obtained from the best-proposed model (ANN-BBO). Both outputs are regarded as strength properties, so the influence of the parameters follows the same trend, though there are slight differences in the values. As shown in Fig. 11, parameters X1 (cement) and X6 (water) have the most significant positive effect on the strength prediction results, which is logical given the well-known prominent role of these two components on the strength properties of RCCs56. On the other hand, parameter X9 (set retardant admixture) is identified as the parameter with the least impact on the strength properties. Among them, for CS performance, the parameters related to aggregate size (i.e., X3–X5) exhibit a strong inverse relationship, while for STS performance, the most negative effect is recorded by parameter X12 (waiting time after produce a layer). To further evaluate the importance of input parameters, SHapley Additive exPlanations (SHAP), a game theory-based method, is employed to interpret the contribution of each input parameter to the model’s output57. Figure 12 illustrates the significance of each feature by showing its impact on prediction accuracy. The color gradient represents the value of each feature, ranging from low (red) to high (blue). The horizontal axis of the plot indicates each feature’s effect on the model predictions, showing whether the influence is positive or negative. A positive impact implies that the feature increases prediction accuracy for the given sample, while a negative impact suggests a decrease in accuracy. A comparison of the RF and SHAP analyses clearly shows that both yield the same results. Similarly, in SHAP analysis for both CS and STS, parameter X1 (cement) is identified as the most important, and the repeated presence of data in the blue region indicates its positive effect. In contrast, parameter X12 shows a negative impact on both outputs. Consequently, both analyses show similar results for the contribution of each input parameter in modeling the CS and STS of RCC. These analyses are valuable as they elucidate how each input parameter influences the model’s output, offering a deeper understanding of the parameters’ roles and their impact on the model’s predictions.
Future works
Future research should prioritize refining the identified parameters to enhance both CS and STS while exploring alternative materials or innovative mixing methodologies that may yield superior performance. Additionally, investigating the complex interactions among these parameters could reveal synergistic effects that not only improve material properties but also optimize production processes. Furthermore, integrating AI techniques to analyze these interactions could provide valuable predictive insights, enabling researchers to fine-tune material compositions with greater precision for specific applications.
Implementing real-time strength prediction with ANN and IoT sensors can help monitor compressive and splitting tensile strength, preventing substandard batches. Sensor-based mix adjustment using moisture sensors and PLC feedback algorithms may help adjust water content for consistent workability. An AI-based mix design optimizer, leveraging ANN and Biogeography-Based Optimization (BBO), could balance performance, cost, and sustainability. Batch scheduling assistants, based on decision rules and environmental data, are recommended to optimize production timing and reduce delays. Finally, adopting a digital twin system with IoT, BIM, and AI simulations can offer real-time monitoring and holistic optimization of the concrete production process.
Conclusions
The compressive strength and splitting tensile strength of roller compacted concrete were statistically modeled using data obtained from an experimental study aimed at preventing cold joint formation.
In conclusion, the ANN model predicted CS values with R2 values of 0.9424, 0.9563, and 0.9463 for the training, validation, and test datasets, respectively, while the ANN-BBO model demonstrated superior performance with R2 values of 0.9864, 0.9925, and 0.9969, indicating a stronger correlation and more precise alignment with the ideal regression line (y = x). The CS modeling results reveal that over 90% of the ANN-BBO model’s prediction errors fall within [− 2 MPa, 2 MPa], whereas the ANN model exhibits a wider error range of [− 4 MPa, 4 MPa]; a similar trend is observed for STS predictions, where the ANN-BBO model confines more than 90% of values within [− 0.2 MPa, 0.2 MPa], outperforming the ANN model, which shows greater deviations, thereby confirming the ANN-BBO model’s superior reliability in predicting RCC strength properties.
Performance metric analysis across all datasets further highlights the ANN-BBO model’s enhanced predictive accuracy and error reduction, with MAE, RMSE, PI, and OBJ showing reductions of 20%, 56%, 57%, and 57%, respectively, in compressive strength data, alongside a similar trend in STS predictions, while statistical indicators such as NSE, VAF, and A10-index further affirm the model’s strong fit and efficiency. Ultimately, the statistical findings validate that the ANN-BBO model significantly surpasses the standalone ANN model, emphasizing the substantial performance improvements achieved through the proposed hybrid approach, which not only enhances accuracy but also delivers more reliable predictions, solidifying its effectiveness in comparison to traditional methods.
Data availability
The dataset that supports the findings of this study is not publicly available due to contractual limitations. However, key summary data and analyses are presented within the article. Further details may be available from the corresponding author upon reasonable request.
Abbreviations
- AI:
-
Artificial intelligence
- ANFIS:
-
Adaptive neuro-fuzzy inference system
- ANN:
-
Artificial neural network
- ANOVA:
-
Analysis of variance
- BBO:
-
Biogeography-based optimization
- BGG:
-
Bagging algorithms
- BR:
-
Bayesian regularisation
- CHAID:
-
Chi-square automatic interaction detection
- CNN:
-
Convolutional neural networks
- CS:
-
Compressive strength
- DSS:
-
Decision support system
- ELM:
-
Extreme learning machine
- FL:
-
Fuzzy logic
- GA:
-
Genetic algorithm
- GB:
-
Gradient boosting
- GBM:
-
Gradient boosting machine
- GEP:
-
Gene expression programming
- GOA:
-
Grasshopper optimisation algorithm
- HIS:
-
Habitat suitability index
- HVFA:
-
High volume fly ash
- LM:
-
Levenberg–Marquardt
- M5p:
-
M5prime
- M5r:
-
M5rule
- MAE:
-
Mean absolute error
- MARS:
-
Multivariate adaptive regression splines
- ML:
-
Machine learning or multiple linear regression
- MRA:
-
Multiple regression analysis
- NSE:
-
Nash–Sutcliffe efficiency
- OBJ:
-
Objective function
- PI:
-
Performance index
- PSO:
-
Particle swarm optimization
- RAP:
-
Recycled asphalt pavement
- RCC:
-
Roller compacted concrete
- RF:
-
Random forest
- RMSE:
-
Root mean squared error
- SCG:
-
Scaled-conjugate gradient
- SHAP:
-
SHapley Additive exPlanations
- SHM:
-
Structural health monitoring
- SIVs:
-
Suitability index variables
- STS:
-
Splitting-tensile strength
- SVR:
-
Support vector machines
- VAF:
-
Variance accounted for
References
Ludwig, D., Nanni, A. & Shoenberger, J. E. Application of Roller-Compacted Concrete (RCC) Technology to Roadway Paving (US Army Engineer Waterways Experiment Station, 1994).
Liu, H. W., Tian, B., Hou, R. G. & Li, S. Research on roller compacted concrete graduation with vibration liquefaction. Adv. Mater. Res. 857, 166–172 (2014).
Shamsaei, M., Aghayan, I. & Kazemi, K. A. Experimental investigation of using cross-linked polyethylene waste as aggregate in roller compacted concrete pavement. J. Clean. Prod. 165, 290–297 (2017).
Bayqra, S. H., Mardani-Aghabaglou, A. & Ramyar, K. Physical and mechanical properties of high volume fly ash roller compacted concrete pavement (A laboratory and case study). Constr. Build. Mater. 314, 125664 (2022).
Zdiri, M., Ben Ouezdou, M. & Neji, J. Theoretical and experimental study of roller-compacted concrete strength. Mag. Concrete Res. 60(7), 469–474 (2008).
Nath, P., Sarker, P. K. & Biswas, W. K. Effect of fly ash on the service life, carbon footprint and embodied energy of high strength concrete in the marine environment. Energy Build. 158, 1694–1702 (2018).
Fernando, A., Siriwardana, C., Gunasekara, C., Shahzad, W., Sethunge, S., Zhang, K. & Rajapakse, D. Performance-based concrete for carbon footprint reduction in the construction industry: A comprehensive systematic review of current progress and future prospects. In International Conference on Engineering, Project, and Production Management. (Springer, 2023).
Lehne, J. & Preston, F. Making concrete change: Innovation in low-carbon cement and concrete. Chatham House Report. 13 (2018).
Mardani-Aghabaglou, A., Özen, S. & Altun, M. G. Durability performance and dimensional stability of polypropylene fiber reinforced concrete. J. Green Build. 13(2), 20–41 (2018).
Kilic, I. & Gok, S. Strength and durability of roller compacted concrete with different types and addition rates of polypropylene fibers. Rev. De. La Constr. 20, 205–214 (2021).
Shankar, S. S., Natarajan, M. & Arasu, A. Exploring the strength and durability characteristics of high-performance fibre reinforced concrete containing nanosilica. J. Balkan Tribol. Assoc. 30(1), 142 (2024).
Anbarasu, N. A., Sivakumar, V., Yuvaraj, S., Veeramani, V. & Velusamy, S. Pioneering the next frontier in construction with high-strength concrete infused by nano materials. Matéria (Rio de Janeiro). 30, e20240730 (2025).
Kolase, P. K. & Desai, A. K. Experimental study on monotonic and fatigue behaviour of polypropylene fibre-reinforced roller-compacted concrete with fly ash. Road Mater. Pavement Design 20(5), 1096–1113 (2019).
Adamu, M., Mohammed, B. S. & Liew, M. S. Mechanical properties and performance of high volume fly ash roller compacted concrete containing crumb rubber and nano silica. Constr. Build. Mater. 171, 521–538 (2018).
Cao, C., Sun, W. & Qin, H. The analysis on strength and fly ash effect of roller-compacted concrete with high volume fly ash. Cem. Concr. Res. 30(1), 71–75 (2000).
Yerramala, A. & Babu, K. G. Transport properties of high volume fly ash roller compacted concrete. Cement Concr. Compos. 33(10), 1057–1062 (2011).
Lam, N.-T.-M., Nguyen, D.-L. & Le, D.-H. Predicting compressive strength of roller-compacted concrete pavement containing steel slag aggregate and fly ash. Int. J. Pavement Eng. 23(3), 731–744 (2022).
Kazemi, R. & Mirjalili, S. An AI-driven approach for modeling the compressive strength of sustainable concrete incorporating waste marble as an industrial by-product. Sci. Rep. 14(1), 26803 (2024).
Tipu, R. K., Panchal, V. & Pandya, K. Multi-objective optimized high-strength concrete mix design using a hybrid machine learning and metaheuristic algorithm. Asian J. Civil Eng.. 24(3), 849–867 (2023).
Aggarwal, S., Singh, R., Rathore, A., Kapoor, K. & Patel, M. A novel data-driven machine learning techniques to predict compressive strength of fly ash and recycled coarse aggregates based self-compacting concrete. Mater. Today Commun. 39, 109294 (2024).
Ashrafian, A. et al. Classification-based regression models for prediction of the mechanical properties of roller-compacted concrete pavement. Appl. Sci. 10(11), 3707 (2020).
Fakhri, M., Amoosoltani, E., Farhani, M. & Ahmadi, A. Determining optimal combination of RCCP mixture containing RAP and crumb rubber using hybrid ANN-GA method considering energy absorbency approach. Can. J. Civil Eng. 44, 945–955 (2017).
Vahidi, E. K., Malekabadi, M. M., Rezaei, A., Roshani, M. M. & Roshani, G. H. Modeling of mechanical properties of roller compacted concrete containing RHA using ANFIS. Comput. Concr. 19(4), 435–442 (2017).
Rooholamini, H., Hassani, A. & Aliha, M. Evaluating the effect of macro-synthetic fibre on the mechanical properties of roller-compacted concrete pavement using response surface methodology. Constr. Build. Mater. 159, 517–529 (2018).
Ashrafian, A., Gandomi, A. H., Rezaie-Balf, M. & Emadi, M. An evolutionary approach to formulate the compressive strength of roller compacted concrete pavement. Measurement 152, 107309 (2020).
Chakali, Y., Sadok, A. H., Tahlaiti, M. & Nacer, T. A PSO-ANN intelligent hybrid model to predict the compressive strength of limestone fillers roller compacted concrete (RCC) to build dams. KSCE J. Civ. Eng. 25(8), 3008–3018 (2021).
Adamu, M., Haruna, S., Ibrahim, Y. E. & Alanazi, H. Investigating the properties of roller-compacted rubberized concrete modified with nanosilica using response surface methodology. Innov. Infrastruct. Solut. 7(1), 119 (2022).
Zhang, G., Hamzehkolaei, N. S., Rashnoozadeh, H., Band, S. S. & Mosavi, A. Reliability assessment of compressive and splitting tensile strength prediction of roller compacted concrete pavement: Introducing MARS-GOA-MCS. Int. J. Pavement Eng. 23(14), 5030–5047 (2022).
Debbarma, S. G. & Ransinchung, R. N. Using artificial neural networks to predict the 28-day compressive strength of roller-compacted concrete pavements containing RAP aggregates. Road Mater. Pavement Des. 23(1), 149–167 (2022).
Hoang, N.-D. Estimating the compressive strength of roller compacted concrete using a novel swarm-optimised light gradient boosting machine. Int. J. Pavement Eng. 24(2), 2270765 (2023).
Calis, G., Yildizel, S. A. & Keskin, U. S. Predicting compressive strength of color pigment incorporated roller compacted concrete via machine learning algorithms: A comparative study. Int. J. Pavement Res. Technol. 17(6), 1586–1602 (2024).
Chaabene, W. B., Flah, M. & Nehdi, M. L. Machine learning prediction of mechanical properties of concrete: Critical review. Constr. Build. Mater. 260, 119889 (2020).
Waris, M. I., Plevris, V., Mir, J., Chairman, N. & Ahmad, A. An alternative approach for measuring the mechanical properties of hybrid concrete through image processing and machine learning. Constr. Build. Mater. 328, 126899 (2022).
Biswas, R. et al. A novel integrated approach of RUNge Kutta optimizer and ANN for estimating compressive strength of self-compacting concrete. Case Stud. Constr. Mater. 18, e02163 (2023).
Kazemi, R. & Gholampour, A. Evaluating the rapid chloride permeability of self-compacting concrete containing fly ash and silica fume exposed to different temperatures: An artificial intelligence framework. Constr. Build. Mater. 409, 133835 (2023).
Yang, S., Sun, J. & Zhifeng, X. Prediction on compressive strength of recycled aggregate self-compacting concrete by machine learning method. J. Build. Eng. 88, 109055 (2024).
Tipu, R. K., Panchal, V. & Pandya, K. Enhancing chloride concentration prediction in marine concrete using conjugate gradient-optimized backpropagation neural network. Asian J. Civil Eng. 25(1), 637–656 (2024).
Wu, Y., Liu, F., Zheng, L., Wu, X. & Lai, C. CSR-SVM: Compositional semantic representation for intelligent identification of engineering change documents based on SVM. Adv. Eng. Inform. 57, 102050 (2023).
Parekh, R. & Mitchell, O. Progress and obstacles in the use of artificial intelligence in civil engineering: An in-depth review. Int. J. Sci. Res. Arch. 13(1), 1059–1080 (2024).
Mirindi, D., Khang, A. & Mirindi, F. Artificial Intelligence (AI) and automation for driving green transportation systems: A comprehensive review. In Driving Green Transportation System Through Artificial Intelligence and Automation: Approaches, Technologies and Applications, 1–19 (2025).
Boussetta, I., El Euch Khay, S. & Neji, J. Comprehensive study on mechanical and transport properties of roller-compacted concrete incorporating reclaimed asphalt pavement. Int. J. Eng. Res. Africa. 71, 61–78 (2024).
Korouzhdeh, T., Eskandari-Naddaf, H. & Kazemi, R. Hybrid artificial neural network with biogeography-based optimization to assess the role of cement fineness on ecological footprint and mechanical properties of cement mortar expose to freezing/thawing. Constr. Build. Mater. 304, 124589 (2021).
Kazemi, R., Eskandari-Naddaf, H. & Korouzhdeh, T. New insight into the prediction of strength properties of cementitious mortar containing nano-and micro-silica based on porosity using hybrid artificial intelligence techniques. Struct. Concr. 24(4), 5556–5581 (2023).
Bayqra, S. H. Assessment of cold joint formation between layers in high fly ash roller compacted concrete. PhD Thesis. Accessed from thesis center of Turkey (2023) 797198.
Yegnanarayana, B. Artificial neural networks (PHI Learning Pvt. Ltd, 2009).
Panchal, G., Ganatra, A., Kosta, Y. & Panchal, D. Behaviour analysis of multilayer perceptrons with multiple hidden neurons and hidden layers. Int. J. Comput. Theory Eng. 3(2), 332–337 (2011).
Uzair, M. and N. Jamil. Effects of hidden layers on the efficiency of neural networks. In 2020 IEEE 23rd International Multitopic Conference (INMIC). (IEEE, 2020).
Adeli, H. Neural networks in civil engineering: 1989–2000. Comput. Aided Civil Infrastruct. Eng. 16(2), 126–142 (2001).
Simon, D. Biogeography-based optimization. IEEE Trans. Evol. Comput. 12(6), 702–713 (2008).
Yang, X.-S. Nature-Inspired Metaheuristic Algorithms (Luniver press, 2010).
Hagan, M. T. & Menhaj, M. B. Training feedforward networks with the Marquardt algorithm. IEEE Trans. Neural Netw. 5(6), 989–993 (1994).
El-Bakry, M. Feed forward neural networks modeling for K-P interactions. Chaos Solitons Fractals 18(5), 995–1000 (2003).
Haykin, S. Neural Networks and Learning Machines, 3/E (Pearson Education India, 2009).
Mehlig, B. Machine Learning with Neural Networks: An Introduction for Scientists and Engineers (Cambridge University Press, 2021).
Haykin, S. Neural Networks: A Comprehensive Foundation (Prentice Hall PTR, 1994).
Rahmani, E., Sharbatdar, M. K. & Beygi, M. A comprehensive investigation into the effect of water to cement ratios and cement contents on the physical and mechanical properties of Roller Compacted Concrete Pavement (RCCP). Constr. Build. Mater. 253, 119177 (2020).
Lundberg, S. M. & Lee, S.-I. A unified approach to interpreting model predictions. Adv. Neural Inform. Process. Syst. 30 (2017).
Acknowledgement
The authors express their appreciation for the assistance the Scientific and Technological Research Council of Turkey (TUBITAK) provided under Grant Number 222M425. Additionally, the fifth author thanks the Turkish Science Academy (TÜBA). The primary author extends gratitude to TÜBİTAK for the 2211A scholarship received during their doctoral studies. The authors are grateful to Óbuda University for covering the APC for this article.
Funding
Open access funding provided by Óbuda University.
Author information
Authors and Affiliations
Contributions
M.U. collected data, validated results, and contributed to writing the manuscript. R.K. performed data analysis, developed the results, and assisted with writing. Y.K. participated in data collection, contributed to the methodology, and helped write the manuscript. N.M. was involved in data collection, methodology development, and writing. A.M. contributed to data collection, methodology design, supervision, and manuscript writing. S.M. carried out data analysis, developed results, provided supervision and project administration, and contributed to writing. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Unverdi, M., Kazemi, R., Kaya, Y. et al. Predicting compressive and splitting tensile strength of high volume fly ash roller compacted concrete using ANN and ANN-biogeography based optimization models. Sci Rep 15, 21794 (2025). https://doi.org/10.1038/s41598-025-05700-y
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-05700-y















