Concrete crack opening forecasting by back propagation neural network and differential equation

Sun, Feifei; Xia, Zhonghua; Feng, Weiqian; Zhu, Xinhua; Xie, Jinping; Yu, Yu; Huang, Lvlong; Sheng, Dong

doi:10.1038/s41598-025-11216-2

Article
Open access
Published: 15 July 2025

Scientific Reports volume 15, Article number: 25452 (2025)

Abstract

Concrete crack opening (CCO) is of great importance to hydraulic engineering maintenance. A forecast method combining a back propagation neural network (BPNN) and a differential equation (DE) is put forward for daily CCO modeling. It was applied to Wangqingtuo Reservoir, in the northern semiarid region of China, and the contribution of the DE was assessed by using the BPNN model alone as a contrast. First, the method consists of BPNN and DE calibrations: (1) historical data are used to calibrate BPNN models and obtain residuals; (2) particle swarm optimization is used to calibrate the coefficients of the DE. The DE expresses the periodicity and time delay of air temperature well. Second, the field application yielded important results: (1) the BPNN models alone can provide reasonable predictions; (2) better prediction can be achieved with BPNN-DE-2TD, which increases KGE by 12% for JB-1, 37% for JB-3, and 6% for JB-7; (3) the addition of the DE improves the modeling of the role of air temperature under seasonal and linear trends, while the BPNN part expresses the nonlinear role of water level and precipitation well, as confirmed by Fourier amplitude sensitivity test (FAST) sensitivity analysis and Shapley Additive exPlanations (SHAP) analysis. This study could provide useful insights into further forecasting of CCO with this method elsewhere in the world.

Introduction

The structural integrity of hydraulic engineering projects, such as dams, levees, and spillways, is critical for ensuring engineering safety, water resource management, flood control, and energy generation. However, concrete cracks—ranging from superficial surface fissures to deep structural fractures—remain a persistent challenge, significantly compromising the durability and safety of these infrastructures^1,2,3. Cracks in hydraulic structures often arise from complex interactions among material properties, environmental stressors, and construction practices³.

As early as 1956, the factors of concrete displacement were first categorized⁴ as water pressure, temperature, and aging, acting through nonlinear relationships. Thermal gradients during cement hydration, shrinkage due to moisture loss, and uneven mechanical loading can induce tensile stresses exceeding concrete’s capacity, leading to crack initiation and propagation^3,5,6. Under climate change, the influence of temperature can become increasingly severe^7,8.

Numerical and laboratory studies have been carried out on crack opening^9,10. The peak load of a specimen in different loading stages has been expressed⁹ as a product of the axial compression load, slenderness ratio coefficient, eccentric coefficient, and reduction coefficient of the slotted section. A laboratory study¹⁰ confirmed the impact of basalt fiber on concrete under large eccentric compression, where the deflection and moment can be expressed by a differential equation and the maximum crack width for large eccentrically compressed columns can be defined by a formula.

Generally, the fracture mechanics and cracking of concrete can be summarized. Take concrete-filled steel tube columns as an example. A few failure modes of concrete-filled steel tube columns with huge sections⁹ were summarized: (1) under axial compression, the element columns bend to the outer side and show an unstable failure mode, but the connection plates can maintain the stability of the slotted column and increase the bearing capacity; (2) under small eccentric compression, the compression side element column shows the axial compression failure characteristic, and the tensile side element column shows the eccentric compression characteristic; (3) under axial compression or small eccentric compression, all failure locations are below the upper connection plate; (4) under large eccentric compression, all element columns show the eccentric compression failure characteristics, and the failure locations are at the middle height. These failure modes can partially explain the spatial distribution of cracks at a specific monitoring location.

Although it is common to use machine learning models in concrete-related studies^11,12,13,14, a combination of a nonlinear finite element model and machine learning techniques can also be applied to model the load-carrying capacity of concrete-filled steel tubes¹⁵. A finite-element cohesive zone model was used to consider potential cracks, while a constitutive model was used to account for interface damage and plasticity¹⁶. Interestingly, strain contours with a specified range provided by a finite element model were used to train a deep learning model¹². A deep learning-based acoustic emission data cluster framework¹⁴ was developed for evaluating fatigue cracks, which can diagnose overlapping microscopic noise and damage mechanisms across different cases with various crack lengths. In summary, strong compression, fractures under persistent heavy stresses, temperature variations, structural difficulties, and freeze–thaw cycles can contribute to concrete cracks¹⁷.

Generally, there are mechanical models¹⁸, statistical models⁷, and artificial intelligence (AI) models¹⁹ for concrete crack opening (CCO). Polynomial regression was applied to model the piezometer levels of the Kremasta Dam²⁰, while hydrostatic-season-time models were developed for modeling monitoring data²¹.

Since mechanical models of CCO require plenty of monitoring data and computation power, it is reasonable and feasible to focus on statistical and AI models. Although conventional statistical models of concrete crack opening exist¹⁹, they cannot capture the nonlinear features of temperature and other factors⁷ well. Existing models often oversimplify the influence of air temperature⁷, whereas the daily maximum and minimum air temperature should be considered²². Furthermore, the Back Propagation Neural Network (BPNN) has good potential⁷ for expressing the nonlinearity, but not the time delay, of the temperature influence. To fill the gap of expressing this phase shift, a differential equation can be introduced.

Model selection starts from a basic ANN model, the BPNN model, because our main goal is to test whether the DE part can improve the modeling accuracy.

Thus, a forecast method combining BPNN and a differential equation (DE) for CCO is put forward here to capture linearity, nonlinearity, and time delay. Our main contributions in this paper are as follows:

Formulating a BPNN model for daily CCO in training and testing stages;
Designing a differential equation to express the time delay and nonlinearity on the residual between true observations and model values of the trained BPNN model.

This paper is organized as follows: Section "Forecast method of BPNN-DE" describes the design of the forecasting method based on the Back Propagation Neural Network and Differential Equation (BPNN-DE). Section "Application in the northern semiarid region of China" describes the study region and datasets involved. Section "Results and discussion" presents results and discussion based on the performance of our method. Lastly, Section "Conclusions" concludes the paper, summarizing the results and providing suggestions for further work.

Forecast method of BPNN-DE

After obtaining the residual between the BPNN model values and the measured CCO, we found that the residual bears a seasonal and linear trend. As a result, we established the DE part and tested it.

Generally, BPNN, as a basic ANN tool, expresses the nonlinear relationship between CCO and WL and P, while the DE expresses the seasonal change and the long-term linear change.

General framework

The CCO forecast problem can be formulated as an AI modeling problem with two components: one BPNN model and one first-order differential equation. First, for one monitoring site in $S=\{1,\dots ,\text{e}\}$, with e the number of forecast sites, the historical data are separated for training and testing a BPNN model, where ${T}_{j,n}=\{1,\dots ,j, 1,\dots ,n\}$ represents the data, with j the number of time series and n the time. The data include water level (WL), precipitation, maximum and minimum air temperature (AT), and CCO values. Second, one DE is designed to model the residual between the observed CCO values and the values modeled by the BPNN model.

The forecast method is shown in Fig. 1 and Algorithm 1. First, the daily WL, precipitation, and maximum and minimum AT at t-1 are used to predict CCO at t. Second, the BPNN model is trained and tested. Third, a first-order differential equation is calibrated on the residual in the training phase, i.e., the difference between the measured CCO and the CCO predicted by the trained BPNN model.
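The one-day-ahead pairing described above (drivers at t-1, CCO at t) can be sketched as follows. This is a minimal illustration, not the authors' code: the function name `make_lagged_pairs` and the three-day record are hypothetical, and the column order WL, P, MA_T, MI_T is an assumption.

```python
from typing import List, Sequence, Tuple


def make_lagged_pairs(
    inputs: Sequence[Sequence[float]], cco: Sequence[float]
) -> Tuple[List[List[float]], List[float]]:
    """Pair the drivers observed on day t-1 with the CCO observed on day t."""
    if len(inputs) != len(cco):
        raise ValueError("inputs and cco must cover the same days")
    features = [list(row) for row in inputs[:-1]]  # days 0 .. n-2
    targets = list(cco[1:])                        # days 1 .. n-1
    return features, targets


# Hypothetical three-day record; columns assumed to be [WL, P, MA_T, MI_T].
daily_inputs = [
    [11.2, 0.0, 5.0, -3.0],
    [11.3, 1.5, 6.0, -2.0],
    [11.1, 0.0, 8.0, 0.0],
]
daily_cco = [-21.60, -21.65, -21.70]

X, y = make_lagged_pairs(daily_inputs, daily_cco)
```

With n days of records this yields n-1 training pairs, because the first day has no preceding drivers and the last day's drivers have no following CCO value.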

Here, BPNN-DE is described as an optimization problem:

$$\text{min }obj=|{f}_{B}({WL}_{m,t-1},{P}_{m,t-1},{MA\_T}_{m,t-1},{MI\_T}_{m,t-1}){-C}_{m,t}|$$

(1)

$${R}_{t}={C}_{m,t}-{C}_{p,t}={C}_{m,t}-{f}_{B}({WL}_{m,t-1},{P}_{m,t-1},{MA\_T}_{m,t-1},{MI\_T}_{m,t-1})$$

(2)

where:

${f}_{B}()$: The trained BPNN model based on measured data of training phase, including WL, precipitation, and maximum and minimum of AT at time t-1; here, the trained BPNN model captures the general nonlinear relationships among these factors, whereas the DE part tries to express the seasonality and nonlinearity of air temperature’s role and others.

${WL}_{m,t-1}$: The measured water level at time t-1;
${P}_{m,t-1}$: The measured precipitation at time t-1;
${MA\_T}_{m,t-1}$: The measured maximum air temperature at time t-1;
${MI\_T}_{m,t-1}$: The measured minimum air temperature at time t-1;
${C}_{m,t}$: The CCO measured value at time t;
${C}_{p,t}$: The CCO predicted value by BPNN at time t;
${R}_{t}$: The residual between the measured CCO and the CCO predicted by BPNN at time t.
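As a small illustration of Eqs. (1) and (2), the sketch below computes the residual series and a mean absolute error. The function `toy_f_b` is a hypothetical stand-in for the trained BPNN, and averaging the absolute error over time is an assumption, since Eq. (1) does not state how the objective is aggregated.

```python
from typing import Callable, List, Sequence


def residuals(
    f_b: Callable[[Sequence[float]], float],
    lagged_inputs: Sequence[Sequence[float]],
    measured_cco: Sequence[float],
) -> List[float]:
    """Eq. (2): R_t = C_{m,t} - f_B(inputs at t-1)."""
    return [c_m - f_b(x) for x, c_m in zip(lagged_inputs, measured_cco)]


def mean_absolute_error(res: Sequence[float]) -> float:
    """Eq. (1) aggregated over time as a mean (an assumption of this sketch)."""
    return sum(abs(r) for r in res) / len(res)


def toy_f_b(x: Sequence[float]) -> float:
    """Hypothetical stand-in for the trained BPNN; coefficients are made up."""
    wl, p, ma_t, mi_t = x
    return -20.0 - 0.1 * wl - 0.01 * ma_t


example_res = residuals(toy_f_b, [[11.0, 0.0, 10.0, 0.0]], [-21.0])
example_mae = mean_absolute_error(example_res)
```

The residual series produced this way is what the DE part is later calibrated on.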

Back propagation neural network

BPNN is a typical multilayer ANN based on error backpropagation²³. It applies the gradient descent algorithm to minimize the error. It is made up of three layers, namely the input layer, hidden layer, and output layer (Fig. 2). While multiple inputs are included in the input layer, one output is included in the output layer. In the hidden layer, the neurons have no direct contact with the outside world, but express the relationship between the input layer and output layer.

A conventional three-layer BPNN is used to establish the prediction model of the daily CCO in this paper. Tan-sigmoid is the transfer function between output and hidden layers, and the nonlinear Levenberg–Marquardt algorithm is the training function of BPNN. The maximum number of iterations is 100. The number of input layer nodes is the same as the number of input variables. The optimal number of hidden layer neurons is determined by repeatedly adjusting it within the range of 2 to 13. The original datasets are split into training samples (70%) and testing samples (30%).

After trial and error, two BPNN models were trained and tested with these inputs. Both have 4 neurons in the hidden layer.

The mathematical principle of the BPNN model is as follows²³:

$${y}_{i}=\sum_{j=0}^{m}{\omega }_{ij}{x}_{j}+{\beta }_{i}$$

(3)

where ${x}_{j}$ is the jth input neuron with $j\in (0, m)$, $m$ is the number of input neurons, ${\omega }_{ij}$ is the weight connecting the jth neuron in the input layer to the ith neuron in the hidden layer, ${\beta }_{i}$ is the bias-related weight of the ith hidden neuron, ${y}_{i}$ is the input of the ith hidden layer node (i = 0, 1, …, n), and n is the number of neurons in the hidden layer. Tan-sigmoid is the transfer function between the hidden layer and the output layer, and its form is as follows²³:

$${l}_{i}=\frac{1}{1+{e}^{-{y}_{i}}}$$

(4)

The output layer is estimated by the following equation²³:

$${g}_{k}=\sum_{i=0}^{n}{\omega }_{ik}{l}_{i}+{\beta }_{k}$$

(5)

$$O=\text{max}(0, {g}_{k})$$

(6)

Here, ${g}_{k}$ and O represent the input and output values of the output layer, respectively.

The formulas above describe the feedforward propagation of the BPNN model. During iterative training, the errors generated by the system are collected and propagated back from the output (Algorithm 2). By adjusting the weights and thresholds of the neurons, the network parameters corresponding to the minimum error are determined, yielding an ANN system that can simulate the original problem.
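The feedforward pass of Eqs. (3) to (6) can be sketched in a few lines. This is an illustrative reading, not the authors' MATLAB implementation: it uses the logistic form exactly as written in Eq. (4), assumes one bias per hidden neuron, and all weights in the example are made up.

```python
import math
from typing import List, Sequence


def forward(
    x: Sequence[float],
    w_hidden: Sequence[Sequence[float]],  # one row of input weights per hidden neuron
    b_hidden: Sequence[float],            # one bias per hidden neuron (assumed)
    w_out: Sequence[float],               # one weight per hidden neuron
    b_out: float,
) -> float:
    """Single-output feedforward pass following Eqs. (3)-(6)."""
    # Eq. (3): weighted sum of the inputs for each hidden neuron.
    y: List[float] = [
        sum(w * xj for w, xj in zip(row, x)) + b
        for row, b in zip(w_hidden, b_hidden)
    ]
    # Eq. (4): transfer function as written in the text.
    l = [1.0 / (1.0 + math.exp(-yi)) for yi in y]
    # Eq. (5): weighted sum of the hidden activations.
    g = sum(w * li for w, li in zip(w_out, l)) + b_out
    # Eq. (6): the output is clipped at zero.
    return max(0.0, g)


# Made-up example: 4 inputs, 2 hidden neurons, all hidden weights zero.
zero_hidden = [[0.0] * 4, [0.0] * 4]
out_positive = forward([1.0, 2.0, 3.0, 4.0], zero_hidden, [0.0, 0.0], [1.0, 1.0], 0.0)
out_clipped = forward([1.0, 2.0, 3.0, 4.0], zero_hidden, [0.0, 0.0], [-1.0, -1.0], 0.0)
```

With all hidden weights at zero, each hidden activation is 0.5, so the two calls make the effect of the output clipping in Eq. (6) easy to see.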

Differential equation

A first-order differential equation is constructed for the residual ${R}_{t}$.

$$\frac{dR}{dt}={k}_{1}*\text{cos}\left(\frac{2\pi \left(MOD\left(t,365\right)-TD\right)}{365}+\pi \right)+\frac{{k}_{2}}{365}$$

(7)

where:

${k}_{1}$: the amplitude coefficient of periodic fluctuation term characterizing the driving strength of crack propagation by the rate of environmental temperature change; probably ${k}_{1}$ is related with material thermal expansion coefficient and daily temperature difference.
$TD$: the time delay between air temperature and crack opening, which could be a constant, or a function of accumulated temperature on time t; TD can be piecewise function or other complex functions.
${k}_{1}*\text{cos}\left(\frac{2\pi \left(MOD\left(t,365\right)-TD\right)}{365}+\pi \right)$: The periodic term, which reflects the impact of seasonal temperature changes on cracks; $MOD\left(t,365\right)-TD$ represents the TDth day of each year as the phase reference (which may correspond to the time for heat transferring from air to concrete);
${k}_{2}$: The rate constant of linear trend term, which represents the continuous crack propagation caused by material degradation during characterization.
$\frac{{k}_{2}}{365}$: The linear trend term, which reflects the long-term crack propagation trend caused by material aging/continuous load and is related with time-varying effects such as concrete creep and foundation settlement.
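To make the two terms of Eq. (7) concrete, the sketch below evaluates the right-hand side and integrates it with a daily forward-Euler step. The step scheme and the coefficient values are assumptions chosen for illustration; the paper itself works with the analytic solution.

```python
import math
from typing import List


def dR_dt(t: float, k1: float, k2: float, td: float) -> float:
    """Right-hand side of Eq. (7): periodic term plus linear trend term."""
    phase = 2.0 * math.pi * ((t % 365) - td) / 365.0 + math.pi
    return k1 * math.cos(phase) + k2 / 365.0


def integrate_residual(
    r0: float, k1: float, k2: float, td: float, n_days: int, dt: float = 1.0
) -> List[float]:
    """Forward-Euler integration of Eq. (7), starting from R(0) = r0."""
    r = [r0]
    for step in range(n_days):
        r.append(r[-1] + dt * dR_dt(step * dt, k1, k2, td))
    return r


# Hypothetical coefficients: two years of daily residual values.
trajectory = integrate_residual(r0=0.0, k1=0.001, k2=0.02, td=30.0, n_days=730)
```

Over one full year the periodic term sums to zero, so any net drift in the integrated residual comes from the linear trend term alone.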

Integrating Eq. (7) gives,

$$R\left(t\right)=\int \left[{k}_{1}*\text{cos}\left(\frac{2\pi \left(MOD\left(t,365\right)-TD\right)}{365}+\pi \right)+\frac{{k}_{2}}{365}\right]dt+C$$

$$=\frac{{k}_{1}*365}{2\pi }*\text{sin}\left(\frac{2\pi \left(MOD\left(t,365\right)-TD\right)}{365}+\pi \right)+\frac{{k}_{2}}{365}t+C$$

(8)

At $t=0$, $R\left(0\right)$ is,

$$R\left(0\right)={R}_{0}=\frac{{k}_{1}*365}{2\pi }*\text{sin}\left(-\frac{2\pi *TD}{365}+\pi \right)+C$$

(9)

Using the trigonometric identity $\text{sin}\left(\pi -x\right)=\text{sin}(x)$, this simplifies to,

$$C={R}_{0}-\frac{{k}_{1}*365}{2\pi }*\text{sin}\left(\frac{2\pi *TD}{365}\right)$$

(10)

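Finally, substituting Eq. (10) into Eq. (8) gives,

$$R\left(t\right)={R}_{0}+\frac{{k}_{2}}{365}t+\frac{{k}_{1}*365}{2\pi }*\text{sin}\left(\frac{2\pi \left(MOD\left(t,365\right)-TD\right)}{365}+\pi \right)-\frac{{k}_{1}*365}{2\pi }*\text{sin}\left(\frac{2\pi *TD}{365}\right)$$

$$={K}_{1}*\text{sin}\left(\frac{2\pi \left(MOD\left(t,365\right)-TD\right)}{365}+\pi \right)+{K}_{2}t+{C}^{\prime}$$

(11)

where ${K}_{1}=\frac{365{k}_{1}}{2\pi }$, ${K}_{2}=\frac{{k}_{2}}{365}$, and ${C}^{\prime}={R}_{0}-{K}_{1}*\text{sin}\left(\frac{2\pi *TD}{365}\right)$.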

The details of the differential equation are described in Algorithm 3. First, the coefficients in Eq. (11) are calibrated by particle swarm optimization on the residual from the training phase of the BPNN model. Second, the calibrated coefficients are used to calculate the residual at time t.
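The calibration step can be sketched with a basic global-best particle swarm. The paper does not report its PSO settings, so everything below is assumed for illustration: the swarm size, the inertia and acceleration constants `w`, `c1`, `c2`, the parameter bounds, the RMSE loss, and the synthetic residual series used as data.

```python
import math
import random
from typing import Callable, List, Sequence, Tuple


def residual_model(t: float, K1: float, K2: float, TD: float, C: float) -> float:
    """Final form of Eq. (11): K1*sin(phase) + K2*t + C'."""
    phase = 2.0 * math.pi * ((t % 365) - TD) / 365.0 + math.pi
    return K1 * math.sin(phase) + K2 * t + C


def rmse(params: Sequence[float], days: Sequence[float], observed: Sequence[float]) -> float:
    """Root-mean-square misfit between Eq. (11) and the residual series."""
    K1, K2, TD, C = params
    sq = [(residual_model(t, K1, K2, TD, C) - r) ** 2 for t, r in zip(days, observed)]
    return math.sqrt(sum(sq) / len(sq))


def pso(
    loss: Callable[[Sequence[float]], float],
    bounds: Sequence[Tuple[float, float]],
    n_particles: int = 30,
    n_iter: int = 100,
    w: float = 0.7,
    c1: float = 1.5,
    c2: float = 1.5,
    seed: int = 0,
) -> Tuple[List[float], float]:
    """Global-best PSO with positions clamped to the given bounds."""
    rng = random.Random(seed)
    dim = len(bounds)
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_val = [loss(p) for p in pos]
    g = min(range(n_particles), key=pbest_val.__getitem__)
    gbest, gbest_val = pbest[g][:], pbest_val[g]
    for _ in range(n_iter):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (
                    w * vel[i][d]
                    + c1 * r1 * (pbest[i][d] - pos[i][d])
                    + c2 * r2 * (gbest[d] - pos[i][d])
                )
                lo, hi = bounds[d]
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            val = loss(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val


# Synthetic residual series generated from made-up coefficients.
days = list(range(0, 730, 5))
observed = [residual_model(t, 0.05, 0.0001, 30.0, 0.01) for t in days]
bounds = [(-0.2, 0.2), (-0.001, 0.001), (0.0, 365.0), (-0.1, 0.1)]

best, best_val = pso(lambda p: rmse(p, days, observed), bounds)
```

Because the global best is only ever replaced by a strictly better position, the returned loss can never exceed the best loss of the initial swarm; how close it gets to the generating coefficients depends on the assumed bounds and settings.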

Application in the northern semiarid region of China

Study region

Wangqingtuo Reservoir^26,27 is located in the western part of the town of Wangqingtuo in Tianjin, at 39°10′N and 116°52′E (Fig. 3), and was put into operation in 2019. It is a 24/7 regulating reservoir for the South to North Water Diversion Project and has no watershed cover, with a storage capacity of 2000 × 10⁴ m³. The main construction contents include the reservoir dam, pump station, water return gate, etc. Tianjin, China, lies in a semiarid region, with annual precipitation of 534.8 mm.

The most important functions of Wangqingtuo Reservoir are: (1) to ensure the smooth switching of water sources from the Yangtze River to the Luan River during maintenance and shutdown of the main canal of the South to North Water Diversion Project; (2) to regulate the unevenness of the incoming water and ensure the stability of urban water supply flow.

Datasets

Three CCO monitoring sites were involved in this study: JB-1, JB-3, and JB-7. Daily field records of WL, precipitation, and maximum and minimum AT from 2020-01-03 to 2024-08-10 were applied for modeling. The scatter plots, violin plots, and Spearman correlation matrix for statistical data analysis are displayed in Fig. 4. The scatter plots show: (1) JB-3 and JB-7 have similar scatter plots between CCO and WL, a kind of piecewise linear relationship; (2) the scatter plots between CCO and MA_T and MI_T in JB-3 have large hollows inside, while the scatter plot between CCO and MI_T in JB-1 has quite small hollows inside. Generally, the hollows in the scatter plots between CCO and MA_T and MI_T seem related to the time delay phenomenon.

Table 1 shows that CCO has a higher linear correlation with MA_T and MI_T than with WL and P. The CCO of JB-3 has a lower linear correlation with MA_T and MI_T than those of JB-1 and JB-7.

Table 1 Spearman matrix for concrete opening measurements.

Table 2 shows the basic statistical characteristics of the whole dataset. The linear relationship between CCO and WL is weak, which does not rule out a strong nonlinear relationship between CCO and WL.

Table 2 Basic statistical characteristics of the dataset.

The measurement is complete and automatic. Data cleaning, missing value handling, normalization, and seasonality adjustment are not involved.

This study adopted the measured values of WL, precipitation, and maximum and minimum AT from the meteorological department of Tianjin with the sole goal of evaluating this forecast method.

Study design under the method

First, we use the historical 20200103–20230810 datasets to train and test one BPNN model (see Algorithm 2).

Second, based on the difference between the measured CCO and the CCO predicted by the BPNN model during 20200103–20230810, PSO (see Algorithm 3) is used to calibrate the coefficients of the DE part.

Third, use the measured values during 20230811–20240810 to make BPNN predictions of CCO, and use the calibrated DE to obtain DE predictions of the CCO residual. Finally, sum these two parts.
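The periods and the final summation can be sketched as below, reading the date stamps as 3 January 2020 to 10 August 2023 for calibration and 11 August 2023 to 10 August 2024 for forecasting. The function names and the example values are hypothetical.

```python
from datetime import date

CALIBRATION_START = date(2020, 1, 3)
CALIBRATION_END = date(2023, 8, 10)
FORECAST_START = date(2023, 8, 11)
FORECAST_END = date(2024, 8, 10)


def phase_of(day: date) -> str:
    """Assign a record to the calibration (train/test) or forecasting period."""
    if CALIBRATION_START <= day <= CALIBRATION_END:
        return "calibration"
    if FORECAST_START <= day <= FORECAST_END:
        return "forecast"
    return "outside"


def combined_forecast(bpnn_prediction: float, de_residual: float) -> float:
    """Final CCO forecast: BPNN prediction plus the DE-modeled residual.

    This follows from Eq. (2), where the residual is measured minus predicted.
    """
    return bpnn_prediction + de_residual


# Made-up values: a BPNN prediction of -21.50 mm and a DE residual of 0.04 mm.
example_forecast = combined_forecast(-21.50, 0.04)
```

Adding, rather than subtracting, the DE output matters here: with the sign convention of Eq. (2), a positive residual means the BPNN alone underestimates the measured CCO.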

Fourth, the modeling is evaluated by R², maximum error, minimum error, average absolute error, the ratio with < 20% error, and Kling Gupta efficiency (KGE) in the training, testing, and forecasting stages. When R² is larger than 0.8, the modeling performs well. The closer the KGE is to 1, the better the modeling. The Fourier amplitude sensitivity test (FAST) sensitivity analysis, Shapley Additive exPlanations (SHAP) analysis, and Taylor analysis were carried out. We also adopted Variance Accounted For (VAF), entropy, mutual information (MI), and Total Information Criterion (TIC) for analysis. SPSS Modeler and MATLAB were applied to carry out the modeling and analysis.
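Three of these scores can be sketched as follows. The paper does not spell out its formulas, so the definitions are assumptions: KGE in its commonly used form built from correlation, variability ratio, and bias ratio; R² as the coefficient of determination; and the < 20% ratio taken relative to each measured value.

```python
import math
from statistics import mean, pstdev
from typing import Sequence


def pearson_r(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Linear correlation between observed and simulated series."""
    mo, ms = mean(obs), mean(sim)
    cov = sum((o - mo) * (s - ms) for o, s in zip(obs, sim))
    var_o = sum((o - mo) ** 2 for o in obs)
    var_s = sum((s - ms) ** 2 for s in sim)
    return cov / math.sqrt(var_o * var_s)


def kge(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Kling Gupta efficiency from correlation, variability ratio, and bias ratio."""
    r = pearson_r(obs, sim)
    alpha = pstdev(sim) / pstdev(obs)
    beta = mean(sim) / mean(obs)
    return 1.0 - math.sqrt((r - 1.0) ** 2 + (alpha - 1.0) ** 2 + (beta - 1.0) ** 2)


def r_squared(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Coefficient of determination, 1 - SSE/SST (assumed definition)."""
    mo = mean(obs)
    sse = sum((o - s) ** 2 for o, s in zip(obs, sim))
    sst = sum((o - mo) ** 2 for o in obs)
    return 1.0 - sse / sst


def ratio_within(obs: Sequence[float], sim: Sequence[float], tol: float = 0.2) -> float:
    """Fraction of points whose absolute error is below tol * |observed|."""
    hits = sum(1 for o, s in zip(obs, sim) if abs(s - o) < tol * abs(o))
    return hits / len(obs)


# Made-up series for illustration only.
obs_demo = [1.0, 2.0, 3.0, 4.0]
sim_demo = [1.1, 2.5, 3.0, 4.0]
demo_scores = (kge(obs_demo, sim_demo), r_squared(obs_demo, sim_demo), ratio_within(obs_demo, sim_demo))
```

A perfect simulation gives KGE = 1 and R² = 1, while a simulation with the right shape but a constant offset is penalized only through the bias ratio in KGE.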

Results and discussion

Training, testing, and forecasting

As can be seen from Table 3, the BPNN models alone do not perform well, with modest R² values. According to the Standard for hydrological information and hydrological forecasting (GB/T 22482–2008)²⁸, when the ratio with < 20% absolute error is larger than 85%, the forecasting is considered first-class. In short, our forecasting in JB-1 and JB-3 by the BPNN model (see Table 4) meets the first-class criterion.

Table 3 Performance of only BPNN model without the DE analytic solution.

Table 4 Performance of forecasting method through BPNN-DE.

With the PSO algorithm, the coefficients were calibrated for the BPNN-DE models (see Table 4). Based on the peak and valley points of air temperature, February 11th and July 24th are adopted as the split points to separate the time.

Thus, the BPNN-DE models comprise BPNN-DE-1TD and BPNN-DE-2TD. For BPNN-DE-1TD, only TD1 is applied to all residuals in Eq. (11). For BPNN-DE-2TD, TD1 is applied when t falls between February 11th and July 24th, and TD2 is applied when t is outside this period.
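The two-TD rule can be sketched as a simple calendar lookup. Treating both split dates as belonging to the TD1 period is an assumption of this sketch, and the delay values in the example are made up.

```python
from datetime import date


def select_td(day: date, td1: float, td2: float) -> float:
    """Return TD1 from February 11 to July 24 (inclusive, assumed), else TD2."""
    start = date(day.year, 2, 11)
    end = date(day.year, 7, 24)
    return td1 if start <= day <= end else td2


# Made-up delays of 20 and 5 days.
td_spring = select_td(date(2023, 3, 1), td1=20.0, td2=5.0)
td_winter = select_td(date(2023, 12, 1), td1=20.0, td2=5.0)
```

The selected value is then used as TD in Eq. (11), so BPNN-DE-1TD corresponds to the special case where both delays are equal.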

As can be seen from Table 4, the BPNN-DE models with one TD and with two TDs both perform better than the BPNN models, especially in terms of R² and KGE. From BPNN to BPNN-DE-1TD, the KGE increases by 11% for JB-1, 30% for JB-3, and 2% for JB-7, while from BPNN to BPNN-DE-2TD it increases by 12% for JB-1, 37% for JB-3, and 6% for JB-7. In general, the addition of the DE part can increase model performance, and models with two TDs add more accuracy than models with one TD, which confirms the time delay of the air temperature influence⁷. VAF, entropy, MI, and TIC confirm that the addition of TD can improve the modeling performance.

In general, these three CCO monitoring sites show different relationship features between the measured CCO and the mean, maximum, and minimum AT (see Fig. 5), which can reflect, to some extent, the universality and generality of the mechanism of crack occurrence in the northern semiarid region of China, in addition to small daily precipitation and small daily WL change. During 20200103–20230810, the maximum, minimum, and mean daily WL are 11.86 m, 8.32 m, and 11.16 m, respectively. For 1960–2024, the mean annual precipitation is 534.8 mm.

In JB-1, while WL and precipitation have poor R² with CCO, less than 0.18, the mean, maximum, and minimum AT have higher R², 0.63, 0.71, and 0.68, respectively. While AT has a quite positive relationship with CCO, both AT and CCO change with large random fluctuation, with time delay and poor continuity; see Fig. 5(a). It is obvious, based on maximum AT and CCO, that the time delay differs between the AT rising period and the AT falling period. As maximum AT rises, the measured CCO is scattered to the right of the scattered maximum AT by around 20 days. As maximum AT falls, the measured CCO and maximum AT are scattered much closer together. CCO in JB-1 during 20200103–20230810 has a mean value of −21.67 mm with a slightly increasing trend. The error histograms for BPNN, BPNN-DE-1TD, and BPNN-DE-2TD in Fig. 5(a) show that, with the addition of DE and TD: (1) the error range becomes smaller; (2) more errors get closer to zero.

In JB-3, while WL and precipitation have poor R² with CCO, less than 0.19, the mean, maximum, and minimum AT have slightly higher R², 0.44, 0.52, and 0.49, respectively. While AT has a quite negative relationship with CCO, CCO changes with small random fluctuation, with time delay and good continuity, as AT changes seasonally with large random fluctuation; see Fig. 5(b). It is obvious that the time delay is significant between AT extremes and CCO extremes. CCO in JB-3 during 20200103–20230810 has a mean value of −18.23 mm with no obvious trend. The error histograms for BPNN, BPNN-DE-1TD, and BPNN-DE-2TD in Fig. 5(b) show that, with the addition of DE and TD: (1) the error range becomes smaller; (2) more errors get closer to zero.

In JB-7, while WL and precipitation have poor R² with CCO, less than 0.17, the mean, maximum, and minimum AT have higher R², 0.74, 0.82, and 0.79, respectively. While AT has a quite negative relationship with CCO, CCO changes with small random fluctuation, with time delay and good continuity, as AT changes seasonally with large random fluctuation; see Fig. 5(c). It is obvious that the time delay is significant between AT extremes and CCO extremes. CCO in JB-7 during 20200103–20230810 has a mean value of −0.95 mm with a significant decreasing trend. The error histograms for BPNN, BPNN-DE-1TD, and BPNN-DE-2TD in Fig. 5(c) show that, with the addition of DE and TD: (1) the error range becomes smaller; (2) more errors get closer to zero.

As can be seen from Fig. 5(a), BPNN predictions in JB-1 generally capture the trend, but they have obvious time delay and point-scattered issues, and they cannot express the peaks and bottoms of CCO well. The addition of DE through the BPNN-DE-1TD and BPNN-DE-2TD models does help decrease the time delay issue and reduce the point-scattered issue. Moreover, both models with DE indeed improve the accuracy at the peaks of the predictions, but not at the bottoms.

As can be seen from Fig. 5(b), BPNN predictions in JB-3 generally capture the trend, but they have obvious time delay and point-scattered issues, and they cannot express the bottoms of CCO well. The addition of DE through the BPNN-DE-1TD and BPNN-DE-2TD models does help decrease the time delay issue and reduce the point-scattered issue. While both models with DE indeed improve the accuracy at the peaks and bottoms of the predictions, the DE models predict poorly around the peaks of 2020, 2022, and 2023.

As can be seen from Fig. 5(c), BPNN predictions in JB-7 generally capture the trend and are good at the bottoms but not at the peaks, and they have obvious time delay and point-scattered issues. The addition of DE through the BPNN-DE-1TD and BPNN-DE-2TD models does help decrease the time delay issue and reduce the point-scattered issue. While both models with DE indeed improve the accuracy at the peaks and bottoms of the predictions, the DE models predict poorly around the peaks of 2020, 2021, and 2024 and around the bottoms of 2021.

To sum up, the addition of DE to BPNN does increase the accuracy by reducing the point-scattered issue, and the addition of TD can fix the time delay to some extent. BPNN-DE-2TD can improve predictions at the peaks and bottoms of CCO to some extent.

Why does BPNN-DE-2TD perform well or not?

To investigate why BPNN-DE-2TD performs well, it is first necessary to sort out the factors of concrete crack opening. Generally, a few factors contribute to concrete crack opening, including water pressure, precipitation, temperature, and other material properties and mechanical characteristics. In the operation period of hydraulic engineering, WL, precipitation, AT, and aging are the main factors⁴. While mechanical models have their limits because they require plenty of monitoring data and computation power, statistical and AI models have great application potential⁷.

Second, the combination of BPNN and DE in this paper has shown both theoretical and practical value for modeling concrete crack opening (see Fig. 5). On one hand, BPNN demonstrates powerful nonlinear modeling capabilities on small and medium-sized datasets through hierarchical nonlinear transformations and error feedback mechanisms, but the Sigmoid function is prone to exponential gradient decay in deep networks^24,29. On the other hand, DE provides a complete modeling language from deterministic to stochastic, from continuous to discrete³⁰. While the sine/cosine function is good at expressing seasonal variations, the TDs can reflect the time delay of the influence of air temperature very well (see Fig. 5).

Third, the research target, Wangqingtuo Reservoir, is unique in some ways. First, it has no watershed, which has two implications: (1) its WL is not influenced by precipitation generating runoff from a watershed; (2) its WL is predictable by calculating the water volume in or out under one-day-ahead water-use planning. Moreover, it is quite shallow, with a design water level of 11.9 m, a dead water level of 6.47 m, a bottom elevation of 4.2 m, and a maximum water depth of 7.7 m. As a result, water pressure plays little role in CCO. Second, without a watershed, it only receives the precipitation falling on its reservoir area of 3.92 km². Thus, precipitation plays little role in CCO. To sum up, air temperature and other material or mechanical factors are the main contributors to CCO in Wangqingtuo Reservoir.

Fourth, although BPNN-DE-2TD performs well to some extent, it has some shortcomings in predicting some peaks or bottoms. First, the DE analytic solution does not use the AT time series, only Mod(t, 365), which could probably be improved by a function of the accumulated temperature. Second, the TD is a piecewise constant term, which could be improved by introducing a function of the AT. Third, since Wangqingtuo Reservoir is mainly influenced by AT, a DE on WL and precipitation for other hydraulic engineering projects could be a good research topic.

The physics behind BPNN-DE-2TD

The FAST sensitivity analysis, SHAP analysis, and Taylor analysis were carried out (see Fig. 6) to help understand the physics behind the BPNN-DE-2TD model.

First, BPNN expresses the nonlinear relationship between CCO and WL, P, MA_T, and MI_T, as shown by the relative importance values of the FAST sensitivity analysis (see Fig. 6.1.a, Fig. 6.2.a, and Fig. 6.3.a). WL and P are the most important factors. WL’s relative importance (RI) values in JB-1, JB-3, and JB-7 are 0.79, 0.68, and 0.87, respectively, while P’s RI values are 0.07, 0.23, and 0.12. JB-1 and JB-7 have an elevation of 9 m, while JB-3 is at −1 m. JB-1 and JB-3 are 42 m away from the external foot of the Wangqingtuo dam, while JB-7 is right at the external foot, 0 m.

Although the monitoring sites are limited, the following can probably be inferred from the FAST sensitivity analysis on the BPNN part: (1) P can increase its RI value by raising the underground water level. JB-3’s elevation is −1 m, so it can be affected by the underground water level. Consequently, JB-3 has the largest RI value for P, 0.23. (2) An increase in the distance to the water in the Wangqingtuo reservoir can reduce the RI value. As the distance increases from 0 m at JB-7 to 42 m at JB-1, the RI decreases from 0.87 at JB-7 to 0.79 at JB-1. (3) As the elevation drops from 9 m at JB-1 to −1 m at JB-3, WL’s RI drops from 0.79 to 0.68.

Second, DE expresses the seasonal variation, linear trend, and time delay, as confirmed by the first-order partial derivatives (see Fig. 6.1.b, Fig. 6.2.b, and Fig. 6.3.b) and SHAP analysis (see Fig. 6.1.c, Fig. 6.2.c, and Fig. 6.3.c). The first-order partial derivatives of the DE part show the seasonal amplitude sensitivity in the form of a shifted sine function, the long-term trend sensitivity in the form of a linear equation, and the phase sensitivity in the form of a shifted cosine function. The average absolute SHAP values of the periodic term and linear term are 0.057 and 0.068 for JB-1, 0.039 and 0.017 for JB-3, and 0.016 and 0.025 for JB-7. The ranges of JB-1, JB-3, and JB-7 are 0.03 mm, 0.07 mm, and 0.83 mm. Based on the average absolute SHAP values and ranges, it seems that the smaller the range is, the larger the average absolute SHAP values of the DE part are. As the range becomes larger, the periodic term and linear term of the DE part make smaller contributions.

Third, the TD does increase the modeling accuracy, as shown by the Taylor diagrams (see Fig. 6.1.d, Fig. 6.2.d, and Fig. 6.3.d). For JB-1, JB-3, and JB-7, the BPNN-DE-2TD model performs better than BPNN-DE-1TD, which in turn is better than the BPNN model. It is worth noting that the standard deviation of BPNN-DE-2TD approaches 1 for all monitoring sites. Two TDs offer more accuracy by assuming that different TDs exist in the heating process and the cooling process.

Conclusions

It is important to model concrete crack opening. Inspired by related research, a forecast method combining the BPNN and DE for measured CCO is put forward. First, historical data are used to calibrate BPNN models and obtain residuals. Second, the PSO is used to calibrate the DE for the residuals. Third, the BPNN and DE parts are summed.

This forecasting method was applied to the 2020–2024 daily datasets of the Wangqingtuo Reservoir. The application offers important results: (1) the BPNN models alone can provide reasonable predictions; (2) the method through BPNN-DE-2TD can achieve significantly better prediction, increasing KGE by 12% for JB-1, 37% for JB-3, and 6% for JB-7; (3) BPNN-DE-2TD can model the influence of air temperature well, but not water level and precipitation. Generally, DE can express the role of air temperature under seasonal and linear trends, while the BPNN part can express the nonlinear role of water level and precipitation, as confirmed by FAST sensitivity and SHAP analysis.

Admittedly, the forecast method is new and has some limitations, such as not considering a time-varying TD or time-varying WL. For example, WL changes influence CCO propagation. In our opinion, time-varying WL changes could be integrated into the DE part as a new subitem.

Data availability

The data related to this study are available from the corresponding author on reasonable request.

Abbreviations

AT:: Air temperature
BPNN:: Back propagation neural network
BPNN-DE-2TD:: Back propagation neural network-differential equation with two TDs
BPNN-DE-1TD:: Back propagation neural network-differential equation with one TD
C:: Concrete crack opening value
CCO:: Concrete crack opening
DE:: Differential equation
$f_B$:: The trained BPNN model
FAST:: Fourier amplitude sensitivity test
MA_T:: Maximum air temperature
MI_T:: Minimum air temperature
KGE:: Kling-Gupta efficiency
P:: Precipitation
R:: Residual
RI:: Relative importance
SHAP:: SHapley Additive exPlanations
TD:: Time delay
WL:: Water level

Funding

This work was supported by the Central Guidance Fund for Local Science and Technology Development of China (24ZYCGYS00730) and the National Key R&D Program of China (2021YFB3900603, 2021YFB3900605, and 2022YFC3800700).

Author information

Authors and Affiliations

China Water Resources Beifang Investigation, Design and Research Co., Ltd., 60 Dongting Road, Hexi District, Tianjin, 300222, China
Feifei Sun, Jinping Xie & Yu Yu
Tianjin Water Resources Research Institute, 60 Youyi Road, Hexi District, Tianjin, 300222, China
Zhonghua Xia & Weiqian Feng
Yellow River Conservancy Commission of the Ministry of Water Resources, 11 Jinshui Road, Zhengzhou, 450003, China
Xinhua Zhu
Hohai University, 1 Xikang Road, Nanjing, 210024, China
Lvlong Huang
Hunan Water Resources and Hydropower Research Institute, 370 Shaoshan North, Changsha, 410007, China
Dong Sheng

Contributions

All authors contributed to the study’s conception and design. The forecasting heuristic was formulated by F.S. Material preparation, data collection, and analysis were performed by Z.X., W.F., X.Z., J.X., Y.Y., L.H., and D.S. The first draft of the manuscript was written by F.S., and all authors commented on previous versions of the manuscript. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Feifei Sun.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Sun, F., Xia, Z., Feng, W. et al. Concrete crack opening forecasting by back propagation neural network and differential equation. Sci Rep 15, 25452 (2025). https://doi.org/10.1038/s41598-025-11216-2

Received: 22 April 2025
Accepted: 08 July 2025
Published: 15 July 2025
DOI: https://doi.org/10.1038/s41598-025-11216-2