Introduction

The squeezing effect1 caused by the driving process of jacked piles may cause vertical and horizontal displacements of the soil in the adjacent area, leading to the floating and displacements of the piles. In severe cases, it can cause problems such as broken piles, cracking of adjacent buildings, road heave, pipeline deformation, etc. Excessive piles position deviation will lead to uneven load distribution and even structural instability. Therefore, predicting and controlling the squeezing effect of jacked piles is the key to ensure the quality of the project. It has important application value to carry out the prediction of the displacements of the pile tops and ground surface around piles.

In terms of theoretical analysis, classical theoretical analysis methods include cylindrical hole expansion method2 and strain path method3. The cylindrical expansion method regards the process of piles driving as the expansion of a cylindrical hole in an elastic–plastic infinite medium, and the material follows the Tresca or Mohr Coulomb yield criterion. The theory holds that the resistance of piles driving is related to the deformation modulus and strength of soil, and it can reflect the nonlinear characteristics of soil and simulate the working state of actual piles. Chen et al.4 proposed a novel graph-based analysis method for analyzing the response of expanded cylindrical cavities in modified Cam clay under non-drainage conditions. Gao et al.5 uses the curve equation of quartic polynomial to simulate the boundary of pile hole. Based on the assumption that single-pile penetration can be simulated through a series of spherical cavity expansions, Li6 provided an analytical solution for cavity expansions near the slope. The solution provides a simplified and realistic theoretical method to predict the soil behaviors around the spherical cavity near the sloping ground.

The theory of cylindrical hole expansion7 is the study of cylindrical hole expansion under the action of uniform internal pressure p. The schematic diagram of the cylindrical hole expansion method is shown in Fig. 1. When the internal pressure p increases, the cylindrical region around the cylindrical hole will enter the plastic state from the elastic state, and the plastic region will expand with the increase of the internal pressure p. The maximum radius of the plastic zone is Rp, and the corresponding limit expansion pressure is Pu. The soil outside the radius Rp still maintains the elastic equilibrium state. In order to compare with the calculation results of machine learning algorithms, the cylindrical hole expansion method was simultaneously used to calculate the displacements of the soil around the pile.

Fig. 1
Fig. 1
Full size image

The schematic diagram of the cylindrical hole expansion method.

where Ru is the initial radius of cylindrical hole. Rp is the maximum radius of plastic zone. Pu is the limit expansion pressure value. r is the calculate the distance between point and piles center. r0 is the initial radius of the pile. a is the cylindrical hole radius during expansion. up is the radial displacement of the boundary of the influence zone. σθ is the tangential stress of soil. σr is the radial stress of soil.

The displacements in the elastic region are:

$${u}_{r}=\frac{(1+\mu ){c}_{0}\text{cos}{\varphi }_{0}}{E}(\frac{{R}_{p}}{r}{)}^{2}r$$
(1)

where μ is the Poisson’s ratio. \({c}_{0}\) is the cohesion of the soil. \({\varphi }_{0}\) is the internal friction angle of the soil.

It should be noted that the elastic modulus E in the above calculation formulas is different from the compression modulus Es of soil. E and Es can be converted according to the following relationship:

$$E={E}_{\text{s}}(1-\frac{2{\mu }^{2}}{1-\mu })$$
(2)

Lu et al.8 used finite element method to simulate the complete process of continuous driving of a single pile. Luo et al.9 studied the influence law of shielding effect on soil displacements of jacked piles, and found that shielding effect had significant influence on soil displacements in front direction and back direction. Shao et al.10 used the finite element method to simulate the displacements of soft clay and the underlying gravel layer with the increase of soil depth and radial distance during pile driving. It is found that the lateral soil displacement is obvious in the area 1.0 m away from the prestressed high-strength concrete (PHC) pile axis, and decreases with the increase of the radial distance during pile driving. When the radial distance is above 4.0 m, the lateral displacement can be ignored.

Zhou et al.11 monitored the driving process of three static press piles in saturated clay, and analyzed the variation laws of lateral displacement of soil around the piles, vertical uplift value of the ground, and pore water pressure with the depth of pile driving and distance from the pile center. Zhang et al.12 found that the soil deformation caused by pile driving first increases and then decreases in depth, and the soil deformation decreases exponentially in the horizontal direction. Under the influence of compression, the width of the shear strain zone does not change with the increase of driving. Yuan et al.13,14 proposed a method for visualizing the soil displacement field around laterally loaded piles using transparent soil technology. The influence of passive piles on the three-dimensional ground deformation around laterally loaded piles and laterally loaded piles was studied through a series of model experiments.

The theoretical analysis method simplifies the piles group into a single pile, which is very different from the actual situation. The field test method needs more material resources, and the test cost is high. The conventional model test method cannot guarantee the similarity ratio, and the centrifuge test cost is very high. In the numerical simulation of soil squeezing effect of pile groups, the grid will have large deformation, which will cause convergence difficulties, and the error between the simulation results and the measured results is often large.

Considering that there are many factors affecting the soil squeezing effect of piles group, the mechanism is complex and has strong nonlinearity, which is difficult to be expressed directly by explicit function. Although empirical formulas can be used to establish the mathematical expressions between various influencing factors and soil squeezing effect indexes (such as excess pore water pressure, the displacements of the pile tops and ground surface around piles), the prediction accuracy and universality of these empirical formulas are often not ideal. The rapid development of artificial intelligence (AI) technology provides new tools for the progress of many industries. Machine learning is an important part of AI. In recent years, the application of machine learning in all walks of life has developed rapidly. Machine learning is an appropriate and effective method to solve engineering problems.

Many scholars15,16,17,18,19 have introduced algorithms with excellent nonlinear mapping ability such back propagation (BP), adaptive boosting (Adaboost), deep neural networks (DNN), random forest (RF), extreme gradient boosting (XGBoost) and support vector machine (SVM) into pile foundation engineering, and established many prediction models of pile bearing capacity considering different influencing factors (such as the geometric parameters of the piles foundation, soil physical and mechanical parameters, standard driving test (SPT) value, resting time, etc.). Kordjazi et al.20 established a SVM the prediction model of pile bearing capacity based on 108 sample data sets including geometric parameters of pile foundation, pile load test and cone penetration test (CPT) test data. Shahin et al.21 established a recurrent neural networks (RNN) bearing capacity-settlement prediction model of pile foundation based on field load test and CPT test data. Moayedi, et al.22 established a prediction model of the load-settlement relationship curve of pile foundation based on the in-situ CPT data set by feedforward neural network (FFNN) and focused time delay neural network (FTDNN). Tan et al.23,24 proposed an innovative hybrid machine learning model specifically for predicting the load–displacement characteristics of bored in-situ piles. This model establishes a complex relationship between key design parameters (diameter, length, SPT index and effective overlay pressure) and the load–displacement response of piles. Tram et al.25,26 addresses a robust predictive model for the axial load-bearing behavior of pre-bored grouted planted nodular (PGPN) piles. This model adopts a new hybrid method for predicting pile head settlement and has been applied in pile foundation engineering in Vietnam. Yuan et al.27 examined the effects of coral sand particle size and rigid pile embedment depth on pile-soil interaction. The horizontal strain distribution in coral sand around piles under lateral load was disclosed.

At present, there are many research achievements on the prediction of pile foundation bearing capacity and pile foundation settlement based on machine learning algorithms. The influence laws of various influencing factors on the bearing capacity of pile foundations and the settlement of pile foundations were analyzed. However, there are few research results on machine learning prediction of the soil squeezing effect (pile foundation displacements, soil displacement around the pile) caused by the driving of piles group, and the analysis of the action laws of various factors affecting the soil squeezing effect of piles group is not thorough. Based on the machine learning prediction for the bearing capacity and settlement of the reference pile foundation, the relationship between the horizontal displacements of the pile tops and ground surface around piles and various influencing factors was established using the machine learning algorithm in this paper. In addition, the patterns of the effects of various influencing factors have been analyzed.

BP neural network is a multilayer feedforward neural network trained according to the error back propagation algorithm (Tiwari et al.28), which is one of the most widely used neural network models. But BP neural network has slow convergence speed, low learning efficiency and easy to converge to a local minimum (Guo et al.29). In addition, common machine learning algorithms also include DNN, RF, XGBoost, SVM, et al. ( Lin et al.30; Huang et al.31). As a representative of the Boosting series of algorithms, the Adaboost algorithm can improve the prediction accuracy of the model by gradually enhancing the model’s performance. AdaBoost can effectively enhance the prediction accuracy of BP neural networks. Adaboost-BP has disadvantages such as easy overfitting and slow training speed. AdaBoost-BP has been applied in the prediction of foundation and base settlement and is expected to be extended to other engineering scenarios32,33.

DNN can automatically learn more advanced and essential feature representations than shallow networks (such as BP networks), making it more suitable for handling complex tasks. DNN requires a large amount of data and computing power, is complex to train, and is easily affected by hyperparameters. RF effectively reduces the variance of a single decision tree through random sampling and random feature selection, and has a natural ability to resist overfitting. However, the RF algorithm often performs worse than the Boosting algorithm in dealing with complex problems. As an outstanding representative of the Boosting series of algorithms, XGBoost effectively reduces bias and variance and improves prediction accuracy through techniques such as gradient boosting, regularization, and weighted quantiles. The XGBoost algorithm has complex parameter tuning and is prone to overfitting on small data. AdaBoost-BP, RF and XGBoost are all integrated algorithms. Although DNN is not an integrated algorithm, its performance can be improved through the idea of integrated learning.

Each algorithm has its own advantages and disadvantages. In this study, multiple algorithms (AdaBoost-BP, DNN, RF and XGBoost) were used for displacements prediction, and the prediction results of different algorithms were compared. In addition, in order to compare with the calculation results of machine learning algorithms, the cylindrical hole expansion method was used to calculate the soil displacements around the pile.

The main influencing factors of the displacements of the pile tops and ground surface around piles

The influencing factors of soil squeezing effect of jacked piles mainly include soil properties, pile driving sequence, pile spacing, time effect, etc. (Sagaseta et al.6; Lu et al.8; Zhou et al.11). Different soil properties have different responses to soil squeezing effect. For example, unsaturated soil and sand will be compacted under the squeezing effect (void ratio decreases), while saturated cohesive soil will cause lateral displacement and vertical uplift due to excess pore water pressure. The influence of pile driving sequence on soil squeezing effect is also obvious8. For example, the soil squeezing effect of pile driving from the four sides to the middle is usually more serious than that of pile driving from the middle to the four sides. In addition, the size of the pile (pile diameter, pile length), pile spacing and soil plug effect on pile driving will affect the soil squeezing effect of jacked piles8. The soil property index of cohesive soil includes physical property index, plasticity index and liquid index. The three most important indexes of soil property, namely relative density, moisture content, and density, can be directly measured in the laboratory (Guo et al.29). Compression modulus, cohesion force and internal friction angle are also important factors affecting the displacement of the pile tops and ground surface around piles. In addition, factors such as pile diameter, pile length, bending stiffness of the pile body, number of piles, pile spacing, and time effect (the rest age after pile driving) also have a significant impact on the displacement of the pile tops and ground surface around piles.

In summary, there are 15 main factors affecting the displacements of pile tops of jacked piles group considered in this paper, which are respectively moisture content, natural density, relative density, compression modulus, cohesion, internal friction angle, pile diameter, pile length, bending stiffness of pile body, number of piles in each row in X direction, number of piles in each row in Y direction, pile spacing, resting time, the distance and orientation between the center of the pile tops and the center of the bearing platform. There are 15 main factors affecting the displacements of the ground surface around piles, which are moisture content, natural weight, relative density, compression modulus, cohesion, internal friction Angle, pile diameter, pile length, bending stiffness of pile body, number of piles in each row in X direction, number of piles in each row in Y direction, pile spacing, resting time, the distance and orientation between the monitoring points and the center of the bearing platform. Considering that the soil layer within the engineering site is multi-layered soil, referring to the method proposed by Liu et al.7, the average moisture content, average natural weight, average relative density, average compression modulus, average cohesion and average internal friction angle of the soil within the pile length range were obtained through the weighted average of soil layer thickness.

Introduction to machine learning algorithms

BP neural network

The classic BP neural network is composed of three layers: input layer, hidden layer and output layer (Guo et al.29). The topology structure of the three-layer BP network is shown in Fig. 2.

Fig. 2
Fig. 2
Full size image

Topological structure of BP neural network.

The algorithm expression of BP neural network can be found in relevant references (Wang.34).

The number of hidden layer units is largely dependent on experience (Tiwari et al.28). The selection range was determined by using the method of Deng et al.35.

$$\sum\limits_{i = 0}^{{\text{n}}} {C_{{n_{1} }}^{i} } > k$$
(3)
$$n_{1} = \sqrt {n + m} + a$$
(4)
$$n_{1} \ge \log_{2} n$$
(5)

The parameters in Eqs. (3), (4) and (5) are shown in reference (Deng et al.35).

AdaBoost algorithm

Adaboost combines multiple weak classifiers into a strong classifier by iteratively adjusting sample weights and weak classifier weights (Murmu et al.36). The topology structure of AdaBoost algorithm is shown in Fig. 3.

Fig. 3
Fig. 3
Full size image

Topological structure of the AdaBoost algorithm.

The expression of the AdaBoost algorithm can be found in reference Guo et al.29.

AdaBoost-BP algorithm

For each iteration (i.e., each weak predictor), a weak predictor is first trained using the current sample weights, and the prediction error of this predictor is calculated. If the prediction error of the samples in a certain round of iteration exceeds the set threshold, the sample weights will be updated based on the current weak predictor’s weight. The specific approach is to adjust the model by increasing the weight of the incorrect samples and reducing the weight of the correct samples. In the next round of training, more attention will be paid to the samples with higher errors, and the weight of each sample will be added to the total error rate. That is, the sum of the prediction errors. Calculate the weight of the current weak predictor based on the sum of the prediction errors.

The calculation process is as follows:

Step 1: Randomly select m groups of training data in the sample space and initialize the distribution weights Dt(i) of the data (Guo et al.29):

$$D_{{\text{t}}} \left( i \right) = {1}/{\text{m}}$$
(6)

Step 2: Determine network parameters (such as the number of nodes in the input layer, hidden layer, and output layer).

Step 3: Train to obtain the t-th BP weak predictor, denoted as ht(x). The cumulative error exceeds the weight Dt(i) of the corresponding term δt to obtain the calculation error εt:

$$\varepsilon_{t} = \sum\limits_{i = 1}^{m} {D_{t} (i)}$$
(7)

Step 4: Calculate the weight αt for ht(x) based on the error εt calculated in Step 3:

$$\alpha_{t} = \frac{1}{2}\ln \left( {\frac{{1 - \varepsilon_{t} }}{{\varepsilon_{t} }}} \right)$$
(8)

Step 5: Adjust the weight of training data.

$$D_{t + 1} (i) = \frac{{D_{t} (i)}}{{Z_{t} }} \times \left\{ {\begin{array}{*{20}l} {e^{{ - \alpha_{t} }} ,} \hfill & {\varepsilon_{t} < \delta_{t} } \hfill \\ {e^{{\alpha_{t} }} ,} \hfill & {\varepsilon_{t} \ge \delta_{t} } \hfill \\ \end{array} } \right.$$
(9)

where:

$$Z_{t} = \sum\limits_{i = 1}^{m} {D_{t} } (i) \times \left\{ {\begin{array}{*{20}l} {e^{{ - \alpha_{t} }} ,} \hfill & {\varepsilon_{t} < \delta_{t} } \hfill \\ {e^{{\alpha_{t} }} ,} \hfill & {\varepsilon_{t} \ge \delta_{t} } \hfill \\ \end{array} } \right.$$
(10)

Step 6: After training T rounds, obtain T weak predictors.

Step 7: Output strong predictor:

$$H(x) = \sum\limits_{i = 1}^{T} {\frac{{\alpha_{t} }}{{\sum\limits_{t = 1}^{T} {\alpha_{t} } }}} h_{t} (x)$$
(11)

Introduction to other machine learning methods

Besides AdaBoost algorithm, DNN, XGBoost, Bagging, RF and other algorithms are often used for prediction (Lin et al.30). Currently, the DNN used in the analysis are mainly FFNN. The depth of the network refers to the number of hidden layers. Different from the traditional shallow neural network, DNN can extract features from low to high, learn features between data at a deeper level, extract the features of each layer, and establish a mapping relationship from the bottom signal to the top signal. DNN has a deep nonlinear structure that approximates arbitrary complex functions, which is an important feature of traditional shallow neural network, and has stronger ability to deal with complex, uncertain and fuzzy data. DNN can express larger and more complex functions. The topology structure of a DNN with three hidden layers is shown in Fig. 4. In the DNN structure, the layers are connected to each other.

Fig. 4
Fig. 4
Full size image

Topology structure of DNN with three hidden layers.

Xgboost is an enhanced tree model (Guo et al.29), which integrates many decision trees to form a stronger learner. RF is an ensemble learning method that belongs to a type of supervised learning algorithm (Chen et al.37), and it is a classifier or regressor composed of multiple decision trees. In this paper, BP, AdaBoost-BP, DNN, Xgboost and RF algorithms are used to predict the squeezing effect.

Case analysis

Data sources

The length of the Metro Line 1 project in Bogota, the capital of Colombia, is about 23.9 km. The whole line of the project is a viaduct. The general feature of the landform of the project is that the groundwater level is shallow, the thickness of silt and peat soil is large, and there are often fine sand and clay intercalations. Most pile foundations of the project are PHC pipe piles. The project has a total of 6232 PHC pipe piles, and each static pile driver works an average of 3 piles per day. The model of PHC pipe pile is PHC-1000–140, with a diameter of 1000 mm and a wall thickness of 140 mm. The length of the piles is 15 to 48 m, and the concrete strength grade of the pile body is C60. The number of piles under the bearing platform is mainly 12, 16 and 20, and the pile spacing is 2.5 m. There are many different types of pipelines around the pile group (gas, water supply and drainage, cables, communications, etc., and the materials include concrete, cast iron, PVC, ceramics, etc.). The distance between most pipelines and adjacent pile foundations is 0.4 m ~ 5 m. The buried depth of the pipeline is between 1-5 m. The geological survey data of the project is complete. The construction company monitored the horizontal displacements of the pile tops and ground surface around piles before and after the construction of PHC pipe piles, and obtained a large number of measured data.

The input of the prediction model of displacements of the pile tops is as follows: Feature1is moisture content (%). Feature2 is natural weight (kN/m3) . Feature3 is the relative density (%). Feature4 is the compression modulus (MPa). Feature5 is the cohesion force (kPa). Feature 6 is the internal friction angle (°). Feature7 is the resting time (day). Feature8 is the horizontal distance r (m) between the center of the bearing platform and the center of the pile (Fig. 5); Feature9 represents the angle θ between the line connecting the center of the bearing platform and the center of the pile and the positive direction of the X-axis (°, counterclockwise is positive, Fig. 5)). Feature10 is the pile diameter (mm); Feature11 is the pile length (m); Feature12 is the pile bending stiffness EI (N‧mm2); Feature13 is the number of piles per row in the X direction; Feature14 is the number of piles per row in the Y direction; Feature15 is pile spacing (m). The output value is the horizontal displacements (mm) of the pile tops. Part of the sample data in this paper are shown in Table 1. The values of Feature10 ~ Feature15 are unchanged (1000, 30, 1.29E21, 3,4,2.5 respectively). Through trial calculation, it is found that removing the variables (Feature10 ~ Feature15) will improve the calculation speed of the model and have no significant impact on the prediction accuracy of the model. So these variables were removed in the modeling of this case. The pile position layout of the 12 pile groups is shown in Fig. 5. The pile driving sequence is 5 > 8 > 2 > 11 > 4 > 6 > 7 > 9 > 10 > 1 > 3 > 12.

Fig. 5
Fig. 5
Full size image

Pile position layout diagram of 12 pile groups.

Table 1 Partial samples of horizontal displacements at pile tops.

Similarly, the input of the prediction model of displacements of the ground surface around piles is as follows: Feature1 is moisture content (%). Feature2 is natural weight (kN/m3). Feature3 is the relative density (%). Feature4 is the compression modulus (MPa). Feature5 is the cohesion force (kPa). Feature6 is the internal friction angle (°). Feature7 is the resting time (day). Feature8 is the horizontal distance (m) between the center of the bearing platform and the monitoring point. Feature9 represents the angle between the line connecting the center of the bearing platform and the center of the pile and the positive direction of the X-axis (°, counterclockwise is positive). Feature10 is the pile diameter (mm); Feature11is the pile length (m); Feature12 is the pile bending stiffness EI(N‧mm2); Feature13 is the number of piles per row in the X direction; Feature14 is the number of piles per row in the Y direction; Feature15 is pile spacing (m); The output value is the horizontal displacements (mm) of the ground surface monitoring point. Part of the sample data in this paper are shown in Table 2. The values of Feature10 ~ Feature15 are unchanged (1000, 30, 1.29E21, 3, 4 and 2.5 respectively). For the same reason as above, these variables were removed in the modeling of this case.

Table 2 Partial samples of surface horizontal displacements of the ground surface around piles.

Through the displacement monitoring, 512 monitoring data of the horizontal displacements of the pile tops and 459 monitoring data of the horizontal displacements of the ground surface around piles were obtained. At the same time, the soil parameters of each piles group site are collected from the geological survey report of the project.

In order to analyze the influence of sample size on the model prediction results, the horizontal displacements predictions of the pile tops and ground surface around piles were carried out based on large samples and small samples, respectively. For the dataset of horizontal displacements of the pile tops, the number of large samples and small samples are 512 and 103, respectively. For the dataset of horizontal displacements of the ground surface around piles, the number of large samples and small samples are 459 and 84, respectively.

The K-Fold cross-validation method can solve the problem of a small number of samples in a dataset and also address the issue of hyperparameter tuning. Taking the prediction of the horizontal displacements of the pile tops as an example, the specific method of the fivefold cross-validation used in this paper is introduced as follows: i) Randomly select the horizontal displacements data at the pile tops corresponding to a certain bearing platform from all datasets as the test set (the random sampling tool in Excel was used to extract 12 samples). ii) Divide the remaining dataset into five equal parts. iii) Use one of them as the validation set and the remaining four as the training set each time. iv) Train the model and calculate the validation error. v) Repeat the above process five times . vi) Take the average of the verification errors of the five times as the final evaluation index.

The mathematical formula for K-fold cross-validation:

$$CV_{k} = \frac{1}{K}\sum\limits_{i = 1}^{K} {Loss_{i} }$$
(12)

where K is the number of folds. Lossi is the loss value on the i-th fold validation set (the mean square error (MSE) is adopted). CV(k) is the average loss value of K-fold cross-validation.

Parameter tuning was performed using the grid search method. The steps are as follows: by traversing the parameter grid (such as the Learning rate of 0.002, 0.005, 0.01, 0.015, 0.02), calculate the K-fold cross-validation mean score (MSE) for each combination. Select the parameter combination with the lowest MSE value. Then, the model is retrained on the full training set using the optimal parameters, and the final performance is evaluated through the test set.

The machine learning algorithm runs in the Windows 10 operating system, with an Intel Core i5-7300HQ processor and 32G of memory. The computation time for a single model ranges from 3 to 15 min.

The BP network established consists of 3 layers, in which there are 9 neurons in the input layer (Feature1, Feature2, Feature3……Feature15) and 1 neuron in the output layer (Y). The hidden layer of BP model uses S-type transfer function to calculate the number of hidden layer neurons according to formula (3) ~ (5). From Eqs. (3) to (5), the number of neurons in the hidden layer can be taken from 8 to 14.

The AdaBoost-BP prediction model was established by using MATLAB platform. Using Log–Sigmoid transfer function (Fig. 6), the return value of neural network is between (0,1).

Fig. 6
Fig. 6
Full size image

Log–Sigmoid transfer function.

The data of each sample is normalized by using the method of Min–Max Normalization (Eq. 13) to make it fall between (0,1).

$$x_{{{\text{normalized}}}} = \frac{x - \min \left( x \right)}{{\max \left( x \right) - \min \left( x \right)}}$$
(13)

Trainlm function is used for training, and the MSE target of the AdaBoost-BP model is set as 1e-5. The superparameters ranges and optimized values of BP model and AdaBoost model are shown in Table 3.

Table 3 Superparameters ranges and optimized values of BP model and AdaBoost model.

Weak predictor parameters are shown in Table 4.

Table 4 Weak predictor parameter table.

Taking the corresponding strong predictor for predicting the displacements of the pile tops (Small sample) as an example: H(x) = 0.1064 × h(1) + 0.1005 × h(2) + 0.0989 × h(3) + 0.0938 × h(4) + 0.1044 × h(5) + 0.0987 × h(6) + 0.0919 × h(7) + 0.1076 × h(8) + 0.1073 × h(9) + 0.0904 × h(10) .

The DNN, RF, and Xgboost models all use the Min–Max normalization method (see Eq. 13) to normalize the data of each sample to fall between (0,1).

For DNN model, appropriately increasing the number of hidden layers can effectively reduce the calculation error and improve the prediction accuracy to a certain extent, but it also complicates the calculation process of the model, which increases the training time of the model and may lead to “over fitting” phenomenon38. The range of the number of hidden layers in DNN refers to the conclusion of Mohammad et al.39.

Hyperparameter tuning is carried out using a fivefold cross-check. The superparameters ranges and optimized values of DNN model, RF model and XGboost model are shown in Tables 5, 6, 7.

Table 5 Superparameters and optimized parameters of DNN model.
Table 6 Superparameters and optimized parameters of RF model.
Table 7 Superparameters and optimized parameters of XGboost model.

The BP, AdaBoost-BP, DNN, RF, and Xgboost algorithms are all calculated based on MATLAB R2023b.

Evaluation indexes

There are various evaluation indexes for the model, and each has its own advantages and disadvantages. Among them, the coefficient of determination (R2, see Eq. 14) is easy to understand and is widely used in regression analysis. However, outliers can affect the R2 value, leading to distorted evaluation. Mean absolute error (MAE, see Eq. 15) is insensitive to outliers and is more robust, but MAE is an absolute indicator and needs to be combined with other indicators to evaluate different models. Mean square error (MSE, see Eq. 16) is suitable for scenarios that are sensitive to large errors, but outliers have a significant impact on MSE. Mean absolute percentage error (MAPE, see Eq. 17) represents error in percentage form and is not affected by the data dimension, facilitating comparison between different datasets. However, MAPE’s asymmetric penalties for overestimation and underestimation may lead the model to lean towards underestimation. In this paper, the evaluation of the model uses R2, MAE, MSE, MAPE to judge the quality of the model. The predicted and experimental values of BP model, AdaBoost-BP model, DNN model, RF model and xGBoost are shown in Tables 8, 9, 10, 11. It can be seen from this Tables that for small sample datasets, the prediction accuracy of the AdaBoost-BP model, RF model, and DNN model is higher than that of the BP model. The prediction accuracy of xGBoost is slightly lower than that of AdaBoost-BP model, DNN model and RF model, but higher than that of BP model. For large sample datasets, the prediction accuracy of various models has improved, but the prediction accuracy of the BP model is lower than that of other models

$${\text{R}}^{2} = 1 - \frac{{\sum\limits_{i = 1}^{n} {{(}y_{i} - \hat{y}_{i} {)}^{2} } }}{{\sum\limits_{i = 1}^{n} {{(}y_{i} - \frac{1}{n}\sum\limits_{i = 1}^{n} {y_{i} } {)}^{2} } }}$$
(14)
$${\text{MAE}} = \frac{1}{n}\sum\limits_{i = 1}^{n} {\left| {y_{i} - \hat{y}_{i} } \right|}$$
(15)
$${\text{MSE}} = \frac{1}{n}\sum\limits_{i = 1}^{n} {{(}y_{i} - \hat{y}_{i} {)}^{2} }$$
(16)
$${\text{MAPE}} = \frac{100\% }{n}\sum\limits_{i = 1}^{n} {\left| {\frac{{y_{i} - \hat{y}_{i} }}{{y_{i} }}} \right|}$$
(17)

where n is the total number of samples. yi is the true value of the i-th sample. \(\hat{y}_{i}\) is the predicted value for sample i.

Table 8 Predicted values and measured values of the displacements of the pile tops (Small sample).
Table 9 Predicted and measured values of the displacements of the pile tops (Large sample).
Table 10 Predicted values and measured values of the displacements of the ground surface around piles (Small sample).
Table 11 Predicted and measured values of the displacements of the ground surface around piles (Small sample (Large sample).

To evaluate whether there are significant differences among the above-mentioned different prediction models, the Wilcoxon signed-rank test was used to assess the large sample prediction set based on SPSS26 software. The original assumption of all tests was that "there was no significant difference in the overall distribution of the two different models." The result of the Wilcoxon symbolic rank test is the p-value. The obtained P-value is compared with the confidence value α (such as 0.05). If the P-value is lower than α, the null hypothesis is rejected. Table 12 shows the test results. It can be found that there is no significant difference among the models at the 0.05 level. At the 0.1 level, BP and AdaBoost-BP, BP and Xgboost, AdaBoost-BP & DNN all show significant differences in predicting the horizontal displacements of the pile tops, which is caused by the structure of the models themselves. Therefore, the adoption of ensemble learning models (such as AdaBoost-BP, Xgboost) or deep learning models is more effective in solving the problem of displacements prediction at the pile tops.

Table 12 Wilcoxon signed rank test results.

Analysis the mechanism of characteristic variable driven displacement

In order to analyze the mechanism of displacement driven by each characteristic variable, the horizontal displacements of the pile tops and ground surface around piles were predicted based on the AdaBoost-BP model. The measured samples of the horizontal displacement of the pile tops are 512, and the measured samples of the horizontal displacement of the ground surface around piles are 459. A fivefold cross-validation is adopted, and 20% of the samples are randomly selected as the test set. Randomly divide the remaining samples into 5 subsets and traverse these 5 subsets in sequence. Each time, the current subset is used as the validation set, and all the remaining samples are used as the training set for the training and evaluation of the model. The mean square error (MSE) is used as the loss function. The superparameters ranges and optimized values of AdaBoost-BP model are shown in Table 3. The calculation of a single model takes approximately 20 min.

Figure 7a and b are scatter plots of the horizontal displacements of the pile tops and ground surface around piles, respectively. It can be seen from the figures that the linear fitting between the predicted values and the measured values of the horizontal displacements of the pile tops and ground surface around piles is good, with R2 values of 0.82 and 0.67 respectively. It indicates that the AdaBoost-BP model is applicable to the prediction of the horizontal displacements of the pile tops and ground surface around piles, and the prediction results have a certain degree of credibility.

Fig. 7
Fig. 7
Full size image

Scatter plot of predicted and measured values in the test set. (a) Horizontal displacements of the pile tops; (b) Horizontal displacements of the ground surface around piles.

Residual analysis was conducted on the predicted and measured values of the horizontal displacements of the pile tops and ground surface around piles. The results are shown in Fig. 8a and b. As can be seen from the figures, the residual distributions of the horizontal displacements are both close to the normal distribution, indicating that the model assumption is reasonable and the prediction error is statistically stable.

Fig. 8
Fig. 8
Full size image

Residual plot of predicted and measured values in the test set. (a) Horizontal displacements of the pile tops; (b) Horizontal displacements of the ground surface around piles.

Figure 9a and b are the kernel density diagrams of the horizontal displacements of the pile tops and ground surface around piles, respectively. The figure shows that both the horizontal displacements data roughly follow a normal distribution, which is similar to the actual horizontal displacements. The fitting degree between the predicted displacements value and the measured displacements value is both good, but there are still certain errors. The causes of errors are related to factors such as the quality of the sample, the number of samples, the representativeness of the sample, and the values of model parameters.

Fig. 9
Fig. 9
Full size image

Comparison plots of kernel density. (a) Horizontal displacements of the pile tops; (b) Horizontal displacements of the ground surface around piles.

The SHAP (shapley additive explanations) value ranking of each feature was given through SHAP analysis. The variable weight ranking of AdaBoost-BP model for horizontal displacements of the pile tops is Feature8、Feature9、Feature7、Feature1、Feature3、Feature6、Feature5、Feature4、Feature2 (shown in Fig. 10a). The variable weight ranking of AdaBoost-BP model for horizontal displacements of the ground surface around piles is Feature8、Feature9、Feature7、Feature3、Feature1、Feature6、Feature5、Feature4、Feature2 (as shown in Fig. 10b). This reflects the distance (Feature8) between the center of the bearing platform and the pile (or the monitoring point) is the main influencing factor. The impact of the angle between the line connecting the center of the bearing platform and the pile (or the monitoring point) and the positive X-axis direction (Feature9) ranks second. This is mainly because the number of rows of the piles group in the X and Y directions is different. The resting time (Feature7) ranks third. This reflects that the time effect (such as soil rheology) has a significant impact on the displacements of the pile tops or monitoring points. In soil properties indicator, the influence of moisture content (Feature1), relative density (Feature3) and internal friction angle (Feature6) on the displacements of the pile tops or the ground surface around piles is more significant than other indexes (cohesion force (Feature5), compression modulus (Feature4) and natural weight (Feature2)).

Fig. 10
Fig. 10
Full size image

SHAP bar plot. (a) Horizontal displacements of the pile tops; (b) Horizontal displacements of the ground surface around piles.

Figure 11 presents the contribution intensity and action direction of all input features to the displacements through the visualization of SHAP values. Each point in the Figure represents a characteristic value of a sample, and the depth of the color corresponds to the size of the characteristic value (the larger the value, the lighter the color; conversely, the darker the value). As can be seen from Fig. 11a, samples with larger or smaller feature values in Feature8 show a significant negative effect. This is because the pile bodies corresponding to these samples are located in the central area of the bearing platform (pile 5 and pile 8 in Fig. 5) or at the corners (pile 1, pile 3, pile 10 and pile 12 in Fig. 5). The pile 5 and pile 8 have relatively small displacements at pile tops due to the strong constraint effect of nearby piles. The pile 1, pile 3, pile 10 and pile 12 are the last four piles to be pressed in, and their displacements at pile tops is also relatively small.

Fig. 11
Fig. 11
Full size image

SHAP analysis plots. (a) Horizontal displacements of the pile tops; (b) Horizontal displacements of the ground surface around piles.

The smaller the feature value of Feature8 in Fig. 11b, the greater its positive contribution to the displacements of ground surface around the piles. It indicates that the closer the distance between the ground surface monitoring points around piles and the center of the bearing platform, the greater the horizontal displacements. The pattern presented by Feature9 is similar to that of Feature8. The larger the feature value of Feature7 in Fig. 11a and b, the greater the positive effect on the displacements. It indicates that the longer the intermittent rest time, the greater the horizontal displacements of the pile tops and ground surface around piles. The smaller the feature value of Feature1 (moisture content) is, the greater the negative effect on displacements will be. It indicates that the lower the moisture content of the soil, the smaller the horizontal displacements. The greater the Feature3 (relative density) is, the denser the soil mass, and the stronger the interlocking and friction between soil particles during soil squeezing, making it more difficult for the soil to be compressed and flow laterally. The larger the internal friction angle (Feature6), the higher the shear strength of the soil, and the less likely the soil is to undergo shear failure during soil squeezing. That is, the smaller the soil deformation. The SHAP values of other features are relatively small, indicating that the influence laws of these features on the displacements are not obvious.

Quantile regression analysis

Quantile regression analysis was conducted with the horizontal displacements of the pile tops and ground surface around piles as dependent variables, and Feature1 to Feature9 as independent variables (large samples were used). The regression results are shown in Tables 13 and 14, respectively. As shown in Table 13, Feature1 has a significant impact at both the 0.1 and 0.9 percentiles (P < 0.05), Feature4 and Feature7 have a significant impact at the 0.9 percentiles (P < 0.05), Feature8 has a significant impact at the 0.1, 0.25, and 0.9 percentiles (P < 0.05), and Feature9 has a significant impact at all percentiles (P < 0.05). As shown in Table 14, Feature1 has a significant impact at the 0.1, 0.25, 0.75, and 0.9 quantiles (P < 0.05), Feature3 has a significant impact at the 0.9 quantile (P < 0.05), Feature7 and Feature8 have a significant impact at all quantiles (P < 0.05), and Feature9 has a significant impact at the 0.75 and 0.9 quantiles (P < 0.05). From these tables, it can be seen that Feature1, Feature7, Feature8, and Feature9 are the main influencing factors. From the 0.1 percentile to the 0.9 percentile, the regression coefficients of Feature1 and Feature7 show a general trend of increasing from small to large. It can be seen that with the increase of moisture content and interval time, the displacements of the pile tops and ground surface around piles both show an increasing trend. The regression coefficients of Feature8 are all negative, indicating a negative correlation between the horizontal displacements and the distance r (Fig. 5). The regression coefficient of Feature9 does not monotonically increase (or decrease) with the increase of quantiles, indicating that the relationship between the horizontal displacements and the angle θ (Fig. 5) is complex and needs to be analyzed in conjunction with other indexes.

Table 13 Quantile regression analysis of displacements of pile tops. RC- regression coefficient.
Table 14 Quantile regression analysis of displacements of ground surface around piles.

Calculation of soil displacements by cylindrical hole expansion theory

The theoretical calculation equation for cylindrical hole expansion does not take into account the influence factor of layered soil. Referring to the method proposed by Liu et al.7, the average Poisson’s ratios, the average elastic modulus, the average cohesion, and the average internal friction angle of soil within the pile length range were obtained by weighted averaging of soil layer thickness. The Poisson’s ratio of each layer of soil are referenced in relevant literature 7,40. Elastic modulus is calculated according to Formula (2). The value of Δ is 0.0157. The parameter values for the group of 12 piles are shown in Table 15. The radial displacements of the monitoring points of ground surface are calculated according to Eq. (1). The calculated values of the cylindrical hole expansion method and the measured values are shown in Table 16. According to Table 16, there is a certain error between the calculated values of the cylindrical hole expansion method and the measured values. As shown in Table 16, the prediction accuracy of machine learning model (such as DNN) is higher than that of the cylindrical hole expansion method.

Table 15 Parameter value table.
Table 16 Table of observation points displacements.

Conclusion

The main influencing factors were analyzed on the horizontal displacements of the pile tops and ground surface around piles caused by the pile driving. Based on the measured data of the Bogota Metro project and combined with machine learning algorithms, prediction models for the horizontal displacements of the pile tops and ground surface around the piles was established. The conclusion is as follows:

  1. (1)

    The AdaBoost algorithm was applied to the BP neural network model to form the Adaboost-BP model, which improved the learning ability of the BP neural network. Hyperparameter tuning was carried out using a fivefold cross-check. For small sample datasets, the prediction accuracy of AdaBoost-BP model, RF model and DNN model is higher than that of BP model. For large sample datasets, the prediction accuracy of various models has been improved, but the prediction accuracy of the BP model is lower than that of other models.

  2. (2)

    SHAP analysis was conducted on 9 input features. Analysis shows that the horizontal distance (Feature8) and angle (Feature9) between the center of the bearing platform and the center of the pile tops (or ground surface monitoring point) are the two most important influencing factors. The resting time (Feature7) is also an important influencing factor. Among soil property indexes, moisture content (Feature1), relative density (Feature 3), and internal friction angle (Feature6) has a more significant influence on the horizontal displacements than other indexes (cohesion (Feature5), compressive modulus (Feature4), and natural weight (Feature2)).

  3. (3)

    Quantile regression analysis shows that the horizontal displacement of the piles top and ground surface around piles is negatively correlated with the horizontal distance (Feature8), and positively correlated with the rest time (Feature7) and moisture content (Feature1).

  4. (4)

    The horizontal displacement of the ground surface monitoring points was calculated by using the cylindrical hole expansion method. It was found through comparison that the prediction accuracy of machine learning algorithms (such as DNN) is higher than that of the cylindrical hole expansion method.

  5. (5)

    The machine learning prediction model established considers the influence of geological conditions (such as soil moisture content, compression modulus), geometric conditions (such as pile number, pile length), and other factors, and can be used to predict the displacement caused by soil squeezing effect of pile groups to be built (such as Bogota Metro Line 2). The method is to collect geological conditions and pile group design schemes during the design phase, and machine learning models are used to predict the pile tops and ground surface around piles. Based on the prediction results, optimize the pile diameter, pile length, pile spacing, and construction sequence to reduce the impact of soil squeezing effect. During the construction phase, collect data on soil and pile deformation caused by soil squeezing effects. Optimize the soil squeezing effect prediction model, improve prediction accuracy, and dynamically adjust construction parameters (such as pile driving speed and sequence) based on the prediction results.

  6. (6)

    The number of samples has a significant impact on the prediction accuracy of the model. How to improve the prediction accuracy of machine learning algorithms in small sample situations for projects that are difficult to obtain large samples is a key issue that needs to be addressed in the future.