Prediction of the displacements of the pile tops and ground surface around piles based on machine learning algorithms

Li, Penglin; Guo, Shaolong; Liang, Manman; Lu, Qun

doi:10.1038/s41598-026-36502-5

Article
Open access
Published: 23 January 2026

Penglin Li¹,
Shaolong Guo²,
Manman Liang² &
…
Qun Lu²

Scientific Reports volume 16, Article number: 6057 (2026)

Abstract

The soil squeezing effect of pile groups may cause displacement and deformation at the pile tops and the ground surface around piles. In severe cases, it can lead to broken piles and cracking of adjacent buildings or pipes. Artificial intelligence provides a new way to predict the horizontal displacements of the pile tops and ground surface around piles caused by the soil squeezing effect. The adaptive boosting (AdaBoost) algorithm was applied to the back propagation (BP) neural network model to form the AdaBoost-BP model, which improved the learning ability of the BP neural network. For small sample datasets, the prediction accuracy of the AdaBoost-BP, Random Forest (RF) and Deep Neural Network (DNN) models is higher than that of the BP model. For large sample datasets, the prediction accuracy of all models improves, but that of the BP model remains lower than that of the other models. Analysis shows that the horizontal distance and angle between the center of the bearing platform and the center of the pile tops (or ground surface monitoring points) are the two most important influencing factors. The resting time is also an important influencing factor. Moisture content, relative density, and internal friction angle have a more significant influence on the horizontal displacements of the pile tops and ground surface around piles than other soil property indexes. Quantile regression analysis shows that the horizontal displacements are negatively correlated with the horizontal distance and positively correlated with the resting time and moisture content. The prediction accuracy of machine learning algorithms (such as DNN) is higher than that of the cylindrical hole expansion method.

Introduction

The squeezing effect¹ caused by the driving of jacked piles may cause vertical and horizontal displacements of the soil in the adjacent area, leading to uplift and displacement of the piles. In severe cases, it can cause broken piles, cracking of adjacent buildings, road heave and pipeline deformation. Excessive deviation in pile position leads to uneven load distribution and even structural instability. Therefore, predicting and controlling the squeezing effect of jacked piles is key to ensuring project quality, and predicting the displacements of the pile tops and ground surface around piles has important application value.

Classical theoretical analysis methods include the cylindrical hole expansion method² and the strain path method³. The cylindrical hole expansion method treats pile driving as the expansion of a cylindrical hole in an infinite elastic–plastic medium whose material follows the Tresca or Mohr–Coulomb yield criterion. The theory holds that the resistance to pile driving is related to the deformation modulus and strength of the soil, and it can reflect the nonlinear characteristics of soil and simulate the working state of actual piles. Chen et al.⁴ proposed a novel graph-based method for analyzing the response of expanding cylindrical cavities in modified Cam clay under undrained conditions. Gao et al.⁵ used a quartic polynomial curve to simulate the boundary of the pile hole. Based on the assumption that single-pile penetration can be simulated through a series of spherical cavity expansions, Li⁶ provided an analytical solution for cavity expansion near a slope. The solution offers a simplified and realistic theoretical method for predicting soil behavior around a spherical cavity near sloping ground.

The theory of cylindrical hole expansion⁷ is the study of cylindrical hole expansion under the action of uniform internal pressure p. The schematic diagram of the cylindrical hole expansion method is shown in Fig. 1. When the internal pressure p increases, the cylindrical region around the cylindrical hole will enter the plastic state from the elastic state, and the plastic region will expand with the increase of the internal pressure p. The maximum radius of the plastic zone is R_p, and the corresponding limit expansion pressure is P_u. The soil outside the radius R_p still maintains the elastic equilibrium state. In order to compare with the calculation results of machine learning algorithms, the cylindrical hole expansion method was simultaneously used to calculate the displacements of the soil around the pile.

where R_u is the initial radius of the cylindrical hole, R_p is the maximum radius of the plastic zone, P_u is the limit expansion pressure, r is the distance between the calculation point and the pile center, r₀ is the initial radius of the pile, a is the radius of the cylindrical hole during expansion, u_p is the radial displacement of the boundary of the influence zone, σ_θ is the tangential stress of the soil, and σ_r is the radial stress of the soil.

The displacements in the elastic region are:

$$u_{r} = \frac{(1+\mu)\,c_{0}\cos\varphi_{0}}{E}\left(\frac{R_{p}}{r}\right)^{2} r$$

(1)

where μ is the Poisson’s ratio. ${c}_{0}$ is the cohesion of the soil. ${\varphi }_{0}$ is the internal friction angle of the soil.

It should be noted that the elastic modulus E in the above calculation formulas is different from the compression modulus E_s of soil. E and E_s can be converted according to the following relationship:

$$E = E_{\text{s}}\left(1-\frac{2\mu^{2}}{1-\mu}\right)$$

(2)
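As a rough numerical illustration of Eqs. (1) and (2), the following sketch evaluates the elastic-zone radial displacement for a given plastic radius R_p. The function names and the sample soil values are hypothetical and not taken from the paper, and R_p is treated as a known input because its expression is not given in this section. Units are assumed to be consistent (kPa for c₀ and E, m for lengths), so the result is in metres.

```python
import math


def elastic_modulus(E_s, mu):
    """Eq. (2): convert compression modulus E_s to elastic modulus E."""
    return E_s * (1.0 - 2.0 * mu ** 2 / (1.0 - mu))


def elastic_radial_displacement(r, R_p, c0, phi0_deg, E, mu):
    """Eq. (1): radial displacement u_r at distance r in the elastic zone.

    r and R_p in m, c0 and E in kPa, phi0_deg in degrees (assumed units).
    """
    if r < R_p:
        raise ValueError("Eq. (1) applies only outside the plastic zone (r >= R_p)")
    phi0 = math.radians(phi0_deg)
    return (1.0 + mu) * c0 * math.cos(phi0) / E * (R_p / r) ** 2 * r


# Hypothetical soil: E_s = 4 MPa, Poisson's ratio 0.35, c0 = 15 kPa, phi0 = 10 deg
E = elastic_modulus(4000.0, 0.35)
u_5m = elastic_radial_displacement(5.0, 3.0, 15.0, 10.0, E, 0.35)
u_10m = elastic_radial_displacement(10.0, 3.0, 15.0, 10.0, E, 0.35)
```

Because Eq. (1) reduces to a constant times R_p²/r, the computed displacement halves when the distance doubles, which is consistent with the decay of displacement with distance described in the text.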

Lu et al.⁸ used the finite element method to simulate the complete process of continuous driving of a single pile. Luo et al.⁹ studied how the shielding effect influences the soil displacements caused by jacked piles, and found that it had a significant influence on soil displacements in the front and back directions. Shao et al.¹⁰ used the finite element method to simulate how the displacements of soft clay and the underlying gravel layer vary with soil depth and radial distance during pile driving. They found that the lateral soil displacement is pronounced within 1.0 m of the prestressed high-strength concrete (PHC) pile axis and decreases as the radial distance increases. When the radial distance exceeds 4.0 m, the lateral displacement can be ignored.

Zhou et al.¹¹ monitored the driving of three static press piles in saturated clay, and analyzed how the lateral displacement of the soil around the piles, the vertical uplift of the ground, and the pore water pressure vary with pile driving depth and distance from the pile center. Zhang et al.¹² found that the soil deformation caused by pile driving first increases and then decreases with depth, and decreases exponentially in the horizontal direction. Under the influence of compression, the width of the shear strain zone does not change as driving proceeds. Yuan et al.¹³,¹⁴ proposed a method for visualizing the soil displacement field around laterally loaded piles using transparent soil technology. The influence of passive piles on the three-dimensional ground deformation around laterally loaded piles was studied through a series of model experiments.

The theoretical analysis method simplifies the pile group into a single pile, which differs greatly from the actual situation. The field test method requires substantial material resources, and the test cost is high. The conventional model test method cannot guarantee the similarity ratio, and centrifuge tests are very expensive. In numerical simulation of the soil squeezing effect of pile groups, the mesh undergoes large deformation, which causes convergence difficulties, and the error between the simulated and measured results is often large.

Many factors affect the soil squeezing effect of pile groups, and the mechanism is complex and strongly nonlinear, so it is difficult to express directly with an explicit function. Although empirical formulas can be used to establish mathematical expressions between the influencing factors and the soil squeezing effect indexes (such as excess pore water pressure and the displacements of the pile tops and ground surface around piles), the prediction accuracy and generality of these formulas are often not ideal. The rapid development of artificial intelligence (AI) technology provides new tools for many industries. Machine learning is an important part of AI, and its application across industries has developed rapidly in recent years. It is an appropriate and effective method for solving engineering problems.

Many scholars¹⁵,¹⁶,¹⁷,¹⁸,¹⁹ have introduced algorithms with excellent nonlinear mapping ability, such as back propagation (BP), adaptive boosting (AdaBoost), deep neural networks (DNN), random forest (RF), extreme gradient boosting (XGBoost) and support vector machine (SVM), into pile foundation engineering, and have established many prediction models of pile bearing capacity that consider different influencing factors (such as the geometric parameters of the pile foundation, soil physical and mechanical parameters, standard penetration test (SPT) value, resting time, etc.). Kordjazi et al.²⁰ established an SVM prediction model of pile bearing capacity based on 108 sample data sets including geometric parameters of the pile foundation, pile load test data and cone penetration test (CPT) data. Shahin et al.²¹ established a recurrent neural network (RNN) bearing capacity-settlement prediction model of pile foundations based on field load test and CPT data. Moayedi et al.²² established a prediction model of the load-settlement curve of pile foundations from an in-situ CPT data set using a feedforward neural network (FFNN) and a focused time delay neural network (FTDNN). Tan et al.²³,²⁴ proposed a hybrid machine learning model specifically for predicting the load–displacement characteristics of bored in-situ piles. This model establishes a complex relationship between key design parameters (diameter, length, SPT index and effective overburden pressure) and the load–displacement response of piles. Tram et al.²⁵,²⁶ developed a robust predictive model for the axial load-bearing behavior of pre-bored grouted planted nodular (PGPN) piles. This model adopts a new hybrid method for predicting pile head settlement and has been applied in pile foundation engineering in Vietnam. Yuan et al.²⁷ examined the effects of coral sand particle size and rigid pile embedment depth on pile-soil interaction, and revealed the horizontal strain distribution in coral sand around piles under lateral load.

At present, there are many research results on predicting pile foundation bearing capacity and settlement with machine learning algorithms, and the influence of various factors on bearing capacity and settlement has been analyzed. However, there are few results on machine learning prediction of the soil squeezing effect (pile foundation displacements and soil displacement around the pile) caused by driving pile groups, and the way various factors affect the soil squeezing effect of pile groups has not been analyzed thoroughly. Drawing on existing machine learning studies of pile bearing capacity and settlement, this paper uses machine learning algorithms to establish the relationship between the horizontal displacements of the pile tops and ground surface around piles and the various influencing factors. In addition, the patterns of the effects of these factors are analyzed.

The BP neural network is a multilayer feedforward neural network trained with the error back propagation algorithm (Tiwari et al.²⁸), and it is one of the most widely used neural network models. However, the BP neural network converges slowly, learns inefficiently and tends to converge to a local minimum (Guo et al.²⁹). Other common machine learning algorithms include DNN, RF, XGBoost and SVM (Lin et al.³⁰; Huang et al.³¹). As a representative of the Boosting family of algorithms, AdaBoost improves prediction accuracy by gradually enhancing the model's performance, and it can effectively improve the prediction accuracy of BP neural networks. AdaBoost-BP has disadvantages such as a tendency to overfit and slow training. AdaBoost-BP has been applied to the prediction of foundation and base settlement and is expected to be extended to other engineering scenarios³²,³³.

DNN can automatically learn higher-level and more essential feature representations than shallow networks (such as BP networks), making it more suitable for complex tasks. However, DNN requires a large amount of data and computing power, is complex to train, and is sensitive to hyperparameters. RF effectively reduces the variance of a single decision tree through random sampling and random feature selection, and is naturally resistant to overfitting. However, RF often performs worse than Boosting algorithms on complex problems. As an outstanding representative of the Boosting family, XGBoost effectively reduces bias and variance and improves prediction accuracy through techniques such as gradient boosting, regularization, and weighted quantiles. However, XGBoost requires complex parameter tuning and is prone to overfitting on small datasets. AdaBoost-BP, RF and XGBoost are all ensemble algorithms. Although DNN is not an ensemble algorithm, its performance can be improved through ensemble learning.

Each algorithm has its own advantages and disadvantages. In this study, multiple algorithms (AdaBoost-BP, DNN, RF and XGBoost) were used for displacement prediction, and their prediction results were compared. In addition, for comparison with the machine learning results, the cylindrical hole expansion method was used to calculate the soil displacements around the pile.

The main influencing factors of the displacements of the pile tops and ground surface around piles

The factors influencing the soil squeezing effect of jacked piles mainly include soil properties, pile driving sequence, pile spacing, time effect, etc. (Sagaseta et al.⁶; Lu et al.⁸; Zhou et al.¹¹). Different soils respond differently to the soil squeezing effect. For example, unsaturated soil and sand are compacted under squeezing (the void ratio decreases), while saturated cohesive soil undergoes lateral displacement and vertical uplift due to excess pore water pressure. The influence of the pile driving sequence is also obvious⁸. For example, the soil squeezing effect of driving piles from the four sides toward the middle is usually more serious than that of driving from the middle toward the four sides. In addition, the size of the pile (pile diameter, pile length), the pile spacing and the soil plug effect during pile driving all affect the soil squeezing effect of jacked piles⁸. The soil property indexes of cohesive soil include the physical property indexes, the plasticity index and the liquidity index. The three most important soil property indexes, namely relative density, moisture content, and density, can be measured directly in the laboratory (Guo et al.²⁹). Compression modulus, cohesion and internal friction angle are also important factors affecting the displacement of the pile tops and ground surface around piles. Factors such as pile diameter, pile length, bending stiffness of the pile body, number of piles, pile spacing, and time effect (the resting time after pile driving) also have a significant impact on these displacements.

In summary, this paper considers 15 main factors affecting the displacements of the pile tops of a jacked pile group: moisture content, natural weight, relative density, compression modulus, cohesion, internal friction angle, pile diameter, pile length, bending stiffness of the pile body, number of piles in each row in the X direction, number of piles in each row in the Y direction, pile spacing, resting time, and the distance and orientation between the center of the pile top and the center of the bearing platform. There are likewise 15 main factors affecting the displacements of the ground surface around piles: the first 13 factors are the same, and the last two are the distance and orientation between the monitoring point and the center of the bearing platform. Because the soil within the engineering site is multi-layered, the average moisture content, natural weight, relative density, compression modulus, cohesion and internal friction angle of the soil within the pile length range were obtained as averages weighted by soil layer thickness, following the method proposed by Liu et al.⁷.
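The thickness-weighted averaging of layered soil properties described above can be sketched as follows. This is a minimal illustration, not the procedure of Liu et al.⁷ in detail: the function name and the two-layer profile are hypothetical, and the layer thicknesses are assumed to be already clipped to the pile length range.

```python
def thickness_weighted_average(values, thicknesses):
    """Average a soil property over layers, weighting each layer by its thickness.

    values      -- property value of each layer (e.g. moisture content in %)
    thicknesses -- thickness of each layer within the pile length range (m)
    """
    if len(values) != len(thicknesses) or not values:
        raise ValueError("values and thicknesses must be non-empty and equal in length")
    total = sum(thicknesses)
    if total <= 0:
        raise ValueError("total thickness must be positive")
    return sum(v * h for v, h in zip(values, thicknesses)) / total


# Hypothetical profile: 10 m of soil at 30 % moisture over 30 m at 50 %
avg_moisture = thickness_weighted_average([30.0, 50.0], [10.0, 30.0])
```

The same helper would be applied once per property (natural weight, relative density, compression modulus, cohesion, internal friction angle) to obtain one representative value per pile.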

Introduction to machine learning algorithms

BP neural network

The classic BP neural network is composed of three layers: input layer, hidden layer and output layer (Guo et al.²⁹). The topology structure of the three-layer BP network is shown in Fig. 2.

The mathematical formulation of the BP neural network can be found in the literature (Wang³⁴).

The number of hidden layer units is largely dependent on experience (Tiwari et al.²⁸). The selection range was determined by using the method of Deng et al.³⁵.

$$\sum\limits_{i = 0}^{{\text{n}}} {C_{{n_{1} }}^{i} } > k$$

(3)

$$n_{1} = \sqrt {n + m} + a$$

(4)

$$n_{1} \ge \log_{2} n$$

(5)

The parameters in Eqs. (3), (4) and (5) are defined in Deng et al.³⁵.
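To show how Eqs. (3) to (5) narrow the search for the number of hidden units, the sketch below enumerates candidate values. Since the parameter definitions are only given in the cited reference, the meanings used here are assumptions based on the common form of this empirical rule: n is the number of input units, m the number of output units, n₁ the number of hidden units, a an integer constant between 1 and 10, and k the number of samples, with the binomial coefficient taken as zero when i > n₁. The sample count of 200 is hypothetical.

```python
import math


def hidden_unit_candidates(n, m, k, a_values=range(1, 11)):
    """Return hidden-unit counts n1 that satisfy Eqs. (3)-(5).

    Assumed meanings: n inputs, m outputs, k samples, a in 1..10.
    """
    candidates = set()
    for a in a_values:
        n1 = round(math.sqrt(n + m)) + a          # Eq. (4), rounded to an integer
        if n1 < math.log2(n):                     # Eq. (5)
            continue
        # Eq. (3): math.comb(n1, i) is 0 when i > n1
        if sum(math.comb(n1, i) for i in range(n + 1)) <= k:
            continue
        candidates.add(n1)
    return sorted(candidates)


# 9 input features, 1 output, hypothetical 200 samples
sizes = hidden_unit_candidates(n=9, m=1, k=200)
```

The rules only bound the search range; the final number of hidden units would still be selected by trial training within that range.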

AdaBoost algorithm

AdaBoost combines multiple weak classifiers into a strong classifier by iteratively adjusting the sample weights and the weak classifier weights (Murmu et al.³⁶). The topology of the AdaBoost algorithm is shown in Fig. 3.

The expression of the AdaBoost algorithm can be found in reference Guo et al.²⁹.

AdaBoost-BP algorithm

In each iteration, a weak predictor is first trained using the current sample weights, and its prediction error on each sample is calculated. The weights of the samples whose prediction error exceeds the set threshold are summed to give the total error rate of the round, and the weight of the current weak predictor is calculated from this error rate. The sample weights are then updated using the weight of the current weak predictor: the weights of poorly predicted samples are increased and the weights of well-predicted samples are reduced, so that the next round of training pays more attention to the samples with larger errors.

The calculation process is as follows:

Step 1: Randomly select m groups of training data in the sample space and initialize the distribution weights D_t(i) of the data (Guo et al.²⁹):

$$D_{{\text{t}}} \left( i \right) = {1}/{\text{m}}$$

(6)

Step 2: Determine network parameters (such as the number of nodes in the input layer, hidden layer, and output layer).

Step 3: Train to obtain the t-th BP weak predictor, denoted as h_t(x). Sum the distribution weights D_t(i) of the samples whose prediction error exceeds the threshold δ_t to obtain the error ε_t:

$$\varepsilon_{t} = \sum\limits_{i = 1}^{m} {D_{t} (i)}$$

(7)

Step 4: Calculate the weight α_t for h_t(x) based on the error ε_t calculated in Step 3:

$$\alpha_{t} = \frac{1}{2}\ln \left( {\frac{{1 - \varepsilon_{t} }}{{\varepsilon_{t} }}} \right)$$

(8)

Step 5: Adjust the weight of training data.

$$D_{t + 1} (i) = \frac{{D_{t} (i)}}{{Z_{t} }} \times \left\{ {\begin{array}{*{20}l} {e^{{ - \alpha_{t} }} ,} \hfill & {\varepsilon_{t} < \delta_{t} } \hfill \\ {e^{{\alpha_{t} }} ,} \hfill & {\varepsilon_{t} \ge \delta_{t} } \hfill \\ \end{array} } \right.$$

(9)

where:

$$Z_{t} = \sum\limits_{i = 1}^{m} {D_{t} } (i) \times \left\{ {\begin{array}{*{20}l} {e^{{ - \alpha_{t} }} ,} \hfill & {\varepsilon_{t} < \delta_{t} } \hfill \\ {e^{{\alpha_{t} }} ,} \hfill & {\varepsilon_{t} \ge \delta_{t} } \hfill \\ \end{array} } \right.$$

(10)

Step 6: After training T rounds, obtain T weak predictors.

Step 7: Output strong predictor:

$$H(x) = \sum\limits_{t = 1}^{T} {\frac{{\alpha_{t} }}{{\sum\limits_{s = 1}^{T} {\alpha_{s} } }}} h_{t} (x)$$

(11)
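Steps 1 to 7 can be sketched in code as follows. This is an illustrative outline under stated assumptions, not the authors' implementation. To keep the example dependency-free, a weighted least-squares line (the hypothetical `fit_weighted_line`) stands in for the BP weak predictor. The threshold test of Eqs. (7), (9) and (10) is assumed to apply to each sample's absolute prediction error. The early stop when ε_t ≥ 0.5 is a standard AdaBoost convention that the text does not state.

```python
import math


def fit_weighted_line(xs, ys, w):
    """Weighted least-squares line; a stand-in for the BP weak predictor."""
    sw = sum(w)
    mx = sum(wi * x for wi, x in zip(w, xs)) / sw
    my = sum(wi * y for wi, y in zip(w, ys)) / sw
    sxx = sum(wi * (x - mx) ** 2 for wi, x in zip(w, xs))
    sxy = sum(wi * (x - mx) * (y - my) for wi, x, y in zip(w, xs, ys))
    slope = sxy / sxx if sxx > 0 else 0.0
    return lambda x: my + slope * (x - mx)


def adaboost_predictor(xs, ys, rounds=5, delta=0.5):
    """Build the strong predictor H(x) of Eq. (11) from weighted weak predictors."""
    m = len(xs)
    D = [1.0 / m] * m                                    # Step 1, Eq. (6)
    predictors, alphas = [], []
    for _ in range(rounds):                              # Step 6: T rounds
        h = fit_weighted_line(xs, ys, D)                 # Step 3: weak predictor
        bad = [abs(h(x) - y) > delta for x, y in zip(xs, ys)]
        eps = sum(d for d, b in zip(D, bad) if b)        # Eq. (7)
        if eps >= 0.5:
            # Assumption: stop once the weak predictor is no better than chance,
            # keeping at least one predictor so H(x) is defined.
            if not predictors:
                predictors.append(h)
                alphas.append(1.0)
            break
        eps = max(eps, 1e-10)                            # avoid log of infinity
        alpha = 0.5 * math.log((1.0 - eps) / eps)        # Step 4, Eq. (8)
        D = [d * math.exp(alpha if b else -alpha) for d, b in zip(D, bad)]
        Z = sum(D)                                       # Eq. (10)
        D = [d / Z for d in D]                           # Step 5, Eq. (9)
        predictors.append(h)
        alphas.append(alpha)
    total = sum(alphas)
    # Step 7, Eq. (11): weighted combination with normalized alphas
    return lambda x: sum(a / total * h(x) for a, h in zip(alphas, predictors))


# Toy data lying exactly on y = 2x + 1
xs = [float(i) for i in range(10)]
H = adaboost_predictor(xs, [2.0 * x + 1.0 for x in xs])
```

In an actual AdaBoost-BP model, `fit_weighted_line` would be replaced by training a BP network on the weighted (or resampled) training set; the reweighting and combination logic stays the same.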

Introduction to other machine learning methods

Besides the AdaBoost algorithm, DNN, XGBoost, Bagging, RF and other algorithms are often used for prediction (Lin et al.³⁰). Currently, the DNNs used in such analyses are mainly feedforward neural networks (FFNN). The depth of the network refers to the number of hidden layers. Unlike a traditional shallow neural network, a DNN extracts features layer by layer from low level to high level, learns the relationships in the data at a deeper level, and establishes a mapping from the bottom-level signal to the top-level signal. A DNN has a deep nonlinear structure that can approximate arbitrarily complex functions, which distinguishes it from traditional shallow neural networks and gives it a stronger ability to deal with complex, uncertain and fuzzy data. The topology of a DNN with three hidden layers is shown in Fig. 4. In the DNN structure, the layers are connected to one another.

XGBoost is a boosted tree model (Guo et al.²⁹) that integrates many decision trees to form a stronger learner. RF is a supervised ensemble learning method (Chen et al.³⁷); it is a classifier or regressor composed of multiple decision trees. In this paper, the BP, AdaBoost-BP, DNN, XGBoost and RF algorithms are used to predict the squeezing effect.

Case analysis

Data sources

The Metro Line 1 project in Bogota, the capital of Colombia, is about 23.9 km long, and the whole line is a viaduct. The site is characterized by a shallow groundwater level and thick silt and peat soil, often with fine sand and clay intercalations. Most pile foundations of the project are PHC pipe piles. The project has a total of 6232 PHC pipe piles, and each static pile driver drives an average of 3 piles per day. The PHC pipe pile model is PHC-1000-140, with a diameter of 1000 mm and a wall thickness of 140 mm. The piles are 15 to 48 m long, and the concrete strength grade of the pile body is C60. The number of piles under each bearing platform is mainly 12, 16 or 20, and the pile spacing is 2.5 m. There are many different types of pipelines around the pile groups (gas, water supply and drainage, cables, communications, etc.), made of materials including concrete, cast iron, PVC and ceramics. Most pipelines are 0.4 to 5 m from the adjacent pile foundations and are buried at depths of 1 to 5 m. The geological survey data of the project are complete. The construction company monitored the horizontal displacements of the pile tops and ground surface around piles before and after the construction of the PHC pipe piles, and obtained a large amount of measured data.

The inputs of the prediction model for the displacements of the pile tops are as follows. Feature1 is the moisture content (%). Feature2 is the natural weight (kN/m³). Feature3 is the relative density (%). Feature4 is the compression modulus (MPa). Feature5 is the cohesion (kPa). Feature6 is the internal friction angle (°). Feature7 is the resting time (days). Feature8 is the horizontal distance r (m) between the center of the bearing platform and the center of the pile (Fig. 5). Feature9 is the angle θ (°) between the line connecting the center of the bearing platform to the center of the pile and the positive direction of the X-axis (counterclockwise is positive, Fig. 5). Feature10 is the pile diameter (mm). Feature11 is the pile length (m). Feature12 is the pile bending stiffness EI (N·mm²). Feature13 is the number of piles per row in the X direction. Feature14 is the number of piles per row in the Y direction. Feature15 is the pile spacing (m). The output is the horizontal displacement (mm) of the pile tops. Part of the sample data is shown in Table 1. The values of Feature10 to Feature15 are constant in this case (1000, 30, 1.29E21, 3, 4 and 2.5, respectively). Trial calculations showed that removing these variables (Feature10 to Feature15) improves the calculation speed of the model and has no significant impact on its prediction accuracy, so they were removed in the modeling of this case. The pile layout of the 12-pile group is shown in Fig. 5. The pile driving sequence is 5 > 8 > 2 > 11 > 4 > 6 > 7 > 9 > 10 > 1 > 3 > 12.
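The two geometric inputs (Feature8 and Feature9) and the removal of constant inputs can be illustrated with a short sketch. The helper names are hypothetical, the pile coordinates are illustrative rather than taken from Fig. 5, and reporting θ in the range 0° to 360° is an assumption, since the text only states that counterclockwise is positive.

```python
import math


def polar_features(x, y, cx=0.0, cy=0.0):
    """Return (r, theta): distance (m) and counterclockwise angle (deg) from +X.

    (cx, cy) is the center of the bearing platform, (x, y) the pile center.
    """
    dx, dy = x - cx, y - cy
    r = math.hypot(dx, dy)
    theta = math.degrees(math.atan2(dy, dx)) % 360.0   # assumed range [0, 360)
    return r, theta


def drop_constant_columns(names, rows):
    """Remove features that take a single value across all samples."""
    keep = [j for j in range(len(names)) if len({row[j] for row in rows}) > 1]
    return [names[j] for j in keep], [[row[j] for j in keep] for row in rows]


# Illustrative pile located 2.5 m from the platform center along +Y
r, theta = polar_features(0.0, 2.5)

# Two toy samples: moisture content varies, pile diameter is constant
names, rows = drop_constant_columns(
    ["Feature1", "Feature10"], [[35.0, 1000.0], [42.0, 1000.0]]
)
```

A constant column carries no information for a model trained on a single pile-group configuration, which is consistent with the observation that removing Feature10 to Feature15 did not affect prediction accuracy in this case.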

Table 1 Partial samples of horizontal displacements at pile tops.
