Introduction

Formation permeability is a critical parameter in petroleum engineering, influencing fluid flow within reservoirs and impacting hydrocarbon recovery efficiency. Traditional methods for estimating permeability, such as core sampling and well testing, can be costly and time-consuming, often yielding limited spatial coverage of reservoir properties. Moreover, the inherent heterogeneity of geological formations can lead to significant discrepancies in permeability values, complicating reservoir characterization. As a result, there is an increasing need for innovative approaches that leverage advanced data analytics and machine learning techniques to enhance the accuracy and efficiency of permeability estimation from available well log data and other indirect measurements1.

Understanding and evaluating formation permeability is essential for several reasons:

  1. Hydrocarbon flow: Formation permeability directly affects the movement of oil and gas within the reservoir. High permeability allows for easier flow of hydrocarbons to the wellbore, which is essential for efficient extraction. Conversely, low permeability can hinder production rates and increase extraction costs1.

  2. Reservoir management: Understanding permeability is vital for effective reservoir management and development strategies. Engineers use permeability data to predict reservoir behavior under various production scenarios, optimize well placement, and enhance recovery techniques2.

  3. Enhanced oil recovery (EOR): In enhanced oil recovery methods, knowledge of formation permeability is crucial for selecting appropriate techniques, such as water flooding or gas injection. The effectiveness of these methods often depends on the permeability characteristics of the reservoir rock3.

  4. Modeling and simulation: Accurate permeability measurements are essential for creating reliable reservoir models and simulations. These models help engineers forecast production performance, evaluate the economic viability of projects, and make informed decisions regarding drilling and completion strategies4.

Therefore, the precise measurement of this parameter is of utmost importance. The prediction of reservoir permeability has long been of key interest in the industry, since investment decisions based on the volume of hydrocarbon resources depend on its accuracy5. Considerable time and money have therefore been allocated to taking advantage of technological advances in data collection from cores, well testing, and other activities to reduce the uncertainty in permeability data and improve reservoir performance prediction6. However, such conventional methods often face problems such as a lack of data in some regions, which makes it impossible to determine formation permeability there.

Machine learning (ML) is a subset of artificial intelligence (AI) that focuses on the development of algorithms and statistical models that enable computers to perform tasks without explicit programming. Instead of being programmed with specific instructions, ML systems learn from data, identifying patterns and making decisions based on that information. In the past decade, advancements in computer science, especially in artificial intelligence and machine learning, have enabled us to effectively extract essential information from real-world data collected in oil industry applications and employ it for better reservoir characterization7,8,9,10. ML is broadly acknowledged to improve our understanding of wells11, production, and reservoir areas12. In particular, ML is most broadly utilized in reservoir management and has accomplished important outcomes such as permeability, porosity, and tortuosity prediction13, modeling the minimum miscibility pressure of CO2-oil systems14, shale gas production forecasting15, reservoir characterization16, predicting formation damage in oil fields17, digital 3D core reconstruction18,19,20, well test interpretation21,22, shale gas production optimization23,24, well log processing25, modeling wax deposition of crude oils26, and history matching27,28. This has motivated numerous researchers to gradually abandon multiple linear regression models and empirical correlations in favor of ML when forecasting significant reservoir petro-physical properties29.

ML approach for permeability prediction

The artificial neural network (ANN) is a well-known approach for predicting permeability30. In recent years, several authors have studied reservoir characterization problems from different aspects, including soft computing methods31,32,33. The results of such studies revealed that soft computing models outperform regression models; the advantage of computational methods lies in the fact that elemental uncertainties or heterogeneities are not explicitly captured by regression methods32.

Permeability can be evaluated by interpreting in situ measurements taken by formation testers, well-testing equipment, and well logging. Transient well testing yields the average permeability-thickness product and thereby provides a wealth of information about the flow capacity of the reservoir. Another worthwhile method for measuring the absolute permeability of reservoirs is to conduct flow experiments on representative core samples34. A reliable and accurate permeability estimate therefore helps geoscientists manage the production process effectively.

Several studies have been proposed to estimate permeability. Mohagueg et al.35 presented three main approaches to permeability estimation, analytical, statistical, and computational, using well-log data. Chehrazi and Rezaee36 introduced a classification scheme for permeability prediction models, comprising analytical models, soft computing models, and porous phase models, using well-log data. Rezaee et al.37 presented the results of a research project that investigated permeability prediction for the Precipice Sandstone of the Surat Basin, in which machine learning techniques were applied to estimate permeability from multiple wireline logs. Tembely et al.38 emphasized the important role of feature engineering in predicting physical properties with machine and deep learning; their framework, which integrates various learning, rock imaging, and modeling algorithms, can rapidly and accurately estimate petrophysical properties to facilitate reservoir simulation and characterization. Okon et al.39 presented an ANN model, developed from logs of fifteen fields, to forecast the physical properties of reservoirs, namely porosity, permeability, and water saturation. Yasin et al.40 applied a joint inversion technique based on a multilayer linear calculator and particle swarm optimization to estimate the spatial variation of important petrophysical parameters, e.g. porosity, permeability, and saturation, and essential geo-mechanical properties (Poisson's ratio and Young's modulus) for downhole zones using seismic data. Anifowose et al.41 conducted a rigorous parametric study to examine the comparative accuracy of ML techniques in estimating the permeability of a Middle East carbonate reservoir by integrating seismic attributes and wireline data. Akande et al.42 studied the predictability and impact of feature engineering on the precision of support vector machines in estimating carbonate reservoir permeability using well-log data. Bruce et al.43 applied an ANN to permeability estimation using wireline logs. El Ouahed et al.44 proposed combining an ANN with fuzzy logic to account for fractured reservoirs using well-log data. Al Khalifah et al.45 used an ANN and genetic algorithms to estimate core permeability measured in laboratory experiments.

The studies mentioned in the previous paragraph are summarized in Table 1, which compares the permeability estimation research of different authors by method and type of data. It can be seen that researchers have used statistical tools and artificial intelligence to estimate formation permeability, and that the data used include well log data, rock imaging data, seismic data, and core data.

Table 1 Tools and data types used by researchers to estimate permeability.

Lost circulation vs. formation permeability

Lost circulation is a prevalent drilling problem, especially in formations with high permeability and natural or induced fractures46,47. Lost circulation can occur in a variety of formations, ranging from shallow, unconsolidated geological layers to well-consolidated layers that are disrupted by the hydrostatic pressure of the drilling fluid48,49. Two conditions are necessary for loss of circulation in the borehole to occur: first, the pressure at the bottom of the well must exceed the pore pressure, and second, there must be a fluid flow path for the lost circulation50. The underground routes through which lost circulation occurs can be divided into the following classes:

  • Cavernous formations: Some formations contain caverns and empty spaces; when such formations are drilled, a large amount of mud loss occurs (Fig. 1a).

  • Natural fractures: A natural fracture network, created by tectonic activity in the formation, can act as a conduit for leakage; the amount of leakage depends on several factors discussed below (Fig. 1b).

  • Induced fractures (e.g. from quick tripping or blowouts): In this mechanism, drilling operations such as tripping and events such as blowouts increase the bottom-hole pressure, and fractures are induced. Because some formations have low strength, the additional stress applied to the formation causes it to fracture (Fig. 1c).

  • Highly permeable formations: In permeable formations, the pressure difference between the bottom hole and the formation causes a large amount of drilling fluid to leak into the formation (Fig. 1d).

Fig. 1 Schematic classification of lost circulation51.

Fractures are an important cause of drilling fluid loss to formations; the severity of lost circulation depends on the fracture opening width, fracture density, orientation, distribution, the fracture network, etc.52,53,54.

The loss rate indicates the paths of lost circulation and can show which remedial technique should be employed to counteract the loss. Lost circulation severity can be divided into four classes as follows55,56,57:

  • Seepage losses: less than 1 m3/h.

  • Partial losses: 1–10 m3/h.

  • Severe losses: more than 15 m3/h.

  • Complete losses: no fluid comes out of the annulus.

In this study, for the first time, a novel method is proposed that puts mud loss data to another important use. To this end, synthetic data derived from a commercial reservoir simulator are used to train and build the AI models. Simulator-generated mud loss data are used to estimate formation permeability with Deep Jointly Informed Neural Networks (DJINN) and Convolutional Neural Networks (CNN), taking formation type, formation thickness, mud density, mud viscosity, drilling depth, and mud loss rate as inputs and yielding an accurate prediction of formation permeability.

Methodology

The model development diagram is shown in Fig. 2, and the method preparation is discussed in Sects. 2.1 to 2.5. The model development flowchart begins with data generation, in which all of the parameters related to mud loss are generated. The generated data are then subjected to statistical analysis. Next, the data undergo preprocessing to make them suitable for modeling. The modeling phase begins with initializing the hyper-parameters of the deep learning models. Once the hyper-parameters are initialized, the models are trained with the adaptive moment estimation (Adam) optimizer. The hyper-parameters are then adjusted iteratively by trial and error until the model shows good performance metrics with minimum error; a sketch of this loop is given after the flowchart below.

Fig. 2 Deep learning model development flowchart.
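As a rough illustration of this trial-and-error loop, the sketch below iterates over a small hyper-parameter grid, trains a model with the Adam optimizer, and keeps the configuration with the lowest validation RMSE. The grid values, the placeholder architecture in build_model, and the training settings are illustrative assumptions, not the settings used in this study.

```python
# Minimal sketch of the trial-and-error hyper-parameter search described above.
# Grid values, build_model() and training settings are illustrative assumptions.
import itertools
import numpy as np
import tensorflow as tf

def build_model(n_units, learning_rate, n_features):
    """Small fully connected regressor used as a placeholder architecture."""
    model = tf.keras.Sequential([
        tf.keras.layers.Input(shape=(n_features,)),
        tf.keras.layers.Dense(n_units, activation="elu"),
        tf.keras.layers.Dense(1),
    ])
    model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate), loss="mse")
    return model

def tune(x_train, y_train, x_val, y_val):
    best_rmse, best_cfg = np.inf, None
    for n_units, lr in itertools.product([32, 64, 128], [1e-2, 1e-3]):
        model = build_model(n_units, lr, x_train.shape[1])
        model.fit(x_train, y_train, epochs=100, batch_size=32, verbose=0)
        rmse = float(np.sqrt(model.evaluate(x_val, y_val, verbose=0)))
        if rmse < best_rmse:                      # keep the best configuration
            best_rmse, best_cfg = rmse, (n_units, lr)
    return best_cfg, best_rmse
```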

Data generation

As shown in Fig. 3, the drilling fluid loss process is similar to fluid injection into a porous medium. According to Darcy's law, the loss rate depends on the bottom-hole pressure, the formation pressure, the viscosity of the drilling fluid, and the formation permeability. In this study, reservoir simulator software (Eclipse E100) was used to simulate the drilling fluid loss process and generate the mud loss data used to train the models.

Fig. 3 Lost circulation process (h: formation thickness, k: formation permeability, Pw: bottom-hole pressure).
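For orientation, the loss process sketched in Fig. 3 can be idealized as steady-state radial Darcy flow from the wellbore into the formation; a minimal, illustrative form in consistent units (not necessarily the exact equation implemented in the simulator) is

$$q = \frac{{2\pi kh(P_{w} - P_{e} )}}{{\mu \ln (r_{e} /r_{w} )}}$$

where q is the mud loss rate, k the formation permeability, h the formation thickness, \(P_{w}\) the bottom-hole pressure, \(P_{e}\) the formation pressure, \(\mu\) the mud viscosity, and \(r_{e}\) and \(r_{w}\) the drainage and wellbore radii.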

The data available in the drilling process include mud weight, mud viscosity, drilling depth, mud loss rate, formation type, and the thickness of the drilled formation up to that depth. Therefore, 810 series of mud loss data were generated according to the following assumptions (an illustrative sketch of the resulting data structure follows the list):

  1. Lost circulation limits: 1–250 bbl/hr.

  2. Fluid type: water-based mud.

  3. Increasing mud viscosity with increasing mud weight.

  4. Increasing mud weight with increasing depth in general.

  5. 10 layers with different pore pressures.
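A minimal sketch of how such a dataset could be assembled under the assumptions above is given below. The numeric ranges, trends, and column names are illustrative assumptions; in this study the mud loss rate and the corresponding permeability were produced by the Eclipse E100 simulator rather than drawn at random.

```python
# Illustrative structure of the synthetic mud loss dataset (810 records).
# Ranges, trends and column names are assumptions, not the study's settings.
import numpy as np
import pandas as pd

rng = np.random.default_rng(42)
n = 810

formation_type = rng.integers(1, 11, size=n)                                 # 10 layers
depth = rng.uniform(500.0, 4000.0, size=n)                                   # m (assumed range)
mud_weight = 1.0 + 0.0002 * depth + rng.normal(0.0, 0.02, n)                 # rises with depth
mud_viscosity = 10.0 + 30.0 * (mud_weight - 1.0) + rng.normal(0.0, 1.0, n)   # rises with weight
thickness = rng.uniform(5.0, 100.0, size=n)                                  # m (assumed range)
mud_loss_rate = rng.uniform(1.0, 250.0, size=n)                              # bbl/hr, stated limits
permeability = rng.uniform(1.0, 1000.0, size=n)                              # md, placeholder target

data = pd.DataFrame({
    "formation_type": formation_type,
    "depth_m": depth,
    "mud_weight": mud_weight,
    "mud_viscosity_cp": mud_viscosity,
    "thickness_m": thickness,
    "mud_loss_bbl_hr": mud_loss_rate,
    "permeability_md": permeability,
})
print(data.describe())
```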

Statistical analysis of generated data

Data analysis of the mud loss dataset focused on descriptive and inferential statistics, with an emphasis on univariate analysis. The data are summarized by the distribution of each parameter in Table 2, and Fig. 4 shows the histogram of each variable.

Table 2 Data summary of the generated mud loss data.
Fig. 4 Histogram of mud loss data.

The correlation coefficient (CC) is mainly used to test the linear association between parameters. It can be expressed as follows:

$$CC = \frac{{\sum\limits_{i = 1}^{n} {(x_{m} - \overline{x}_{m} )(x_{p} - \overline{x}_{p} )} }}{{\sqrt {\sum\limits_{i = 1}^{n} {(x_{m} - \overline{x}_{m} )^{2} \sum\limits_{i = 1}^{n} {(x_{p} - \overline{x}_{p} )^{2} } } } }}$$
(1)

where n represents the number of experimental data, \({x}_{m}\) and \({x}_{p}\) define the measured and predicted parameters, respectively, and \(\overline{x}_{m}\) and \(\overline{x}_{p}\) signify their average values58.
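Equation (1) is the standard Pearson correlation coefficient. A brief sketch of how it, and the full matrix shown in Fig. 5, could be computed is given below; the DataFrame name data refers to the illustrative table sketched earlier and is an assumption.

```python
# Pearson correlation coefficient (Eq. 1) and the full CC matrix.
import numpy as np

def pearson_cc(x_m: np.ndarray, x_p: np.ndarray) -> float:
    """Direct implementation of Eq. (1)."""
    dm, dp = x_m - x_m.mean(), x_p - x_p.mean()
    return float((dm * dp).sum() / np.sqrt((dm ** 2).sum() * (dp ** 2).sum()))

# Full matrix over all generated variables (pandas uses Pearson by default).
cc_matrix = data.corr()
```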

Figure 5 shows the CC matrix for the analyzed variables. According to these data, the mud loss rate is the main parameter influencing permeability, while the other parameters have insignificant effects on it. The CC matrix also shows that the generated data are comparable to operational data; for example, the high correlation between fluid viscosity and fluid density is similar to the relation between these parameters under real conditions.

Fig. 5 Correlation coefficient matrix for mud loss data.

Figure 6 shows the pairwise relationships between the variables. Some pairs, e.g. depth versus formation type and drilling fluid viscosity versus density, show an approximately linear relation, while other pairs are clearly non-linear; this reflects the random conditions assumed for drilling and the absence of a linear relationship between those parameters under real conditions.

Fig. 6 Non-linear relationships between variables.

Data arrangement

Mud loss data were passed through three steps before being fed to the deep learning models: organizing the categorical data, pre-processing the data with a normalization scaler, and splitting the data into training and testing sets.

The categorical variable organized in the data collection stage is formation type, which was assigned the values 1 to 10. Normalization was then performed to scale the variables to between 0 and 1. Such normalized data are used to train the neural network models since normalization enhances the learning process and reduces computational cost59. The formula used for normalization is as follows:

$$\hat{x}_{i} = \frac{{x_{i} - x_{\min } }}{{x_{\max } - x_{\min } }}$$
(2)

In Eq. (2), \(x_{i}\) is the value of the variable for the ith observation, \(x_{\min }\) is the minimum value of the variable, and \(x_{\max }\) is its maximum value. Finally, the reshaped data were split into training and testing sets using an 80:20 ratio.
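A minimal sketch of these arrangement steps, min-max normalization per Eq. (2) followed by an 80:20 split, is shown below; the column names follow the earlier illustrative DataFrame and are assumptions.

```python
# Data arrangement sketch: min-max scaling (Eq. 2) and an 80:20 train/test split.
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import MinMaxScaler

# `data` is the generated table; `permeability_md` is the target (illustrative names).
X = data.drop(columns=["permeability_md"]).values
y = data["permeability_md"].values.reshape(-1, 1)

x_scaler, y_scaler = MinMaxScaler(), MinMaxScaler()
X_scaled = x_scaler.fit_transform(X)      # each feature mapped to [0, 1]
y_scaled = y_scaler.fit_transform(y)

X_train, X_test, y_train, y_test = train_test_split(
    X_scaled, y_scaled, test_size=0.2, random_state=0)   # 80:20 split
```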

Convolutional neural networks (CNN)

In the past decade, Convolutional Neural Networks have been responsible for breakthroughs in computer vision and image processing 60,61,62, obtaining state-of-the-art results on a range of benchmark and real-world tasks. More recently, one-dimensional CNNs have shown great promise in processing structured linguistic data in tasks such as machine translation 63,64 and document classification 65,66. Bai et al. 67 in 2018 indicated that, for many sequence modeling tasks, 1D-CNNs using current best practices such as dilated convolution often perform better than other recurrent neural network architectures.

A convolutional neural network is a type of feedforward neural network that consists of multiple convolution stages, which perform feature extraction, and a single output stage that combines the extracted high-level features to predict the desired output68. Figure 7 shows an example of a 1D-CNN architecture for the forecasting model.

Fig. 7 Example of one-dimensional convolutional neural network (1D-CNN) architecture.

Table 3 shows the elements of the neural network used in this study, which includes one 1D convolution layer, one flatten layer, two dropout layers with a rate of 0.2, and two fully connected layers. The exponential linear unit (ELU) activation function is applied in the convolution and fully connected layers (an illustrative sketch of this architecture is given after Table 3).

Table 3 Details of the used 1D-CNN model.
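A hedged Keras sketch of the architecture summarized in Table 3 is given below: one Conv1D layer, a flatten layer, two dropout layers with a rate of 0.2, two fully connected layers, and ELU activations. The filter count, kernel size, and dense-layer width are illustrative assumptions, since the exact values from Table 3 are not reproduced here.

```python
# Sketch of the 1D-CNN regressor described in Table 3.
# Filter count, kernel size and dense width are assumptions.
import tensorflow as tf

n_features = 6   # formation type, thickness, mud density, viscosity, depth, loss rate

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(n_features, 1)),              # each sample as a short 1D sequence
    tf.keras.layers.Conv1D(filters=32, kernel_size=2, activation="elu"),
    tf.keras.layers.Dropout(0.2),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(32, activation="elu"),
    tf.keras.layers.Dropout(0.2),
    tf.keras.layers.Dense(1),                                   # normalized permeability
])
model.compile(optimizer=tf.keras.optimizers.Adam(1e-3), loss="mse")
```

Before fitting, the scaled feature matrix would be reshaped to (samples, n_features, 1), e.g. X_train.reshape(-1, n_features, 1).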

Deep jointly informed neural networks (DJINN)

The DJINN algorithm determines the appropriate deep neural network architecture and initializes the weights using the dependency structure of the decision tree trained on the data. The algorithm can be divided into three steps: building a set of decision trees, mapping the tree to a neural network, and fine-tuning the neural network through backpropagation 69.

Decision tree construction

The first step of the DJINN algorithm is to build a decision-tree-based model. This can be a single decision tree, generating one neural network, or an ensemble of trees such as a random forest70, which creates a set of neural networks. The depth of the trees is often limited to avoid creating neural networks that are too large; the maximum tree depth is a hyper-parameter that must be tuned for each data set69.

Mapping decision trees to deep neural networks

The DJINN algorithm selects a deep neural network architecture and a set of initial weights based on the structure of the decision tree. The mapping is not intended to reproduce the decision tree; instead, it uses the decision paths as a guide for the network architecture and weight initialization. Neural networks are initialized layer by layer, whereas decision trees are typically stored by decision path. Each path starts at the top branch of the tree and follows each decision to the left and then to the right until it reaches a leaf (prediction). Because of the way trees are stored, it is difficult to traverse the tree by depth, but it is easy to traverse it recursively. Mapping from a tree to a neural network is easiest if the structure of the tree is known before the neural network weights are initialized. Therefore, the decision paths are traversed twice: first to determine the structure and then to initialize the weights69.
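The exact weight-mapping scheme is given in Ref. 69 and is not reproduced here. As a simplified illustration of the general idea, the sketch below uses a depth-limited decision tree only to choose hidden-layer widths (one layer per tree depth, sized by the number of split nodes at that depth) and then builds a plain dense network; the full DJINN weight initialization is omitted.

```python
# Simplified, DJINN-inspired sketch: size hidden layers from a decision tree.
# This is NOT the full DJINN mapping; weights here are default-initialized.
import numpy as np
import tensorflow as tf
from sklearn.tree import DecisionTreeRegressor

def tree_informed_widths(X, y, max_depth=4):
    """Fit a depth-limited tree and count split nodes at each depth."""
    tree = DecisionTreeRegressor(max_depth=max_depth).fit(X, y).tree_
    depths = np.zeros(tree.node_count, dtype=int)
    for node in range(tree.node_count):                       # children have larger indices
        for child in (tree.children_left[node], tree.children_right[node]):
            if child != -1:
                depths[child] = depths[node] + 1
    is_split = tree.children_left != -1                        # mask of non-leaf nodes
    return [max(1, int(np.sum(is_split & (depths == d)))) for d in range(max_depth)]

def build_tree_informed_network(n_features, widths):
    layers = [tf.keras.layers.Input(shape=(n_features,))]
    layers += [tf.keras.layers.Dense(w, activation="relu") for w in widths]
    layers += [tf.keras.layers.Dense(1)]                       # regression output
    return tf.keras.Sequential(layers)
```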

Optimizing the neural networks

As soon as the tree is mapped to the initialized neural network, the weights are adjusted using backpropagation. Here, the deep neural network is trained with Google's deep learning library TensorFlow. The activation function used in each hidden layer is the rectified linear unit, which generally works well in deep neural networks71,72 and can retain the values of neurons in previous hidden layers. The Adam optimizer73 is used to minimize the cost function (mean squared error (MSE) for regression, cross-entropy with logits for classification)74.
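A short sketch of this fine-tuning step, assuming the tree-informed network and the train/test arrays from the earlier sketches, might look as follows; the learning rate, epoch count, and batch size are assumptions.

```python
# Fine-tuning the mapped network with backpropagation and the Adam optimizer.
import tensorflow as tf

widths = tree_informed_widths(X_train, y_train, max_depth=4)
model = build_tree_informed_network(n_features=X_train.shape[1], widths=widths)

# Regression (this study): mean squared error.  For classification one would
# instead use, e.g., tf.keras.losses.BinaryCrossentropy(from_logits=True).
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-3), loss="mse")
model.fit(X_train, y_train, validation_data=(X_test, y_test),
          epochs=300, batch_size=32, verbose=0)
```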

Model performance evaluation

It is necessary to define the criteria used to evaluate model performance. In this work, root mean squared error, mean absolute error, mean absolute percentage error, R-squared, and relative error were used as statistical indicators to evaluate the performance of the models.

Root mean squared error (RMSE)

The root mean squared error is used to see how well the network output matches the desired output; smaller RMSE values indicate better performance. It is defined as follows75:

$$RMSE = \sqrt {\frac{1}{n}\sum\limits_{i = 1}^{n} {(x_{m} - x_{p} )^{2} } }$$
(3)

Mean absolute error (MAE)

The mean absolute error is the average of the absolute differences between the predicted and actual values, with all individual errors weighted equally. Furthermore, MAE is the most natural and unambiguous measure of the average error level58.

$$MAE = \frac{1}{n}\sum\limits_{i = 1}^{n} {\left| {x_{p} - x_{m} } \right|}$$
(4)

Mean absolute percentage error (MAPE)

The mean absolute percentage error is calculated by dividing the absolute error of each observation by the corresponding observed value and then averaging these percentages. This approach is useful when the size or dimension of the predicted variable is important in assessing prediction accuracy76,77. MAPE indicates the magnitude of the forecast error relative to the actual value.

$$MAPE = \frac{1}{n}\sum\limits_{i = 1}^{n} {\frac{{\left| {x_{p} - x_{m} } \right|}}{{x_{m} }}} \times 100\%$$
(5)

R-squared (R2)

An important index to check the correctness of the regression algorithm is \({R}^{2}\), which ranges from 0 to 1. \({R}^{2}\) is defined as follows58:

$$R^{2} = 1 - \frac{{\sum\limits_{i = 1}^{n} {(x_{m} - x_{p} )^{2} } }}{{\sum\limits_{i = 1}^{n} {(x_{m} - \overline{x}_{m} )^{2} } }}$$
(6)

where n represents the number of observations, \({x}_{m}\) and \({x}_{p}\) define the measured and predicted parameters, respectively, and \(\overline{x}_{m}\) signifies the average of the measured parameters.

Relative error (RE)

The relative error is defined as the ratio of the difference between the predicted and measured values to the measured value. If \({x}_{m}\) is the measured value of a quantity and \({x}_{p}\) is the predicted value, the relative error can be calculated using the formula below78.

$$RE = \frac{{x_{p} - x_{m} }}{{x_{m} }}$$
(7)
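For completeness, a compact implementation of Eqs. (3)–(7), with x_m and x_p as NumPy arrays of measured and predicted values, could look like this:

```python
# Performance metrics from Eqs. (3)-(7).
import numpy as np

def rmse(x_m, x_p):
    return float(np.sqrt(np.mean((x_m - x_p) ** 2)))           # Eq. (3)

def mae(x_m, x_p):
    return float(np.mean(np.abs(x_p - x_m)))                   # Eq. (4)

def mape(x_m, x_p):
    return float(np.mean(np.abs(x_p - x_m) / x_m) * 100.0)     # Eq. (5), percent

def r_squared(x_m, x_p):
    ss_res = np.sum((x_m - x_p) ** 2)
    ss_tot = np.sum((x_m - x_m.mean()) ** 2)
    return float(1.0 - ss_res / ss_tot)                        # Eq. (6)

def relative_error(x_m, x_p):
    return (x_p - x_m) / x_m                                   # Eq. (7), element-wise
```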

Results and discussion

Based on the previously mentioned methods, the structural parameters of the CNN and DJINN models for predicting formation permeability were determined, and the models were trained and tested. Real versus predicted values of permeability (md) for the training and testing data are displayed as cross-plots in Figs. 8 and 9. R2 represents an alternative measure of forecast accuracy: as a precision indicator, it represents the proportion of the variance of the dependent variable that can be predicted from the chosen independent variables. If R2 = 1, the permeability of the formation can be predicted without error from the selected independent variables.

Fig. 8 Cross-plots of the real value versus predicted values of the normalized permeability by 1D-CNN for training data.

Fig. 9 Cross-plots of the real value versus predicted values of the normalized permeability by DJINN for testing data.

As can be seen from Table 4, the 1D-CNN prediction model has sufficiently high accuracy on training and test data (for training data: R2 = 0.968, RMSE = 50.78, MAE = 37.50, MAPE = 16.39; for test data: R2 = 0.962, RMSE = 58.17, MAE = 42.95, MAPE = 11.29). As shown in Table 5, the DJINN prediction model has also sufficiently high accuracy on training and test data (for training data: R2 = 0.973, RMSE = 46.15, MAE = 34.34, MAPE = 9.57; for test data: R2 = 0.970, RMSE = 51.39, MAE = 39.56, MAPE = 13.53).

Table 4 1D-CNN model accuracies.
Table 5 DJINN model accuracies.

Figures 10 and 11 show the relative error for the 1D-CNN and DJINN models, respectively. They indicate that the accuracy for data with low values is lower than for data with high values; therefore, these models are more suitable for predicting high-valued data.

Fig. 10 Training and testing relative error for 1D-CNN.

Fig. 11 Training and testing relative error for DJINN.

Figure 12 compares the computational errors on the training and test data for the two algorithms. It shows that the RMSE, R2, MAE, and MAPE of the DJINN model are better than those of the 1D-CNN model; therefore, DJINN is the better algorithm for predicting formation permeability.

Fig. 12 Comparing RMSE, MAE, MAPE, and R2 accuracies for training and testing data.

Table 6 compares the results of this study with those of recent permeability estimation studies. While most studies have used well log, seismic, rock imaging, and core data to estimate formation permeability with analytical, statistical, computational, and artificial intelligence tools, this study uses deep learning algorithms to estimate formation permeability from mud loss data.

Table 6 Comparison of the results obtained with those of recent permeability estimation studies.

Conclusions

Permeability is a key parameter in reservoir characterization. There are various methods to evaluate the formation and estimate its permeability, but in some cases the evaluation cannot be performed, or is not performed correctly.

This study estimated formation permeability using drilling fluid data and two deep learning algorithms, 1D-CNN and DJINN. Drilling data, including depth, formation type, fluid density, fluid viscosity, formation thickness, and mud loss rate, were generated with reservoir simulator software to resemble real-world conditions.

The results show that DJINN (R2 of 0.973 on training data and 0.970 on test data) is a more accurate model than 1D-CNN (R2 of 0.968 on training data and 0.962 on test data) for this problem. Therefore, this study presents a novel method that uses mud loss data to estimate formation permeability accurately with deep learning algorithms (Supplementary Table S1).