Introduction

The exploration and development of shale oil in China have recently achieved remarkable progress, particularly in the Ordos Basin. In 2019, the Qingcheng oilfield, with reserves exceeding 10 × 10⁸ t of shale oil, was discovered in the Chang 7 member of the Mesozoic Yanchang Formation in this basin1. Permeability is a crucial petrophysical parameter in reservoir evaluation as it directly influences the fluid output capacity of the reservoir. Therefore, accurate permeability prediction plays a vital role in reservoir assessment2,3,4,5,6. Permeability is controlled by porosity, lithology, and pore structure characteristics7,8,9,10,11. Shale oil reservoirs exhibit highly complex pore structures, a wide range of pore sizes, and high heterogeneity12,13. Traditional methods for predicting reservoir permeability have shown low accuracy, making the effective prediction of shale oil reservoir permeability a challenging issue that requires immediate attention.

For highly heterogeneous reservoirs, there are currently three primary types of models used to evaluate permeability: physical evaluation models, electrical evaluation models, and statistical classification models14,15,16,17,18,19,20,21. Physical evaluation models center on factors like porosity, bound water saturation, particle sorting, particle size, mineral composition, and specific surface area, which influence permeability. Electrical evaluation models consider conductivity, pore connectivity coefficient, and cementation index as key factors. Statistical classification models analyze the relationship between porosity and permeability through techniques like lithologic identification, physical phase constraints, structural classification, and flow unit division. Although physical and electrical evaluation models account for physical factors such as area and surface, they often have complex structures and numerous parameters, making it challenging to obtain accurate model parameters, especially in highly heterogeneity reservoirs. In contrast, statistical classification models can more directly reflect physical changes in reservoirs and are more practical.

The flow unit classification method is the most widely used quantitative classification method for reservoirs in statistical classification models, especially when only core physical property data are available22. A hydraulic flow unit (HFU) is a reservoir genesis unit that resulting from the interaction of geological processes such as sedimentation, diagenesis, and tectonic movement. It enables the quantitative description and characterization of reservoir heterogeneity, revealing the heterogeneity and fluid seepage characteristics of the reservoir23. Chen et al. used nuclear magnetic resonance (NMR) logging data to calculate the continuous flow unit index FZI and predict the permeability of a tight sandstone gas reservoir based on the classification24. Zhou et al. classified the HFU of sandstone reservoirs based on well logging data and analyzed the physical characteristics of each flow unit. They also established a model for evaluating permeability in shale gas reservoirs25. Lu et al. conducted a flow unit division of shale oil using well logging data and predicted the permeability of pure mud shale reservoirs26. However, previous researchers have only quantitatively divided reservoirs based on the FZI from a physical perspective, which fails to reflect the reservoir permeability determined by the complex depositional environment and exhibits obvious limitations27,28,29,30. In recent years, machine learning algorithms have been increasingly employed in permeability prediction31,32,33. Some of the commonly used algorithms include support vector machines (SVMs), random forests (RFs), decision trees (DTs), extreme learning machines (ELMs), and artificial neural networks (ANNs).ANNs are known for their ability to model complex nonlinear relationships and have been widely applied in various fields, including reservoir engineering34,35. ANN is inspired by the capabilities of the brain to process information, consisting of a number of neurons (nodes) linked by a special architecture. The ANN can minimize the errors between target outputs (in training dataset) and computed outputs (from the output nodes) through adjusting the weights of links and bias of nodes on the basis of a certain training algorithm. A Back Propagation (BP) neural network is a multilayer feed-forward ANN that uses an error back propagation algorithm for training36. In contrast to the complex processes and high requirements of traditional permeability prediction methods, which involve intricate mathematical derivations, parameter adjustments, iterative computations, factor correlations, and a need for complete data, BP neural network with its flexible model structure, formidable nonlinear mapping capabilities, and robust self-adaptation and self-learning abilities, demonstrate remarkable advantages in terms of efficiency and applicability37.

In order to overcome the limitations of traditional methods and enhance the accuracy of permeability prediction, this paper investigates the fundamental characteristics of the shale oil reservoir of the Chang 7 member in the Sai 392 area. By analyzing well logs, core samples, physical property data, and experimental techniques such as casting thin section (CTS) observation and scanning electron microscopy (SEM). The correlations among lithology, porosity, pore structure parameters, and permeability are thoroughly explored and discussed. An improved method is established by combining lithology and hydraulic flow units (HFUs) to predict permeability using conventional logs. This method improves the accuracy of permeability predictions and offers a reliable foundation for identifying high-quality reservoirs and achieving efficient reservoir development.

Regional geological background

The Sai 392 area is situated in the central region of the Yishan slope within the Ordos Basin. It is between Zhidan and Jingbian counties in Shaanxi Province. The region has a flat tectonic structure, with several rows of low-amplitude nose structure belts locally developed against the background of a low-angle west-dipping monocline38 (Fig. 1a). The thickness of the Chang 7 layer in the Ordos Basin ranges from approximately 80–120 m. These lithofacies are distributed along the margin of the lake basin in a ring belt, with various types of deltaic facies near the shoreline. The lake depth increased toward the center of the basin39. During the sedimentary period of the Chang 7 member in the basin, the material sources were mostly oriented in the northeast‒southwest directions. The sand bodies at the delta front and the turbidite sand bodies formed in the semideep to deep lake were relatively widespread40. The source material of the Yanchang Formation in the study area originated primarily from the northeast. The Chang 7 member is primarily composed of delta front sediments, which can be further subdivided into four sedimentary microfacies: the underwater distributary channel, estuary dam, underwater natural levee, and interdistributary bay. The reservoir lithology varies significantly, primarily consisting of fine sandstone, siltstone, and mudstone. The reservoir sand bodies are oriented northwest‒southeast and are arranged in a flower-like shape (Fig. 1b).

Fig. 1
Fig. 1
Full size image

Location and geological setting of the study area (modified by38,39,40). (a) Geological tectonics38. (b) Sedimentary facies39,40. (Made using CorelDRAW Graphics Suite 2024, version: v25.2.1.313, Website: https://www.coreldraw.com/en/).

Results and discussion

Reservoir characterization

Lithological characteristics

To elucidate the properties of the reservoir, we first analyzed the lithological characteristics of the Chang 7 member in the study area. The dominant lithologies consist of lithic arkose and arkose, and the sandstone grains are mostly extremely fine-grained to fine-grained, with moderate to good sorting. The grain shapes are mostly subangular, and the rock debris is primarily composed of metamorphic rocks and mica. The average content of the matrix is 5.1%, with hydromica and chlorite serving as the primary interstitial materials. The cement content ranges from 0.5 to 30% (average of 7.5%). The primary cement is ferroan calcite, with smaller amounts of calcite, siliceous cement, and dolomite cement (Fig. 2).

Fig. 2
Fig. 2
Full size image

Ternary diagram of the sandstone composition of the Chang 7 shale oil reservoirs. Q: Quartz, F: Feldspar, R: Rock debris.

The lithological data of the shale oil reservoirs within the Chang 7 member in the study area reveal a composition primarily consisting of fine sandstone, argillaceous siltstone, silty mudstone, and mudstone. These lithologies exhibit varying thicknesses. Fine sandstone is the most prevalent, accounting for 48.29%, followed by mudstone at 19.73% (Fig. 3b). An analysis of permeability distribution across these lithologies reveals that the permeability of the fine sandstone ranges from 0.052 to 0.14 mD, that of the argillaceous siltstone ranges from 0.035 to 0.094 mD, that of the silty mudstone ranges from 0.009 to 0.06 mD, and that of the mudstone is less than 0.048 mD (Fig. 3a). The permeability decreases progressively from the fine sandstone to the mudstone, indicating that lithology is the key factor influencing permeability. Therefore, incorporating lithological parameters is crucial for accurate permeability predictions.

Fig. 3
Fig. 3
Full size image

Different lithological characteristics of the Chang 7 shale oil reservoirs. (a) Range of permeability distribution. (b) Thickness ratios.

Physical characteristics

The analysis of the porosity–permeability data reveals that the porosities primarily range between 4 and 9%, while the permeabilities are mainly distributed between 0.01 and 0.2 mD (Fig. 4a and b). These findings suggest that the reservoirs in this region have ultralow porosity and permeability. Additionally, a weak correlation between porosity and permeability was observed (Fig. 4c), likely due to late diagenesis of the reservoirs. The Chang 7 reservoirs have experienced significant compaction, resulting in the loss of numerous pores and a decrease in porosity (Fig. 5g). Conversely, the dissolution of feldspar has generated dissolution pores, enhancing pore connectivity and improving the reservoir storage capacity (Fig. 5a, b, and e). However, the presence of ferric calcite and clay as cementing materials led to more developed cementation processes, leading to reduced pore space and pore throat sizes, ultimately diminishing the reservoir seepage capacity (Fig. 5d and h).

Fig. 4
Fig. 4
Full size image

Reservoir quality of the Chang 7 shale oil reservoirs. (a) The distribution of reservoir porosity. (b) The distribution of reservoir permeability. (c) The relationship between porosity and permeability from the reservoir test data.

Fig. 5
Fig. 5
Full size image

Microscopic characteristics of the Chang 7 shale oil reservoirs. CTS observations: (a) Feldspar dissolution pore, well X538, 1970.71 m. (b) Intergrain pores, feldspar dissolution pores, well S544, 2142.38 m. (c) Micropores, well X538, 1972.81 m. (d) Ferroan calcite, well X538, 1972.2I. SEM observations: (e) Feldspar dissolution pore, well X538, 1990.91 m. (f) Clay micropores, well X538, 1990.91 m. (g) Mica compressive deformation, well X538, 1972.81 m. (h) Ferroan dolomite, chlorite, illite, well S246 1973.6 m. InterG: intergranular. F-disso: feldspar dissolution.

Pore structure characteristics

The Chang 7 shale oil reservoirs in the study area exhibit poor pore development. As observed in the CTSs, the total surface porosity of only 1.93%. Dissolution pores are the predominant type of pores in this rock, accounting for 1.34% of the surface porosity. Some samples also contain intergranular pores and a small number of clastic dissolution pores, with surface porosities of 0.52% and 0.07%, respectively (Fig. 5a–c). SEM images (Fig. 5e and f) demonstrate the presence of numerous dissolution pores and clay micropores. The pore types are mainly composed of dissolution pores and micropores, intergranular pores and dissolution pores, and micropores.

The analysis of the Hg pressure data revealed that the median radii of the reservoir ranged from 0 to 0.17 µm (average of 0.09 µm). The outlet pressures were mainly distributed between 1.09 and 41.41 MPa (average of 7.36 MPa). The maximum pore throat radii were primarily distributed between 0.02 and 0.68 µm (average of 0.039 µm), falling within the fine pore throat, micro- to fine pore throat, and micropore throat categories, with the latter two being the most common. The maximum mercury saturation values primarily ranged between 42.05 and 89.27% (average of 72.75%). The mercury removal efficiency values were mainly distributed between 23.14 and 36.06% (average of 28.79%). The pore throat connectivity in the reservoir was weak (Table 1), with pore throat combinations mainly characterized as mesopores and micro- to fine pore throats, small pores and micro- to fine pore throats, and micropores and micropore throats. The complex pore structure of the reservoir reflects its strong heterogeneity.

Table 1 Mercury injection data of the Chang 7 shale oil reservoirs in the Sai 392 area.

Pore structure parameters, such as the maximum pore throat radius and the average pore throat radius, directly reflect the distribution, sorting, and connectivity of rock samples41,42. The crossplots of permeability and these parameters reveal strong positive correlations (correlation coefficients R2 > 0.73) (Fig. 6a and b). This indicates that the pore structure of the Chang 7 reservoirs in the study area directly affects their permeability.

Fig. 6
Fig. 6
Full size image

Correlations between the permeability and pore structure parameters of the Chang 7 shale oil reservoirs. (a) Relationship between permeability and maximum pore throat radius. (b) Relationship between permeability and average pore throat radius.

Lithology identification and classification

Reservoir physical properties are largely determined by lithology, and the impact of lithology must be considered when predicting reservoir physical properties43. The complexity of lithology of the shale oil reservoirs within the study area is notable. While the lithologic profile along the longitudinal direction of a single well was established during logging, errors in lithology interpretation due to artificial factors during logging are significant. Thus, there is an urgent need to enhance the accuracy of lithologic profile interpretation in the study area. Relying solely on a single logging curve is insufficient for effective lithology identification. Imaging logging enables direct qualitative identification of different lithologies and reflects changes in rock grain size44,45. However, this method is prone to artifacts, resulting in inaccuracies in lithologic classification and low identification efficiency. By reconstructing a logging curve to amplify the logging response, the accuracy of identifying reservoir lithology can be improved, enabling quantitative classification of different lithologies. The use of the BP neural network method is effective in correcting the entire logging-based lithological profile, ultimately enhancing the accuracy of lithological interpretation.

The BP neural network model employed to predict the properties of the reservoir is contingent upon a set of input features, namely natural gamma ray (GR), acoustic (AC), density (DEN), neutron (CNL), and true formation resistivity (RT). These features are pivotal in capturing the various characteristics of the reservoir, as they offer distinct and complementary details about the reservoir’s physical and geological properties46. DEN, which quantifies the density of the rock, is associated with the composition and texture, facilitating the differentiation of different rock types and indicating the presence of minerals such as calcite or dolomite. AC measures the acoustic properties, which are affected by the pore structure and fluid content, thus providing valuable insights into the porosity and permeability of the reservoir. CNL measures the neutron porosity, which is a significant parameter for characterising the fluid-bearing capacity of the reservoir. GR is sensitive to radioactive elements and can be used to determine the lithology and mineralogy of the reservoir. RT provides information regarding the electrical resistivity, which is related to the salinity and fluid conductivity of the reservoir47.

The performance and accuracy of the model are greatly influenced by the selection and quality of these input features. The combination of DEN, AC, CNL, GR, and RT has been found to improve the model’s ability to accurately predict the reservoir properties48,49. Studies have demonstrated that including these features leads to enhanced correlations between the predicted and measured values of porosity and permeability. This enable the model to better capture the complex relationships between these parameters and the geological and physical properties of the reservoir28.

In this study, we first consolidated, categorized, and arranged the lithologies observed in the core samples, and then correlated the imaging logging data with the core observation data. Image features were utilized to determine the lithology throughout the entire wellbore interval, resulting in the classification of target formations into four primary lithologies: fine sandstone, argillaceous siltstone, silty mudstone, and mudstone (Fig. 7). The sensitivity curves for lithology, derived from GR, AC, DEN, and CNL data, were screened through sensitivity analysis. These results were used to construct the lithologic reconstruction curve Ilith (Eq. 1). The Ilith values for different lithologies exhibited notable distinctions (Fig. 8), leading to the establishment of lithology identification and classification criteria for shale oil reservoirs in the specific study area (Table 2).

Fig. 7
Fig. 7
Full size image

Lithologic characteristics interpreted from conventional logging and imaging logging of the Chang 7 shale oil reservoirs. (a) Mudstone. (b) Silty mudstone. (c) Argillaceous siltsAtone. (d) Fine sandstone.

Fig. 8
Fig. 8
Full size image

Lithologic identification chart of the Chang 7 shale oil reservoirs.

Table 2 Criteria for identifying and classifying lithologies.

The lithologic reconstruction curves are shown below:

$${\text{I}}_{\text{lith}}=\frac{{\text{CNL}}^{\prime}-{\text{DEN}}^{\prime}}{{\text{GR}}^{\prime}\times {\text{AC}}^{\prime}}$$
(1)

where GR’, AC’, CNL,' and DEN’ are the normalized values of the logging curves GR, AC, CNL, and DEN, respectively (Eqs. 24).

We employed the BP neural network method to correct the lithological profiles of individual wells. The four lithological categories (fine sandstone, argillaceous siltstone, silty mudstone, and mudstone) identified in the previous section were digitized and replaced by 1–4 as parameters for lithological evaluation (Table 2). A BP neural network model for lithology was established by using the GR, AC, DEN, CNL, and lithologic evaluation parameters as input layers and lithology as the output layer. The network training results showed that the predicted lithologic profile was more detailed and accurate than the log-based lithologic profile (Fig. 9, Tracks 6 and 7). This enhanced lithologic profile can serve as a basis for permeability prediction50.

Fig. 9
Fig. 9
Full size image

Comparison between the log-based lithologic profile and the predicted lithologic profile of the Xin488 well. Track 1 shows the Chang 7 member; Track 2 shows the depth; Tracks 3, 4, and 5 are the main logging curves; Track 6 is the log-based lithologic profile; and Track 7 is the predicted lithologic profile.

HFU identification and permeability estimation

HFU identification and classification

Porosity and permeability are crucial parameters for evaluating hydrocarbon reservoirs because they reflect the storage and flow capacity of a medium46. Porosity is typically determined using three types of logging data (AC, DEN, and CNL). The effective porosity can be measured in the laboratory. The total porosity is determined by well logging data and impacted by the clay content of the shale reservoir. GR data can be used to determine the clay content of shale reservoirs effectively. Therefore, this study utilized three types of porosity logs and GR logs as input layers to predict porosity using a BP neural network model based on the multilayer perceptron (MLP) architecture in SPSS Modeler 14.1 software.

Taking the Xin488 well as a case study, the MLP architecture was optimal when it has 70 and 30 data points in the training and test subsets, respectively. The crossplot between the predicted and measured porosities shows a strong correlation (R2 = 0.84) (Fig. 10).

Fig. 10
Fig. 10
Full size image

Relationship between the measured porosity and predicted porosity of the Xin488 well.

By substituting the measured porosity and permeability data into Eqs. (7) and (8), the reservoir quality index (RQI) and the normalized porosity index (PMR) can be calculated. Equation (9) was applied to determine the FZI, which can be classified to identify HFUs in the shale oil reservoirs. The distribution of log(FZI) from the Xin488 well data suggests that the reservoir can be categorized into five HFUs (Fig. 11a), with varying FZI distribution ranges across each HFU type. The value of log(FZI) decreases from HFU I to HFU V (Table 3).

Fig. 11
Fig. 11
Full size image

Identification of the five HFUs and associated data in the Xin488 well. (a) Cumulative frequency plot of log(FZI) data. (b) Porosity–permeability relationship.

Table 3 FZI intervals used for HFU identification.

The permeability‒porosity crossplots of the different HFUs reveal a strong correlation between the porosity and permeability within the same HFU type (R2 > 0.82). (Fig. 11b). Therefore, based on the evaluation of porosity, the permeability of shale oil reservoirs can be predicted by the correlation between the porosity and permeability of each HFU.

Model validation and permeability prediction of the uncored well

The HFUs are classified in the cored interval based on FZI values calculated from the measured porosity and permeability data of the samples. These FZI values typically align with the corresponding well logging data, aiding in the continuous identification of HFUs in both cored and uncored intervals. Therefore, for uncored intervals without measured porosity and permeability data, it becomes essential to model the relationship between the FZI and well logging response using readily available well logging data. Initially, the FZI is calculated for the uncored intervals and then used to classify the HFU types. The correlations between the FZI and well logging curves are more complex than the correlations between the porosity and logging curves51. Therefore, this paper selects an enhanced accuracy model for predicting permeability. A BP neural network model is established using a substantial amount of logging data strongly correlated with FZI (DEN, AC, CNL, GR, and RT) and with lithological evaluation parameters to calculate FZI, following a similar approach to that used for predicting porosity.

Due to the complex diversity of lithologies and pore structures in the Chang 7 member of the study area, the shale oil reservoirs exhibit significant differences in terms of the reservoir FZI and well logging response. As a result, establishing a unified model for predicting permeability based on well logging is challenging, requiring separate modeling and evaluation efforts for each well. Using the Xin488 well as a case study, the network training results showed a strong positive correlation (R2 = 0.85) between the predicted and measured FZI values (Fig. 12). This suggests that the model is effective for identifying HFUs continuously in both cored and uncored intervals of a well. Based on the five types of HFUs classified by FZI identification (Fig. 11b), we established a permeability prediction index model for each HFU type, specifically the porosity–permeability relationship equation (Table 4).

Fig. 12
Fig. 12
Full size image

Relationship between the measured FZI and predicted FZI of the Xin488 well.

Table 4 Exponential models of permeability evaluation for different HFUs in the Xin488 well.

In this study, the permeability of shale reservoirs was predicted in 10 un-cored wells within the Sai 392 area. Figures 13 and 14 show predicted permeability values for 2 wells (the Xin488 and Xin485 wells). The predicted permeability values correspond well to measured values, indicating that the improved model can accurately predict the permeability of shale reservoirs.

Fig. 13
Fig. 13
Full size image

Prediction results of the porosity and permeability profiles for the Xin488 well. Track 1: Chang 7 member; Track 2: depth; Tracks 3, 4, and 5: well logging curves; Track 6: log-based lithologic profile; Track 7: predicted lithologic profile; Track 8: predicted porosity results; Track 9: predicted permeability results.

Fig. 14
Fig. 14
Full size image

Prediction result profile of the porosity and permeability for the Xin485 well. Track 1: Chang 7 member; Track 2: depth; Tracks 3, 4, and 5: well logging curves; Track 6: log-based lithologic profile; Track 7: predicted lithologic profile; Track 8: predicted porosity results; Track 9: predicted permeability results.

In order to quantitatively assess the performance of the model, crossplots of predicted and measured values were generated. Most of the data in the crossplot of the measured permeability versus the lithologic-control-based predicted permeability fall within the ± 0.17 uncertainty boundaries, and show a strong positive correlation, the R2 value is 0.803 for the Xin488 well and 0.801 for the Xin485 well. Analysis of the linear fitting error reveals that the predictions with a low root mean square error (RMSE ≤ 0.037) (Fig. 15a and b). Furthermore, R2 and RMSE of 8 un-cored wells within the Sai 392 area and 2 un-cored wells within the Longdong area of the Ordos Basin (the Chen96 and Li357 wells) were statistically analyzed. The results indicating a strong overall correlation between the measured and predicted permeabilities (R2 > 0.72), and a low RMSE of predictions (RMSE ≤ 0.051). Among them, the R2 value is 0.826 for Chen96 well and 0.797 for Li357 well, and RMSE values are 0.031 for Chen96 well and 0.046 for Li357 well (Fig. 16). Therefore, the improved method can be considered to be generally applicable.

Fig. 15
Fig. 15
Full size image

Relationships between the measured permeability and predicted permeability of the improved method. (a) Xin488 well. (b) Xin485 well.

Fig. 16
Fig. 16
Full size image

The R2 and RMSE for different single wells.

In addition, the traditional multiple regression method was employed to predict permeability of the Xin488 well and the Xin485 well, respectively. It is evident that the correlations between the measured and predicted permeabilities of the 2 wells are relatively poor, the R2 value is 0.318 for the Xin488 well and 0.245 for the Xin485 well (Fig. 17a and b). Conversely, the correlations between the predicted permeabilities and the measured permeabilities are significantly enhanced, with all R2 value greater than 0.8(Fig. 15a and b). This further confirms the assumption that this method has strong performance in accurately predicting permeability. Moreover, the predicted permeability curves effectively revealed favorable shale oil/gas seepage layers37.

Fig. 17
Fig. 17
Full size image

Relationships between the measured permeability and predicted permeability of the traditional method. (a) Xin488 well. (b) Xin485 well.

Conclusions

The Sai 392 Chang 7 member shale reservoirs in the Ordos Basin have a low surface porosity and are primarily composed of lithic arkose and arkose. The Chang 7 member shale reservoir porosity mainly ranges from 4 to 9%, and the corresponding permeability mainly ranges from 0.01 to 0.2 mD. These characteristics are indicative of typical ultralow-porosity, ultralow-permeability reservoirs. The relationship between porosity and permeability is weak, with a variety of pore throat structures present, including mesopores and micro- to fine pore throats, small pores and micro- to fine pore throats, and micropores and micropore throats, indicating strong heterogeneity.

  1. 1)

    The lithology of the shale oil reservoirs in the Chang 7 member in the study area consists mainly of fine sandstone, argillaceous siltstone, silty mudstones, and mudstones. Each lithology exhibits a distinct range of permeabilities, with fine sandstone exhibiting the highest permeability, followed by argillaceous siltstone. The permeability of a reservoir is strongly correlated with both the maximum and average pore throat radii. Therefore, the permeability of the studied reservoirs is influenced primarily by the lithology and pore structure.

  2. 2)

    Lithology identification criteria are established based on lithologic reconstruction curves, with Ilith value boundaries of −0.5, 1.5, and 3. Additionally, the BP neural network is used to correct single-well logging-based lithologic profiles, enhancing the accuracy of lithologic interpretation.

  3. 3)

    A reservoir permeability log prediction model is established using the lithology–flow unit–BP neural network permeability evaluation method to estimate the permeability of the entire studied well interval. The predicted permeability values closely align with the measured values, with nearly all the data falling within the uncertainty boundary of ± 0.17. The prediction results reveal a strong positive correlation (R2 > 0.72) and a low root mean square error (RMSE ≤ 0.051), indicating that the proposed model can effectively and accurately predict the permeability of the whole well interval of the Chang 7 member.

The proposed method may be more applicable to shale reservoirs. However, due to the unique geological characteristics and fluid behaviors exhibited by different reservoirs, it is imperative to develop a novel method with extensive applicability, high accuracy, and robustness. In future studies, we should incorporate a more diverse array of data, covering seismic, core, production, and various geological data. Integrating these data sources can provide a more comprehensive understanding of the reservoir. It is essential to deeply explore the potential of leveraging machine learning algorithms and data mining techniques to extract valuable information from multi-source data. Additionally, it is recommended to contemplate the concurrent application of multiple modeling approaches. By capitalizing on their individual fortes, we can enhance the accuracy of reservoir property prediction, which is instrumental for more accurate reservoir assessment and more effective identification of favourable shale oil/gas seepage layers.

Data and methods

Samples and experimental section

In this study, we collected physical property data from 69 wells, Hg pressure data from 10 wells, imaging logging data from 8 wells, CTS data from 36 wells, and SEM data from 20 wells in the Sai 392 area of the Changqing Oilfield Corporation. Additionally, we extracted the oil from 11 samples from the study area using a solvent mixture of dichloromethane and methanol (9:1 ratio) for 72 h. The extracted samples were then dried at 110 °C to eliminate residual oil and water. Subsequently, CTS and SEM experiments were conducted with the samples.

The CTS experiments were completed on a CIAS-2007 rock casting image analyzer. The CTS was resin-impregnated with blue epoxy; therefore, the blue color in the micrographs of the casts represents pores.

The SEM experiments were first performed by polishing the observation surface of the samples with an Ilion+ II 697C argon-ion polisher produced by the Gatan Company in the United States; then, the samples were observed with a Quanta 450 field emission SEM produced by the FEI in the United States.

Data preprocessing

Data preprocessing refers to standardizing data so that different variables have the same scale, The standardization process is performed to make the data comparable. Data standardization methods are as follows52:

Standard deviation standardization:

$${x}_{ij}^{\prime}=\frac{{x}_{ij}-{\overline{x} }_{j}}{{s}_{j}}\left(i=\text{1,2},\dots ,n,j=\text{1,2},\dots ,m\right)$$
(2)
$${\overline{x} }_{j}=\frac{1}{n}\sum_{i=1}^{n}{x}_{ij},{s}_{j}={\left[\frac{1}{n}\sum_{i=1}^{n}{\left({x}_{ij}-{\overline{x} }_{j}\right)}^{2}\right]}^\frac{1}{2},\left(j=\text{1,2},\dots ,m\right)$$
(3)

xij is expressed as the raw data of the jth index of the ith sample; x'ij is the standardized data of the jth index of the ith sample.

Min–Max normalization:

$${x}_{ij}^{{\prime}{\prime}}=\frac{{x}_{ij}^{\prime}-\underset{1\le i\le n}{\mathit{min}}\left\{{x}_{ij}^{\prime}\right\}}{\underset{1\le i\le n}{\mathit{max}}\left\{{x}_{ij}^{\prime}\right\}-\underset{1\le i\le n}{\mathit{min}}\left\{{x}_{ij}^{\prime}\right\}},(j=1,2,\dots ,m)$$
(4)

Hydraulic flow units

A HFU refers to a collection of reservoirs that share similar lithologic and seepage characteristics, both vertically and laterally53,54 These reservoirs exhibit uniform fluid flow properties due to their similar pore throat characteristics55,56 By classifying reservoirs into different categories based on HFUs, the heterogeneity of the reservoir can be mitigated to some extent. The RQI and flow zone index (FZI) effectively reflect the differences in the microscopic pore structure. The FZI can be used to quantitatively identify and classify HFUs, with each HFU being characterized by its FZI. This allows for the establishment of a strong correlation between permeability and porosity. Estimating porosity from logging curves is a simpler approach than directly predicting permeability. Therefore, based on the porosity–permeability relationship for each HFU, it is possible to predict the permeability of samples within each HFU57,58,59,60.

According to the Kozeny–Carman Eq.61, permeability and porosity satisfy the following relationship:

$$K=\frac{1}{{H}_{c}}\times \frac{{\Phi }^{3}}{{\left(1-\Phi \right)}^{2}}$$
(5)

where K is the permeability in µm2, Φ is the effective porosity (fraction), and Hc is a structural property constant that is formed by the product of the pore tortuosity, shape factor, and square of the specific surface area.

Dividing both sides of Eq. (5) by Φ and taking the square root results in the following form:

$$\sqrt{\frac{K}{\Phi }}=\frac{1}{\sqrt{{H}_{c}}}\times \frac{\Phi }{1-\Phi }$$
(6)

The parameters that can be defined when permeability (K) is expressed in mD are:

The RQI:

$$RQI=0.0314\times \sqrt{\frac{K}{\Phi }}$$
(7)

The normalized porosity index (PMR):

$$PMR=\frac{\Phi }{1-\Phi }$$
(8)

The FZI:

$$FZI=\frac{1}{\sqrt{{H}_{c}}}=\frac{RQI}{PMR}$$
(9)

Equation (9) illustrates that the FZI is a parameter that reflects the characteristics of the pore structure of the rock; hence, its value accurately reflects the heterogeneity of the reservoir seepage characteristics.

By taking the logarithm of Eq. (9), the following relationship is derived:

$$lgRQI=lgPMR+lgFZI$$
(10)

Equation (10) shows that there is a linear relationship between the RQI and the PMR. This means that data points with similar FZI values are distributed on the same straight line, while data points with different FZI values are distributed on a set of parallel straight lines.

Back-propagation (BP) neural network

The BP network comprises an input layer, an output layer, and multiple hidden layers and operates in two stages: forward propagation and BP. In the forward propagation stage, the input signal is transmitted to the hidden layer, and after applying the activation function, the output information of the hidden layer is forwarded to the output layer for the final output result. If the output layer fails to reach the desired output value, the network proceeds to the BP error stage, where the connection weights of each layer are adjusted based on the back-propagated error signals to determine the optimal set of weights for the correct output (Fig. 18). As there are feedback connections between neurons within each layer, information is transmitted only from the input layer to the output layer. The architecture of the network includes feedback connections between neurons within layers, ensuring that information flows from the input to the output layer only, with no direct connections between neurons within the same layer. Therefore, the BP neural network algorithm is effective in solving the relationship between the input and output and addressing complex nonlinear problems. BP neural networks have been successfully applied in predicting the porosity, reservoir sedimentary microfacies, and lithology of heterogeneous reservoirs62, FZI can be evaluated using a BP neural network.

Fig. 18
Fig. 18
Full size image

Basic schematic of a BP neural network.