Integrating multiple seismic attributes for fault detection using a new hybrid machine learning

Esmaeili, Hadi; Bagheri, Majid; Esmaeili, Shamseddin

doi:10.1038/s41598-025-26889-y

Download PDF

Article
Open access
Published: 28 November 2025

Integrating multiple seismic attributes for fault detection using a new hybrid machine learning

Hadi Esmaeili¹,
Majid Bagheri² &
Shamseddin Esmaeili³

Scientific Reports volume 15, Article number: 42744 (2025) Cite this article

1027 Accesses
Metrics details

Subjects

Abstract

This paper proposes a novel hybrid approach for fault detection in seismic data by integrating multiple seismic features using a novel hybrid machine. This hybrid method introduces the integration of Multilayer Perceptron (MLP) neural networks with Support Vector Machines (SVM). The main objective of this study is to enhance the analysis and recognition of fault patterns in seismic data while reducing detection errors. The dataset consists of two-dimensional synthetic and real seismic data and their corresponding labels. Various seismic features, such as the gray-level co-occurrence matrix (GLCM), and features derived from ant tracking, chaos, variance, sweetness, correlation, slope direction, and energy, are extracted and normalized. These features are subsequently used to train MLP and SVM models for fault detection. SVM, a supervised learning method, operates by determining a hyperplane that maximizes the separation between data classes. In contrast, MLP, an artificial neural network, uses multiple layers to optimize weights and capture complex data relationships. The performance of each model is evaluated using experimental data, and its corresponding accuracies are calculated. Finally, the predictions from both models are combined to improve the overall fault detection accuracy. The results show that this combined approach significantly increases the fault detection accuracy compared to the independent application of each model.

Ensemble of hybrid model based technique for early detecting of depression based on SVM and neural networks

Article Open access 26 October 2024

Multilabel classification for defect prediction in software engineering

Article Open access 13 March 2025

Enhancing the classification of seismic events with supervised machine learning and feature importance

Article Open access 24 December 2024

Introduction

Oil fields development and production wells drilling remain critical strategies in oil-rich countries¹. However, hydrocarbon exploration and the development of carbonate reservoirs are inherently complex due to inadequate seismic imaging and reservoir heterogeneity arising from diagenetic variations². Consequently, identifying faulted zones is vital for determining optimal drilling locations³. Detecting faults and fractures is an essential task throughout all stages of oilfield operations, including exploration, extraction, and production. Traditional methods require interpreters to spend considerable time visually identifying faults and fractures, followed by manual interpretation. When data quality is poor or geological formations are structurally complex, such identification can become challenging or even infeasible⁴. Manual interpretation of faults and fractures is often fraught with high uncertainty, particularly in basins with limited seismic data⁵. For instance, when faults align parallel to the strike, their detection becomes more difficult due to the overlap of fault lineaments with bedding lineaments.

Accurate fault identification and localization in seismic data are among the most critical steps in geophysical analysis. Faults, recognized as zones of structural weakness, play a significant role in assessing earthquake-prone areas. Given the complexity and volume of seismic data, advanced analytical methods are indispensable. This study investigates the application of MLP and SVM techniques for fault detection in seismic datasets, utilizing a range of extracted seismic attributes to achieve greater precision.

The Multi-Layer Perceptron (MLP) neural network was first introduced by Frank Rosenblatt in the 1960s. Initially limited to solving linear problems, its capabilities expanded significantly in the 1980s with the advent of the backpropagation algorithm, developed by David Rumelhart and colleagues, enabling the training of multi-layered networks. Today, MLP stands as a cornerstone of artificial intelligence and machine learning. Soft computing techniques are increasingly employed for reservoir characterization, with notable contributions from researchers such as Tanner⁷, Roberts¹², Tingdahl et al.¹³, Aminzadeh et al.¹⁴, Ashraf et al.¹⁵, and Xiao et al.¹⁶. Recent advancements in hardware and software have further accelerated the adoption and evolution of these methodologies.

These recent contributions highlight the dynamic evolution of seismic fault detection techniques, with deep learning methods providing powerful alternatives. Nevertheless, such models often demand large labeled datasets, high computational resources, and GPU infrastructure, making them challenging to implement in all research and industry contexts. Against this background, our proposed hybrid approach based on MLP and SVM offers a simpler and more resource-efficient solution for 2D seismic datasets, while still achieving competitive accuracy compared with recent deep learning frameworks.

Methodology

This research employs a hybrid approach combining MLP and SVM algorithms, implemented using MATLAB software. Initially, a suite of seismic attributes, including instantaneous phase and frequency, GLCM matrices (contrast, correlation, energy, and homogeneity), ant-tracking, curvature, edge detection (Sobel and Canny), dip, and azimuth, is extracted from the seismic data and normalized. Following extensive testing, frequency, GLCM, and edge-detection features were selected as inputs for the algorithms in this study, though feature selection may vary depending on the dataset. The data is then divided into training and testing sets, with MLP and SVM models trained using the training data. SVM identifies the optimal hyperplane to separate data classes, while MLP learns intricate data relationships through multiple layers. Model performance is evaluated on the test set, and accuracy is computed. Below, we briefly discuss key seismic attributes that yielded superior results in this study.

Curvature attribute

Curvature measures the degree of bending or angular change at boundaries, proving effective for identifying structures with sharp variations, such as faults. In seismic and geological analysis, curvature attributes aid in interpreting seismic reflections and detecting features like faults and folds. This is achieved by fitting a quadratic surface, expressed mathematically, to seismic data points. For a given point and its eight neighbors, the local curvature is estimated using Eq. (1).

Gray-level co-occurrence matrix (GLCM)

GLCM is a statistical tool that analyzes the co-occurrence of pixel intensities in an image and extracts textural features critical for fault detection. Key features include contrast, correlation, energy, and homogeneity, Eqs. (2–4)¹⁹.

Ant-tracking attribute

Inspired by ants’ collective behavior, the ant-tracking algorithm identifies prominent paths or boundaries in images, making it highly effective for detecting faults and fractures. Rather than relying on a specific mathematical formula, it operates based on defined movement rules, prioritizing regions with distinct edges, Eq. (5)^20,21.

Dip and azimuth attributes

Dip: This attribute measures the angle of inclination of a subsurface layer relative to the horizontal plane, calculated from changes in depth or reflection time across directions. It is instrumental in characterizing geological structures like faults, Eq. (6).

Azimuth: Representing the direction of maximum dip, azimuth indicates the orientation or extent of geological features derived from horizontal coordinate variations, Eq. (7)²².

Chaos

The Chaos attribute measures the degree of structural disorder in seismic data. It is particularly useful for identifying fault zones, fractures, and chaotic depositional environments, Eq. (8)⁴.

Variance

Variance measures the lateral changes in seismic amplitude, highlighting discontinuities such as faults and stratigraphic features, Eq. (9)⁴.

Sweetness (sweet)

Sweetness is the ratio of instantaneous amplitude to the square root of instantaneous frequency. It is commonly used to identify hydrocarbon reservoirs, Eq. (10)²³.

Correlation

The correlation attribute measures the similarity between seismic traces within a specified window. It is useful for analyzing the continuity and coherence of seismic reflectors, Eq. (11)⁴.

Dip steering

Dip Steering calculates the local dip and azimuth of seismic reflectors, often used for guiding horizon interpretation, Eq. (12)⁶.

Energy

Energy measures the sum of squared amplitudes within a seismic trace window, representing reflectivity strength, Eq. (13)⁸.

Gradient magnitude (GradianMag)

Gradient Magnitude represents the rate of amplitude change within a seismic section, useful for detecting faults and other discontinuities, Eq. (14)⁴.

Amplitude contrast

Amplitude Contrast highlights abrupt changes in seismic amplitude, often used for fault and fracture detection, Eq. (15)⁹.

Flatness

Flatness measures the similarity of seismic reflectors to a horizontal plane, which is useful for identifying stratigraphic features and differentiating them from chaotic or faulted areas, Eq. (16)⁴.

Research approach

Multi-layer perceptron (MLP) with backpropagation

MLP is a feedforward artificial neural network that maps input features to output probabilities. In our framework, the MLP processes the integrated seismic attributes and generates an initial interpretation, supporting the development of the new hybrid machine. MLP consists of input, hidden, and output layers. The input layer receives data, hidden layers (whose number varies by problem complexity) process it, and the output layer delivers results. Inspired by biological neurons, MLP learns by adjusting weights via backpropagation to minimize prediction errors. In this project, an MLP with five hidden layers is designed, utilizing ReLU activation in hidden layers and sigmoid activation in the output layer (Fig. 1). Trained on seismic data, the network optimizes its parameters and predicts faults in the test set. MLP is a standard tool for supervised pattern recognition and remains a focus of research in computational neuroscience and parallel processing due to its ability to address complex, stochastic problems¹⁰.

Support vector machine (SVM)

Support Vector Machine (SVM) is a supervised learning algorithm employed for classification and regression tasks, though it is predominantly used for classification. The primary objective of SVM is to identify an optimal hyperplane that separates data into two or more classes, maximizing the distance between the nearest points of each class—known as support vectors—and the hyperplane. This distance, referred to as the margin, enhances the model’s generalization ability when maximized¹¹. For cases where data are not linearly separable, SVM utilizes a kernel function to map the data into a higher-dimensional space where linear separation becomes feasible. In this study, an SVM with a Gaussian Radial Basis Function (RBF) kernel is applied to classify seismic features for fault detection. By integrating Principal Component Analysis (PCA) to reduce data dimensionality and standardizing features, this approach improves both accuracy and computational efficiency. The combination of MLP and SVM constitutes our new hybrid machine. This configuration allows for enhanced classification of fault and non-fault areas by fully utilizing the integrated seismic attributes.

The new hybrid machine

The integration of MLP and SVM creates a robust hybrid system. By combining neural networks’ learning capabilities with SVMs’ margin optimization, our new hybrid machine significantly improves the fault detection process. The strategic integration of multiple seismic attributes amplifies the hybrid machine’s effectiveness, resulting in superior detection performance.

Discussion and findings

Application to synthetic data

This study first applies the algorithms to synthetic seismic data with artificially induced faults. Two sample 128 × 128 seismic sections are used: one representing the seismic data (Figure. 2a) and the other containing fault labels (Fig. 2b). The data are transformed into 1D vectors, normalized, and processed to compute 12 features, of which six (selected based on quality weighting) are fed into the hybrid algorithm (Fig. 3). The hybrid MLP-SVM algorithm achieves an accuracy of approximately 94% on this dataset. The results of this method on 2D synthetic data are shown in the Figure. 4. We further add quantitative evaluation on the synthetic dataset using MSE, Dice coefficient, precision, recall, F1 score, and ROC-AUC for MLP, SVM, and hybrid model (Table 1). The hybrid model achieves the highest Dice score (0.91) and F1 score (0.90), confirming its superior performance.

Table 1 Quantitative evaluation on the synthetic dataset.

Full size table

Application of the hybrid algorithm to real two-dimensional data

After validation on synthetic data, the algorithm was optimized for real data, including labeled seismic data (Fig. 5). Initially, 37 features were considered, which were extracted from Petrel software and from direct MATLAB calculations. A number of these features were similar. To improve the performance, we evaluated them based on variance and visual interpretability. Out of these, 9 features were selected because they showed higher scores in visual quality and geological interpretability. The details of the evaluation by F1-score and Dice methods are presented in Table 2, which lists the features in order of quality. The first three features (chaos, variance, and domain contrast) obtained much higher scores compared to the other features, indicating their dominant contribution to fault detection. These 9 features were weighted and subsequently entered as input layers for the algorithm into the MLP and SVM classifiers (Figure. 6). The hyperparameters of both models were tuned accordingly: the MLP has five hidden layers ([128, 128, 64, 32, 16]) with ReLU activations and softmax output, trained for 200 epochs using scalable conjugate gradient optimization and cross-entropy loss. The SVM uses an RBF kernel with KernelScale = auto, BoxConstraint = 1, and PCA dimensionality reduction with 95% variance preservation. Each input was weighted based on the variance method and included in the algorithm accordingly. The input data was divided into 80% training and 20% test sets. The SVM and MLP algorithms were first trained using the training data and subsequently evaluated on the test data. Then, the predictions from the SVM and MLP models were combined to derive the final fault detection results. The confidence level for this combined approach was approximately 89% (accuracy of the combined model: 88.83%).

Subsequently, the hybrid algorithm was applied to real 2D data. Figure 7a shows the prediction results of the MLP method, which achieved a confidence level of approximately 83%. While this method successfully identified the errors, it did not have the desired detection quality. As a result, the SVM method was analyzed and tested, and its results are shown in Fig. 7b, achieving a confidence level of approximately 80%. Although this approach partially identified the errors, it was not sufficient to provide optimal results. To overcome this problem, the two methods were combined using a weighted approach. After conducting experiments and tuning the neural network parameters to achieve an optimal configuration, the hybrid algorithm was implemented. The results, shown in Fig. 8, clearly show the predicted errors aligned with the guidelines (yellow lines) on the image, increasing the accuracy of error detection. Next, we performed a detailed comparison of our proposed hybrid method with the U-Net. The U-Net architecture was configured with the usual parameters (four encoder-decoder blocks, ReLU activations, batch normalization, Adam optimizer, learning rate 0.001, 50 epochs). The results showed that the U-Net produces competitive accuracy (Fig. 9), but importantly, our hybrid approach was chosen due to its simplicity, reduced computational requirements, and suitability for 2D seismic data, where computational infrastructure is limited. We performed comparisons with CNN- and U-Net-based methods using published implementations or results from scientific papers on similar datasets. Our hybrid method matches or in some cases surpasses the F1 scores of these models while training on a CPU in less than 25 min (Table 3).

Table 2 Ablation study showing attribute contributions.

Full size table

Table 3 Comparison with CNN and U-Net results.

Full size table

Interpretation of results

Compared to studies that rely on single methods, the combined approach shows higher accuracy in fault pattern detection. This is due to the complementary strengths of MLP and SVM in data interpretation. While previous research often used single algorithms or multiple methods independently, the innovation here lies in their integration. In Table 4, the performance of this combined approach is calculated and shown on the synthetic and real data used in this paper. Note that this approach has been evaluated solely on the presented dataset, with the potential to be modified and tested on diverse datasets. However, the computation time increases due to model combination, necessitating future optimizations.

Table 4 Performance metrics across available datasets.

Full size table

Conclusion

We present a novel hybrid machine for fault detection using integrated seismic markers. This method combines the strengths of MLP and SVM to achieve higher accuracy in identifying fault structures. The integration of multiple seismic markers into the framework of the novel hybrid machine has been crucial in enhancing the fault detection results. This hybrid technique provides a powerful tool for interpreting seismic data. It can also be used to improve the reprocessing process, for example, velocity correction in faulted areas, and can be used as a guide. Future efforts will include extending this approach to 3D datasets and applying it to different data to further improve the system.

Declaration of generative AI and AI-assisted technologies in the writing process

While preparing this work, the authors used Grok (Grok 3) to improve grammar and readability. After using this tool/service, the author(s) reviewed and edited the content as needed and take(s) full responsibility for the content of the publication.

Data availability

Data Availability Statement: The seismic data supporting the findings of this study are provided in the Supplementary Material section, within a RAR file in .CVS format, and can be accessed there. For any further inquiries or data requests, please contact Hadi Esmaeili at hadi.esmaeili@ut.ac.ir.

References

Ashraf, U. et al. Classification of reservoir facies using well log and 3D seismic attributes for prospect evaluation and field development: A case study of Sawan gas field, Pakistan. J. Petrol. Sci. Eng. 175, 338–351 (2019).
Article Google Scholar
Bashir, Y. et al. Seismic expression of miocene carbonate platform and reservoir characterization through geophysical approach: application in central Luconia, offshore Malaysia. J. Petroleum Explor. Prod. Technol. 11 (4), 1533–1544 (2021).
Article Google Scholar
Jiang, R. et al. Sweet spot prediction through fracture genesis using multi-scale geological and geophysical data in the karst reservoirs of cambrian Longwangmiao carbonate Formation, Moxi-Gaoshiti area in Sichuan Basin, South China. J. Petroleum Explor. Prod. Technol. 12 (5), 1313–1316 (2021).
Article Google Scholar
Chopra, S. & Marfurt, K. J. Seismic Attributes for Prospect Identification and Reservoir Characterization (Society of Exploration Geophysicists, 2007).
Peace, A. et al. The role of pre-existing structures during rifting, continental breakup, and transform system development, offshore West Greenland. Basin Res. 30 (3), 373–394 (2018).
Article ADS Google Scholar
Marfurt, K. J., Kirlin, R. L., Farmer, S. L. & Bahorich, M. S. 3-D seismic attributes using a semblance-based coherency algorithm. Geophysics 63 (4), 1150–1165 (1998).
Article ADS Google Scholar
Tanner (2001)Tanner, M. (2001). Fault detection and interpretation using seismic attributes. The Leading Edge, 20(3), 310–317.
Taner, M. T. & Sheriff, R. E. Application of amplitude, frequency, and Other Attributes To Stratigraphic and Hydrocarbon Determination 301–327 (Seismic Stratigraphy Applications to Hydrocarbon Exploration, 1977).
Marfurt, K. J. Robust estimates of amplitude gradients and applications to 3D seismic data. Geophysics 71 (1), P1–P9 (2006).
Google Scholar
Menhaj, M. B. Fundamentals of Artificial Neural Networks Vol. 1, p. 29 (Amirkabir University of Technology, 2002). (in Persian).
Cortes, C. & Vapnik, V. Support-vector networks. Mach. Learn. 20 (3), 273–297 (1995).
Article Google Scholar
Roberts, A. Curvature attributes and their application to 3D interpreted horizons. First Break. 19 (2), 85–100 (2001).
Article Google Scholar
Tingdahl et al. (2005)Tingdahl, K. M., de Groot, P., & Le Poidevin, J. (2005). Seismic fault detection using attribute combinations and neural networks. Geophysical Prospecting, 53(4), 513–523.
Aminzadeh et al. (2006)Aminzadeh, F., de Groot, P., & Brac, J. (2006). Soft computing and intelligent data analysis in oil exploration. Amsterdam: Elsevier.
Ashraf et al. (2020)Ashraf, U., Riaz, M., Ahmed, S., & Rehman, A. (2020). Machine learning-based fault detection and interpretation using 3D seismic attributes: A hybrid neural approach. Journal of Applied Geophysics, 182, 104176.
Xiao et al. (2020)Xiao, Z., Wu, X., & Fomel, S. (2020). FaultSeg3D: Using synthetic data to train an end-to-end convolutional neural network for 3D seismic fault segmentation. Geophysics, 85(4), O47–O58.
Zhang, P., Li, W., Liu, Y. & Zhao, X. Seismic fault recognition using a Self-Supervised Swin-UNETR with adaptable architecture. Remote Sens. 16 (5), 922 (2024).
Article ADS Google Scholar
Li, M., Fang, Z. H., Li, J. T. & Huang, J. MS-Unet: A Multi-Scale U-Shaped convolutional neural network for seismic fault recognition. Processes 13 (4), 778 (2025).
Google Scholar
Haralick, R. M. et al. Textural features for image classification. IEEE Trans. Syst. Man. Cybernetics. SMC-3 (6), 610–621 (1973).
Article ADS Google Scholar
Monga, V. & Evans, B. L. Robust perceptual image hashing using feature points. In Proceedings of the IEEE International Conference on Image Processing, 313–316. (2006).
Oztan, Y. et al. Ant tracking method for boundary detection. IEEE Trans. Med. Imaging. 20 (8), 760–771 (2001).
Google Scholar
Barnes, A. E. Attributes for coherence calculations and their applications to 3D data interpretation. Geophysics 65 (3), 915–929 (2000).
Google Scholar
Radovich, B. J. & Oliveros, R. B. 3D sequence interpretation of seismic instantaneous attributes from the gorgon field. Lead. Edge. 17 (9), 1286–1293 (1998).
Article Google Scholar
Cui et al. (2025)Cui, Y., Li, J., Wang, H., & Zhang, F. (2025). MS-UNet: A Multi-Scale Feature Fusion U-Net for 3D Seismic Fault Detection. Processes, 13(4), 778.
Chen et al. (2023)Chen, Z., Wu, B., & Ma, D. (2023). Transformer U-Net for Fault Detection in 3D Seismic Data. Remote Sensing, 15(4), 1039.
Wang et al. (2023)Wang, Q., Li, Y., & Wu, X. (2023). FaultSSL: Seismic Fault Detection via Semi-Supervised Learning. Geophysics, 88(3), M79–M91.

Download references

Funding

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Author information

Authors and Affiliations

Seismology, Institute of Geophysics, University of Tehran, Tehran, Iran
Hadi Esmaeili
Institute of Geophysics, University of Tehran, Tehran, Iran
Majid Bagheri
Razi University, Kermanshah, Iran
Shamseddin Esmaeili

Authors

Hadi Esmaeili
View author publications
Search author on:PubMed Google Scholar
Majid Bagheri
View author publications
Search author on:PubMed Google Scholar
Shamseddin Esmaeili
View author publications
Search author on:PubMed Google Scholar

Contributions

Author Contributions StatementH.E. (Hadi Esmaeili) conducted the research, performed the analysis, and wrote the manuscript. S.E. (Shamcedin Esmaeili) contributed to the writing of the manuscript and assisted in the research and analysis process. M.B. (Majid Bagheri) supervised the research, provided guidance and oversight throughout the study, contributed to the interpretation of the results, and reviewed the manuscript. All authors approved the final version of the manuscript.

Corresponding author

Correspondence to Hadi Esmaeili.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Esmaeili, H., Bagheri, M. & Esmaeili, S. Integrating multiple seismic attributes for fault detection using a new hybrid machine learning. Sci Rep 15, 42744 (2025). https://doi.org/10.1038/s41598-025-26889-y

Download citation

Received: 05 May 2025
Accepted: 30 October 2025
Published: 28 November 2025
Version of record: 28 November 2025
DOI: https://doi.org/10.1038/s41598-025-26889-y

Subjects

Abstract

Similar content being viewed by others

Ensemble of hybrid model based technique for early detecting of depression based on SVM and neural networks

Multilabel classification for defect prediction in software engineering

Enhancing the classification of seismic events with supervised machine learning and feature importance

Introduction

Methodology

Curvature attribute

Gray-level co-occurrence matrix (GLCM)

Ant-tracking attribute

Dip and azimuth attributes

Chaos

Variance

Sweetness (sweet)

Correlation

Dip steering

Energy

Gradient magnitude (GradianMag)

Amplitude contrast

Flatness

Research approach

Multi-layer perceptron (MLP) with backpropagation

Support vector machine (SVM)

The new hybrid machine

Discussion and findings

Application to synthetic data

Application of the hybrid algorithm to real two-dimensional data

Interpretation of results

Conclusion

Declaration of generative AI and AI-assisted technologies in the writing process

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Supplementary Information

Supplementary Material 1

Supplementary Material 2

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links