Abstract
The classification of coal bursting liability (CBL) is essential for the mitigation and management of coal bursts in mining operations. This study establishes an index system for CBL classification, incorporating dynamic fracture duration (DT), elastic strain energy index (WET), bursting energy index (KE), and uniaxial compressive strength (RC). Utilizing a dataset comprising 127 CBL measurement groups, the impacts of various optimization algorithms were assessed, and two prominent machine learning techniques, namely the back propagation neural network (BPNN) and the support vector machine (SVM), were employed to develop twelve distinct models. The models’ efficacy was evaluated based on accuracy, F1-score, Kappa coefficient, and sensitivity analysis. Among these, the Levenberg–Marquardt back propagation neural network (LM-BPNN) model was identified as superior, achieving an accuracy of 96.85%, F1-score of 0.9113, and Kappa coefficient of 0.9417. Further validation in Wudong Coal Mine and Yvwu Coal Industry confirmed the model, achieving 100% accuracy. These findings underscore the LM-BPNN model’s potential as a viable tool for enhancing coal burst prevention strategies in coal mining sectors.
Similar content being viewed by others
Introduction
Coal1 serves as a fundamental source of energy and a crucial industrial raw material essential for the advancement of human civilization. Within the realm of mining engineering, ensuring the safe extraction of coal stands as a paramount objective. Moreover, proactive measures2,3 in early detection and scientific prevention of disasters arising from coal mining operations and engineering constructions form the cornerstone of coal mining practices.
Coal mines are prone to various disasters4,5,6. As a dynamic disaster severely threatening to the safety production of coal mines7,8,9,10, coal burst often causes mine tunnel damage, casualties, and economic losses. Coal burst is affected by many factors, such as components of coal-rock mass, energy, and mining disturbance. Based on different properties of coal, Bieniawski et al.11,12 proposed the bursting liability theory, who pointed that bursting liability is an inherent property of coal-rock mass for burst failure13. The “Rules for Prevention and Control of Coal bursts in Coal Mines,” issued by the National Coal Mine Safety Supervision Administration of China, stipulates that coal seams that have experienced coal burst within the mine field, or coal seams that have been identified as having coal burst tendency (or rock strata of roof and floor) and are assessed as having coal burst risk are coal burst coal seams. The mine with coal burst coal seam is a coal burst mine. Coal bursting liability14, serving as an intrinsic factor and essential condition for the occurrence of coal bursts, has emerged as a key basis for evaluating and predicting coal burst hazards in coal mines. As an internal cause and necessary condition of coal bursts, CBL of different coals is significant difference.
Therefore, judging the class of CBL accurately is essential for the prevention and control of coal bursts. Mathematical and statistical methods have been widely adopted in classification and evaluation problems, as well as in CBL classification. Guided by theory of unascertained measure model, Xu et al.15 established a comprehensive model for CBL classification by applying confidence degree to sorting and calculating index weight with information entropy. According to the entropy weight-ideal point method, Wang et al.16 developed a CBL classification model, and achieved good engineering effect. Jia et al.17 introduced the attribute mathematics theory, determined the weight of each index with the similar number and similar weight method, identified the attribute by the confidence criterion, and established an attribute mathematical model for CBL classification. Guo18 determined the grey distance correlation degree of the evaluation object through the grey correlation analysis, constructed the grey identification model for CBL classification according to the minimum membership degree principle. Wang et al.7 applied Mahalanobis distance discriminant analysis (DDA) method to establish a DDA model for CBL classification based on 95 groups of training samples.
The above studies have achieved a series of good results in CBL classification, but are limited to mathematical modelling under small samples, the approach to problem-solving is complex and subjective, and the learning ability of model is poor. So, it is difficult to meet the convenience and intelligence requirements of current CBL classification. In addition, the existing performance evaluation indicator of classification model is single, the accuracy is used chiefly as the main indicator, which only considers overall classification effect, and it is difficult to ensure the CBL classification effect of different levels. In recent years, machine learning methodologies, such as BPNN19,20 and SVM21,22 have been increasingly employed to address various challenges within the domain of mining engineering. These advanced techniques have demonstrated notable efficacy in tackling analogous issues encountered in this field. Aiming at the shortcomings of CBL classification, this paper established an extensive sample dataset of CBL, introduced machine learning methods to carry out intelligent classification research of bursting liability, considered the CBL classification effect of different levels, and comprehensively used the accuracy, F1-score and Kappa coefficient to evaluate classification performance and select the best learning model.
Based on the above explanation, this study presents several novel contributions to the CBL classification:
-
1. The largest dataset comprises an extensive collection of samples sourced from pertinent literature sources. This dataset encompasses 127 distinct data sets, systematically incorporating stress and energy variables into its framework.
-
2. Nine unconstrained optimization algorithms were integrated with BP neural networks, and the SVM models were optimized utilizing the APSO algorithm. Subsequently, hybrid models were developed for comprehensive research in CBL.
-
3. By conducting a thorough comparison of accuracy, F1-score, and Kappa coefficient, alongside a sensitivity analysis of pertinent indicators, a refined optimization mechanism tailored to the CBL model was formulated.
Methods
Overview of CBL
CBL characterizes the ability of coal to store strain energy and experience impact-induced fractures, essential for predicting coal burst incidents. Classified into three levels—none, weak, and strong—CBL’s intensity directly influences the probability of coal bursts. The stronger the CBL, the higher the likelihood of coal burst occurrences. The CBL evaluation is typically determined through a comprehensive analysis of various laboratory-determined discriminative indexes. Commonly used indexes include the duration of dynamic fracture (DT), elastic strain energy index (WET), bursting energy index (KE), and uniaxial compressive strength (RC).
DT is defined as the period required for a coal sample under uniaxial compression to proceed from its ultimate strength to total failure, as illustrated in Fig. 1. A shorter failure time indicates a stronger CBL.
When calculating DT, the average value of the dynamic failure time for each group of samples should be taken, as shown in Eq. (1).
where DT is the average dynamic failure time (ms), DTi is the dynamic failure time of the i-th specimen (ms), n is the number of specimens per group.
(2) WET is defined as the quotient of elastic deformation energy to plastic deformation energy (dissipated deformation energy) in a coal sample under uniaxial compression before it reaches a specific stress threshold (prior to failure), as depicted in Fig. 2. An increase in the WET value signifies a stronger CBL. The formula for calculating WET is presented in Eq. (2).
(3) KE is articulated as the proportion of accumulated deformation energy (F1) preceding peak stress to the dissipated deformation energy (F2) subsequent to peak stress, as evidenced in the stress-strain curve of a coal sample under uniaxial compression, illustrated in Fig. 3. An elevated KE value delineates a stronger CBL, indicating a higher predisposition towards coal burst events. The calculation formula for KE is given in Eq. (3).
(4) RC is defined by the quotient of the failure load (Q) to the bearing surface area (A) of a standardized coal sample subjected to uniaxial compression. This metric, illustratively presented in Eq. (4), serves as a pivotal indicator of CBL, where an augmented RC value signals an increased risk to coal burst.
Machine learning methods
In recent years, machine learning methods have been widely used in the classification and evaluation of engineering problems. The ‘black box’ nature of machine learning models allows them to circumvent complex mechanistic issues in decision-making, thereby attracting significant attention. Machine learning research began with the introduction of neural networks23, nearly 80 years, perceptron24, Hubel-Wiesel biological visual model25, back propagation neural network (BPNN)26, decision tree27, support vector machine (SVM)28, boosting29 and other models have been developed, which have become the mainstream of machine learning applications.
Index selection is essential in studying of CBL classification by machine learning methods. Referring to previous studies7,16,30, duration of dynamic fracture (DT), elastic strain energy index (WET), bursting energy index (KE), and uniaxial compressive strength (RC) as the classification indexes were selected and divided the CBL level into three classes: strong (class 1), weak (class 2), and none (class 3), and they are considered as the output of machine learning model.
As the most representative machine learning methods, BPNN and SVM are adopted for CBL classification in this paper. The best model is selected by establishing different machine learning models and evaluating their performance. A total of twelve machine learning models are established, including ten BPNN models and two SVM models.
Ten BPNN models include one standard model based on gradient descent and nine improved models according to different unconstrained optimization algorithms (quantized conjugate gradient method, Fletcher-Reeves conjugate gradient method, Polak-Ribiere conjugate gradient method, momentum gradient descent method, adaptive lr gradient descent method, adaptive lr momentum gradient descent method, elastic gradient descent method, Levenberg‒Marquardt algorithm, and quasi-Newton algorithm). Principles of these algorithms are described in references30,31,32. Table 1 shows the ten BPNN models based on different algorithms.
The two SVM models established in this paper, one is linear SVM (LSVM) model, which transforms the constrained problem of soft margin maximization into an unconstrained problem using the Lagrange function. The other is the APSO-SVM model, which uses an adaptive particle swarm optimization algorithm (APSO) to optimize the SVM model. This algorithm improves the traditional particle swarm optimization algorithm, and the specific principle can be found in the reference33.
Data standardization
Data standardization is dimensionless, which can eliminate the influence of different dimensions and value ranges of indexes on classification results. In this paper, unified extreme value processing method is selected for data standardization. The principle is shown in Eq. (5).
where xij is the original data of j-th index of i-th sample; xij* is the processed data, whose value is between 0 and 1, and the data distribution is consistent with that of before processed; max(xj) and min(xj) are the maximum and minimum value of original data of index j, respectively.
Performance evaluation
In this paper, three indicators, the accuracy, F1-score, and Kappa coefficient, are used to evaluate classification performance of different machine learning models comprehensively and scientifically.
(1) The accuracy34 is the ratio of the number of simples for accurate classification to the number of samples for modeling. The calculation formula for accuracy is given in Eq. (6).
where TP is the true positive in confusion matrix; FP is the true positive in confusion matrix; FN is the false negative in confusion matrix; TN is the true negative in confusion matrix.
The confusion matrix is shown in Fig. 4.
(2) F1-score35 is an effective indicator to measure the classification accuracy. It considers the accuracy and recall of the classification model and is a harmonic average method of the accuracy and recall of the model. The value of F1-score is between 0 and 1, and the closer to 1, the stronger the comprehensive classification ability of the classifier. The calculation formula for F1-score is given in Eqs. (7–9).
where the Precision is the proportion of positive examples in the model, and Recall is the proportion of predicted positive examples in the total positive examples.
(3) Kappa coefficient36 is a parameter calculated based on confusion matrix in statistics to measure the classification accuracy. The value of Kappa coefficient is close to 1, indicating that the actual output and classified output have strong consistency. The calculation formula for Kappa coefficient is given in Eqs. (10) and (11).
where the pe is the total probability the raters both saying positive and negative randomly.
Research route
In this paper, firstly, an imbalanced sample set consisting of 127 groups of CBL samples is established, and the sample set is divided into a training set and a test set according to the ratio of 7:3. Then, twelve machine learning models are adopted to classify the samples, and the classification results are evaluated and analyzed for model optimization, and ultimately, the best model is applied to engineering contexts. The research flowchart is shown in Fig. 5.
CBL classification using ML methods
Data analysis
By consulting a large number of literatures, 127 well-documented CBL samples have been collected (All data represent the average values obtained from multiple coal samples in this group, with specific details provided in the Supplementary Materials.), including 53 strong bursting liability samples, 67 weak bursting liability samples, and 7 none bursting liability samples. Figure 6 shows the data distribution of DT, WET, KE, and RC with different bursting liabilities. Table 2 presents in detail the statistical characteristics of the indexes data. It can be seen that the data of each index is evenly distributed without obvious abnormal values, and the index selection is reasonable.
In order to ensure the rationality of modelling, further correlation analysis is required. The Pearson correlation coefficient, a widely used measure of correlation in various fields, serves as an index of correlation analysis. Its value is between −1 and 1, which can measure the linear correlation between the two indexes. The greater the absolute value of correlation coefficient, the stronger the correlation between indexes. Figure 7 shows the correlation analysis of DT, WET, KE, and RC, including significant level (P), correlation coefficient (r) and curve fitted by the correlation coefficient. When analyzing the correlation of indexes, it is necessary to test their significance level first. After passing the significance test, the correlation coefficient can be used for correlation analysis. Figure 7 shows the CBL classification indexes have weak or no correlation, indicating the selected indexes are scientific and reasonable.
Based on it, the database, containing 127 groups of samples, was divided into a training set (70%) and a test set (30%), and the data were tested by using fivefold cross-validation, as shown in Fig. 8. Professional data analysis software is adopted to process data and construct models in this paper.
Model construction
BPNN
BPNN is a model proposed by Rumelhart and McClelland37. The input layer of BPNN built in this paper has four neurons, namely, DT, WET, KE, and RC. The configuration of the hidden layer is based on the guidelines provided in references38,39. Two hidden layers are set, and the number of hidden layer nodes selected after multiple debugging is 10 and 4, respectively. The output layer of BPNN has three neurons, corresponding to the class 1 to 3 of CBL level, as shown in Fig. 9.
The upper limit of the training times of the ten BPNN models built in this paper is set to 1000, the learning rate is 0.01, the minimum error of the training target is 0.0001, and the display frequency is 25, that is, it is displayed once every 25 times of training, the minimum performance gradient is 1*e−6, and the maximum number of failures is 6. The operation process of the model is shown in Fig. 10.
SVM
SVM40,41 is a supervised learning algorithm proposed by Vapnik and other scholars to analyze data and classify decisions. Based on the statistical learning theory and structural risk minimization principle, SVM uses the limited sample information to establish a classification hyperplane in the feature space for obtaining classification and recognition abilities. It offers advantages such as requiring fewer samples, handling nonlinearity, and functioning effectively in high-dimensional spaces. In the classification process, SVM maps the low-dimensional data to the high-dimensional space through the higher-dimensional space of kernel function26, as shown in Fig. 11.
As for the selection of model parameters, the penalty factor c of LSVM model is 1.0, the kernel function is linear kernel function, and the parameter g is set to the reciprocal of the number of sample features. APSO-SVM model selects the RBF radial basis kernel function, penalty factor c and kernel function parameter g adopt APSO for parameter optimization, and other parameters are the same as LSVM model.
Results
Classification result and evaluation
For the multi-index and multi-classification problem, the accuracy, F1-score, and Kappa coefficient were used as the evaluation indicators in this paper. The accuracy can better reflect the overall classification performance of model, and the other two indicators can better reflect the classification accuracy of model. Based on this, the confusion matrixes are constructed for twelve learning models (see Fig. 12), and the values of three evaluation indicators of each model are calculated, as shown in Table 3.
Analysis and optimization
Analysis
-
(1)
Classification performance analysis
According to Table 3, the LM-BPNN model has the highest accuracy rate (96.85%), the accuracy rate (81.89%) of GDM-BPNN model is lowest, and the average accuracy rate of twelve models is 90.09%. It can be seen that the accuracy rate of CBL classification model constructed in this paper is high, but there is a large gap between the accuracy of models based on the same machine learning method and different optimization algorithms.
Unlike the accuracy, F1-score focuses more on identifying the data with the actual sample level, which is a measurement standard that combines the accuracy rate and recall rate. Table 3 shows that the LM-BPNN model has a highest F1-score value (0.9113), the standard BPNN model has a lowest F1-score value (0.6885), and the average F1-score value of twelve models is 0.8109. Compared with the accuracy, F1 score considers the situation of samples with different levels. For example, the accuracy of RP-BPNN model is 88.98%, however, the classification effect of samples of class 3 is poor (only two groups of samples are correct) seen from the confusion matrix, and the F1 score value is only 0.7641, which is lower than that of the CGF-BPNN model with an accuracy rate of 87.40%.
Kappa coefficient is also an evaluation indicator under imbalanced samples. Different from F1-score, Kappa coefficient focuses more on categories with a small sample size. Table 3 shows that the Kappa coefficient value (0.9417) of LM-BPNN is highest, the Kappa coefficient value (0.6485) of GDM-BPNN is smallest, and the average value of twelve models is 0.7999. Compared with the accuracy rate and F1-score value, the fluctuation of Kappa coefficient of different models is more obvious.
Figure 13 shows the change curve of the accuracy rate, F1-score value, and Kappa coefficient value of twelve classification models. It can be seen that the overall trend of three change curve is similar, among which the accuracy curve is the most stable. The Kappa coefficient is similar to the accuracy trend, but the fluctuation is largest, which is determined by design intention and calculation method of Kappa coefficient. The accuracy is directly involved in the calculation of Kappa coefficient, so it is closer in the overall trend. The design of Kappa coefficient is to consider the categories with few samples. Among the three types of CBL samples, there are fewer class 3 samples, so the classification results of class 3 samples have a greater impact on the Kappa coefficient value and are prone to obvious fluctuations. F1-score not only considers the imbalanced sample set, but also does not pay special attention to a certain category of samples, which is more balanced than the other two evaluation indicators.
-
(2)
Analysis of model characteristics
BPNN model is a multilayer feedforward neural network model. The gradient descent method and nine unconstrained optimization algorithms were selected to train BPNN models. Different algorithms result in different classification effects. As an optimization algorithm of differentiable function, gradient descent method solves the learning problem of multi-layer neural network, but it also has some shortcomings. It can be seen from Table 3 that the classification accuracy of standard BPNN model is lower than that of most improved BPNN models, and also lower than the average accuracy of twelve models.
The nine improved BPNN models were established according to different unconstrained optimization algorithms. As far as the classification effect is concerned, they present positive and negative optimization, that is, whether the classification effect is better than the standard BPNN model. GDM-BPNN model and BFG-BPNN model show negative optimization, in which GDM-BPNN model adds a momentum factor in the weight updating stage of gradient descent method to strengthen the anti-vibration ability of the model. The classification results indicate that improving the model’s anti-vibration ability does not correlate with, or enhance, CBL classification accuracy. BFG-BPNN model is established based on quasi-Newton method. Compared with gradient descent method, this method optimizes the convergence speed of BPNN, but the gradient of objective function needs to be known when calculating, the model established in this paper is not given, resulting in low accuracy.
The other seven improved BPNN models show positive optimization. Due to optimization angles of different unconstrained optimization algorithms are different, three improved BPNN models with highest classification accuracy were analyzed in this paper, namely, LM-BPNN model, GDA-BPNN model, and GDX-BPNN model. LM-BPNN model is a BPNN model optimized according to the Levenberg–Marquardt algorithm. Levenberg–Marquardt algorithm is a classical trust region algorithm, which optimizes the Newton method most commonly used in optimization problems, and also optimizes the complex computational problems in quasi-Newton method, so it has the advantages of gradient method and Newton method at the same time. The classification results show that the accuracy of LM-BPNN model is 96.85%, which is the highest among all models. GDA-BPNN model and GDX-BPNN model are optimized based on adaptive lr gradient descent method and adaptive lr momentum gradient descent method, respectively. The optimization of traditional gradient descent method is mainly reflected in the adaptive learning rate. This method adapts to each parameter in the update process and determines the large or small update amount according to the importance of each parameter, so that the algorithm has a faster convergence speed and better classification accuracy. The classification results of this paper also verify the advantages of adaptive learning rate method.
For the two SVM models established in this paper, from the classification results, the performance of LSVM model using linear kernel function is better than APSO-SVM model. The correlation between classification indicators has been verified above, so LSVM model is more suitable for the sample database built in this paper. APSO algorithm is an improved particle swarm optimization algorithm. The APSO-SVM model established in reference33 is a classification model applicable to nonlinear data. Through the analysis of principle and results, the sample database built in this paper is more suitable for linear models or models that do not require index correlation.
Model optimization
In order to select the best machine learning model from the twelve models, the following optimization rules were developed in this paper: Since larger values in the three performance evaluation indicators signify better performance, priority is given to the classification model with the highest values. When the values of evaluation indices are similar, preference is given to models with superior stability and suitability for the established sample database.
Through comprehensive comparative analysis, it can be seen that the accuracy (96.85%) of LM-BPNN model is the highest, representing the best overall classification accuracy; F1 score value (0.9113) and Kappa coefficient value (0.9417) are also the highest, representing that the CBL classification results of different classes are good, which can be verified by the confusion matrix of the model. In summary, LM-BPNN model is selected as the best model in this paper.
Sensitivity analysis
In the sphere of machine learning, the initial phase of model deployment is underscored by data preprocessing, a critical procedure adopted across a spectrum of machine learning methodologies within mining engineering41,42,43,44,45 to bolster model precision and reliability. This investigation pursues the identification of the superior model via a sensitivity analysis centered on data preprocessing techniques specific to index data. Through the application of diverse preprocessing strategies, including differentiated extreme value processing, averaging processing, and normalized processing, sample data is transformed into a dimensionless format. The classification results of the models built using these methods are compared with the results obtained using the unified extreme value processing method in data analysis. The performance evaluation results of the models can be found in Fig. 14.
Referring to Fig. 14, the LM-BPNN model (No.9#), selected for this investigation, demonstrates commendable performance across all four dimensionless data preprocessing techniques. The model's three evaluative metrics—Accuracy, F1-score, and Kappa coefficient—register at comparably elevated tiers, underscoring the model’s robustness to variations in data preprocessing and its enhanced classification consistency. Further analysis, as depicted in Fig. 14c, reveals that the LM-BPNN model consistently secures the apex position in terms of the Kappa coefficient value across disparate data processing strategies. This evidences the model’s aptitude for handling imbalanced sample databases effectively. The study’s sample database, characterized by a significant imbalance with a distribution ratio of 53:67:7 across three sample categories, benefits from the model’s superior ability to classify samples belonging to the less represented category, specifically the coal samples categorized under weak CBL, thereby outshining other models under comparison.
Engineering application
To verify the engineering application performance, LM-BPNN model is utilized to Wudong Coal Mine and N2105 working surface of a coal mine in Yvwu Coal Industry.
-
(1)
Wudong coal mine
Wudong Coal Mine46 is located in the north and south wings of Badaowan syncline in Urumqi mining area, China. The mining level of the coal seam is +475 and the mining depth is about 375 m. The immediate roof of the B3 + 6 coal seam consists of grey argillaceous rocks, with a thickness ranging from 1.5 to 2.1 m. It has a blocky structure and becomes loose when exposed to water, leading to collapse in the later stages of mining. The basic roof is composed of siltstone, with a thickness of 24.49 m. It is characterized by its hard and stable nature and does not collapse during mining operations. The immediate floor is composed of charcoal mudstone, with an average thickness of 1.35 m. It has the potential to collapse in the later stages of mining. The basic floor consists of fine-siltstone. LM-BPNN model is applied to the CBL classification of five groups of coal in Wudong Coal Mine, and the classification results are consistent with the actual impact dynamics of the coal seam (the coal seam has experienced three times of coal burst during mining), as shown in Table 4 for details.
-
(2)
N2105 working surface of a coal mine in Yvwu coal industry
Yvwu Coal Mine is located in the middle east of Qinshui Coalfield, Shanxi Province, China. It is 16 km long from north to south and 10 km wide from east to west, with a total area of 161.205 km2. The burial depth of N2105 working face is 507–597 m. The average thickness of the coal seam being mined is 6.31 m, with a uniaxial compressive strength of 6.88 MPa. The main overlying roof consists of fine-siltstone sandstone, approximately 11 m thick. It has been identified that both the coal seam and its roof and floor have a weak CBL. After mining, the coal body is plastic damaged, and the bolt support is partially ineffective47. The classification results of LM-BPNN model are consistent with the actual situation on site. See Table 5 for details.
Discussion
In this study, the largest database of CBL samples was created for classification research. The LM-BPNN model was selected from 12 CBL classification models. It is applied in Wudong Coal Mine and Yvwu Coal Mine. Setting itself apart from existing studies15,17,18,48, this paper not only expands the database but also diverges in research methodologies, manifested in three key points: Firstly, new technical means are introduced for CBL classification. Secondly, the ML model is optimized using unconstrained optimization algorithms and heuristic algorithms. Thirdly, a new evaluation mechanism suited for CBL classification is established through the integration of sensitivity analysis.
The research findings of this paper can be applied to mines requiring coal bursts control measures. It is important to note that the applicability of our model might vary, because there may be significant differences in coal mine conditions in production.
Conclusions
In view of the shortcomings of traditional CBL classification methods, this paper selected DT, WET, KE and RC as classification indexes, constructed a database containing 127 samples, and established twelve machine learning models based on BPNN and SVM for performance evaluation and optimization, and obtained the following conclusions:
-
(1) Machine learning methods were introduced for CBL intelligent classification research for the first time, and the classification accuracy of the machine learning model established was between 81.89% and 96.85%.
-
(2) The LM-BPNN model was identified as the most effective, demonstrating optimal performance among twelve models with a classification accuracy of 96.85%, an F1 score of 0.9113, and a Kappa coefficient of 0.9417. These metrics were used to assess the model’s classification capabilities across samples, supported by comprehensive analysis and sensitivity analysis.
-
(3) The LM-BPNN model was applied to Wudong Coal Mine and N2105 working surface of a coal mine in Yvwu Coal Industry. The predictive results are consistent with actual conditions, yielding a new method for CBL classification with promising application prospects.
Data availability
All data generated or analysed during this study are included in this published article; further inquiries can be directed to the corresponding author.
Abbreviations
- CBL:
-
Coal bursting liability
- BPNN:
-
Back propagation neural network
- DDA:
-
Distance discriminant analysis
- DT :
-
Duration of dynamic fracture
- W ET :
-
Elastic strain energy index
- K E :
-
Bursting energy index
- R C :
-
Uniaxial compressive strength
- N :
-
Number of specimens per group
- DT i :
-
Dynamic failure time of the i-th specimen
- Φ sp :
-
Elastic deformation energy
- Φ st :
-
Plastic deformation energy
- F 1 :
-
Accumulated deformation energy
- F 2 :
-
Dissipated deformation energy
- Q :
-
Failure load
- A :
-
Bearing surface area
- SVM:
-
Support vector machine
- SCG:
-
Quantized conjugate gradient
- CGF:
-
Fletcher-Reeves conjugate gradient
- CGP:
-
Polak-Ribiere conjugate gradient
- GDM:
-
Momentum gradient descent
- GDA:
-
Adaptive lr gradient descent
- GDX:
-
Adaptive lr momentum gradient descent
- RP:
-
Elastic gradient descent
- LM:
-
Levenberg–Marquardt
- BFG:
-
Quasi–Newton
- LSVM:
-
Linear support vector machine
- APSO:
-
Improved particle swarm optimization algorithm
- TP :
-
True positive in confusion matrix
- TN :
-
True negative in confusion matrix
- FP :
-
False positive in confusion matrix
- FN :
-
False negative in confusion matrix
- p e :
-
Parameters of kappa coefficient
- P :
-
Significant level
- r :
-
Pearson correlation coefficient
- c :
-
Penalty factor
- g :
-
Kernel function
- GB/T:
-
Recommended national standards in China
- Coal burst:
-
A phenomenon where coal and rock in mines suddenly fracture and release energy, often leading to the destruction of mine tunnels or working faces.
- Bursting liability:
-
Refers to the potential or tendency for rock or coal seams to experience shock ground pressure when subjected to external forces.
- Working face:
-
In coal mining, the location or area where coal mining operations are directly carried out, representing the forefront of coal production.
- Immediate roof:
-
In coal mining, the rock layer directly above the coal seam.
- Mudstone:
-
A fine-grained sedimentary rock, primarily composed of tiny mineral particles, typically smaller than 0.0625 mm in size. Mudstone is harder when dry and may become softer when wet.
- Mining depth:
-
The vertical distance from the surface to the operational face of the mine.
- Coal seam:
-
A layer in the underground sedimentary rock formation, primarily composed of coal and serving as the main source of coal resources.
- Basic roof:
-
In coal mining, the harder or more stable rock layer located directly above the immediate roof.
- Siltstone:
-
A fine-grained sedimentary rock, composed of mineral particles with sizes between those of mudstone and sandstone.
- Immediate floor:
-
In coal mining, the rock layer directly beneath the coal seam.
- Main overlying roof:
-
The principal rock layer located directly above the immediate roof, usually harder and more stable, playing a crucial role in the overall stability of the mine.
- Sandstone:
-
A medium to coarse-grained sedimentary rock, primarily made up of sand-sized mineral particles, known for its good porosity and permeability
- Grey argillaceous rocks:
-
Sedimentary rocks rich in clay minerals, giving the rock a muddy texture. These rocks are typically grey in color, varying in shades due to the presence of organic matter or iron.
- Charcoal mudstone:
-
A type of mudstone that contains a higher proportion of carbonaceous material, usually darker in color, ranging from grey-black to black.
- Fine-siltstone:
-
A fine-grained sedimentary rock, composed of mineral particles sized between mudstone and sandstone, generally ranging from 0.0039 to 0.0625 mm.
- Burial depth:
-
The vertical distance from the surface to the position of a rock layer or stratum.
References
Ju, Y. et al. Transparentized solutions and interpretation for the effects of discontinuous structures and multiphysics on rock failure. J. Ch. Coal Soc. 47, 210–232. https://doi.org/10.13225/j.cnki.jccs.YG21.2067 (2022).
Dong, L. J. et al. Acoustic emission source location method and experimental verification for structures containing unknown empty areas. Int. J. Min. Sci. Technol. 32, 487–497. https://doi.org/10.1016/j.ijmst.2022.01.002 (2022).
He, S. Q. et al. Study on the seismic damage and dynamic support of roadway surrounding rock based on reconstructive transverse and longitudinal waves. Adv. Geo Energy Res. 9, 156–171. https://doi.org/10.46690/ager.2023.09.04 (2023).
Wang, K. et al. A numerical investigation of hydraulic fracturing on coal seam permeability based on PFC-COMSOL coupling method. Int. J. Coal Sci. Technol. 9, 10. https://doi.org/10.1007/s40789-022-00484-2 (2022).
Du, F. et al. Rockburst mechanism and the law of energy accumulation and release in mining roadway: A case study. Int. J. Coal Sci. Technol. 9, 67. https://doi.org/10.1007/s40789-022-00521-0 (2022).
Linghu, J. et al. Influence of deep magma-induced thermal effects on the regional gas outburst risk of coal seams. Int. J. Coal Sci. Technol. 8, 1411–1422. https://doi.org/10.1007/s40789-021-00452-2 (2021).
Wang, C. et al. Research on the classification model of coal’s bursting liability based on database with large samples. Arab. J. Geosci. https://doi.org/10.1007/s12517-019-4562-2 (2019).
Wang, C. et al. Evaluation of bedding effect on the bursting liability of coal and coal-rock combination under different bedding dip angles. Adv. Geo Energy Res. 11, 29–40. https://doi.org/10.46690/ager.2024.01.04 (2024).
Qiu, L. M. et al. Response law and indicator selection of seismic wave velocity for coal seam outburst risk. Adv. Geo Energy Res. 9, 198–210. https://doi.org/10.46690/ager.2023.09.07 (2023).
Gong, F. Q., Wang, Y. L., Wang, Z. G., Pan, J. F. & Luo, S. A new criterion of coal burst proneness based on the residual elastic energy index. Int. J. Min. Sci. Technol. 31, 553–563. https://doi.org/10.1016/j.ijmst.2021.04.001 (2021).
Bieniawski, Z. T., Denkhaus, H. G. & Vogler, U. W. Failure of fractured rock. Int. J. Rock Mech. Min. Sci. 6, 323–341. https://doi.org/10.1016/0148-9062(69)90009-6 (1969).
Bieniawski, Z. T. Mechanism of brittle fracture of rock: Part I—theory of the fracture process. Int. J. Rock Mech. Min. Sci. 4, 395–406. https://doi.org/10.1016/0148-9062(67)90030-7 (1967).
Qi, Q. X. et al. Study of bursting liability of coal and rock. Chin. J. Rock Mech. Eng. 30, 2736–2742 (2011).
GB/T 25217.2-2010. Classification and Laboratory Test Method on Bursting Liability of Coal. Beijing: Standards Press of China (2010).
Xu, J. K., Wang, E. Y. & Wang, C. Study of rock burst tendency of coal based on uncertainty measurement theory. Saf. Coal Mines 42, 19–22. https://doi.org/10.13347/j.cnki.mkaq.2011.03.013 (2011).
Wang, C., Wang, E. Y. & Liu, X. F. Classification of rock burst tendency of coal seam based on entropy and ideal point method. J. Liaoning Tech. Univ. Nat. Sci. Ed. 31, 838–841 (2012).
Jia, X. W. & Wang, E. Y. Coal burst tendency classification based on attribute mathematical model. Saf. Coal Mines 45, 156–158. https://doi.org/10.13347/j.cnki.mkaq.2014.04.046 (2014).
Guo, J. D. Application of variable weight and grey classification recognition model in rock burst tendency evaluation of coal seam. J. N. Ch. Inst. Sci. Technol. 14, 44–49. https://doi.org/10.3969/j.issn.1672-7169.2017.01.009 (2017).
Lv, W., Wang, M. & Zhu, X. Model for prediction of surface subsidence coefficient in backfilled coal mining areas based on genetic algorithm and BP neural network. J. Comput. Methods Sci. Eng. 16, 745–753. https://doi.org/10.3233/JCM-160688 (2017).
Dong, Q. Y. et al. Fine detection of water-bearing collapse column based on information fusion of seismic exploration and TEM. J. Appl. Geophys. 206, 104806. https://doi.org/10.1016/j.jappgeo.2022.104806 (2022).
Amirkiyaei, V., Ghasemi, E. & Kadkhodaei, M. H. Development of empirical models to predict stope wall stability in open stope mines using gene-expression programming. Arab. J. Geosci. 16, 616. https://doi.org/10.1007/s12517-023-11730-5 (2023).
Zhu, J. et al. Risk assessment of deep coal and gas outbursts based on IQPSO-SVM. Int. J. Environ. Res. Public Health 19, 12869. https://doi.org/10.3390/ijerph191912869 (2022).
McCulloch, W. S. & Pitts, W. A logical calculus of the ideas immanent in nervous activity. Bull. Math. Biol. 1–2, 99–115. https://doi.org/10.1016/S0092-8240(05)80006-0 (1990).
Rosenblatt, F. The perceptron: A probabilistic model for information storage and organization in the brain. Psychol. Rev. 65, 386–408. https://doi.org/10.1037/h0042519 (1958).
Hubel, D. H. & Wiesel, T. N. Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. J. Physiol. 160, 54–106. https://doi.org/10.1113/jphysiol.1962.sp006837 (1962).
Rumelhart, D. E. & McClelland, J. L. Parallel distributed processing: Explorations in the microstructure of cognition. Foundations 1, 318–362 (1962).
Safavian, S. R. & Landgrebe, D. A survey of decision tree classifier methodology. IEEE Trans. Syst. Man Cybern. 21, 660–674. https://doi.org/10.1109/21.97458 (1991).
Cortes, C. & Vapnik, V. Support-vector networks. Mach. Learn. 20, 273–297. https://doi.org/10.1007/BF00994018 (1995).
Yoav, F. & Freund, R.E (1996) Experiments with a new boosting algorithm. International Conference on Machine Learning, 148–156
Li, Y. F. Comparison and Optimization of Multi-Index Evaluation Models for Coal Sample Bursting Liability Classification (Kunming University of Science and Technology, 2022).
Li, K. S., Li, K. & Zhang, W. S. PCA face recognition algorithm based on improved BP neural network. Comput. Appl. Softw. 31, 158–161. https://doi.org/10.3969/j.issn.1000-386x.2014.01.041 (2014).
Wang, C., Li, Y. F., Zhang, C. L., Liu, L. & Huang, X. C. Study on influence of indicator dimensionless classification model of coal’s bursting liability. Ch. Saf. Sci. J. 29, 20–25. https://doi.org/10.13225/j.cnki.jccs.YG21.2067 (2019).
Li, Y. F. et al. Rockburst prediction based on the KPCA-APSO-SVM model and its engineering application. Shock Vib. 2021, 12. https://doi.org/10.1155/2021/7968730 (2021).
Jalilian, M., Ghasemi, E. & Kadkhodaei, M. H. Stability assessment of open spans in underground entry-type excavations by focusing on data mining methods. Min. Metall. Explor. https://doi.org/10.1007/s42461-024-00945-z (2024).
Chang, C., Zhu, N., Yang, K. & Yang, F. Data and analytics for heating energy consumption of residential buildings: The case of a severe cold climate region of China. Energy Build. 172, 104–115. https://doi.org/10.1016/j.enbuild.2018.04.037 (2018).
Wang, J. Z., Han, J., Liu, Y. X. & Ding, S. L. Application of kappa statistic in consistency and reproducibility test. J. Shandong Univ. Health Sci. 3, 32–35 (1996).
Rumelhart, D., Hinton, G. & Williams, R. Learning representations by back-propagating errors. Nature 323, 533–536. https://doi.org/10.1038/323533a0 (1986).
Lippmann, P. R. An introduction to computing with neural nets. IEEE ASSP Mag. 2, 4–22. https://doi.org/10.1109/MASSP.1987.1165576 (1987).
Cyberko, G. Approximations by superpositions of a sigmoidal function. Math. Control Signals Syst. 2, 303–314. https://doi.org/10.1007/BF02551274 (1989).
Brierley, S., Chiasson, H. J., Lee, E. & Zak, S. On stability independent of delay for linear systems. IEEE Trans. Autom. Control 27, 252–254. https://doi.org/10.1109/TAC.1982.1102854 (1982).
Mahmoud, M. S. & Al-Muthairi, N. F. Design of robust controller for time-delay systems. IEEE Trans. Autom. Control 39, 995–999. https://doi.org/10.1109/9.284878 (1994).
Lou, Y. H. & Li, K. G. A rockburst grade evaluation method based on principal component analysis and the catastrophe progression method. Front. Earth Sci. 11, 1163187. https://doi.org/10.3389/feart.2023.1163187 (2023).
Li, Y. F., Wang, C. & Liu, Y. Classification of coal bursting liability based on support vector machine and imbalanced sample set. Minerals 13(1), 15. https://doi.org/10.3390/min13010015 (2023).
Wang, Q. W. et al. Exploration and improvement of fuzzy evaluation model for rockburst. Min. Metall. Explor. https://doi.org/10.1007/s42461-024-00933-3 (2024).
Wang, C. et al. BP neural network model optimization for rockburst prediction based on nine unconstrained optimization algorithms. J. Kunming Univ. Sci. Technol. 46, 32–37. https://doi.org/10.1007/s12517-019-4562-2 (2021).
Hao, Y. X. Study on Rockburst of Wudong Near-Vertical Coal Seams and Impact Protection with Constant Resistance and Large Deformation Supporting (China University of Mining and Technology, 2016).
Chen, J. G. Study on Theory of Disaster Induced by Rock Burst of Roadway Floor with Wide Coal Pillar and the Prevention Techniques (China University of Mining and Technology, 2016).
Zhou, J., Chen, C., Wang, M. Z. & Khandelwal, M. Proposing a novel comprehensive evaluation model for the coal burst liability in underground coal mines considering uncertainty factors. Int. J. Min. Sci. Technol. 31, 799–812 (2021).
Acknowledgements
The authors thank reviewers and editor for their help in revising the paper. This research was supported by the National Natural Science Foundation of China (42367024), the Key Laboratory of Safety and High-efficiency Coal Mining, Ministry of Education (Anhui University of Science and Technology) (JYBSYS202307), the Major Science and Technology Special Project of Yunnan Province (202202AG050014).
Author information
Authors and Affiliations
Contributions
Chao Wang: Writing-Original Draft, Data Curation, Validation; Yv Liu: Writing, Formal Analysis, Investigation; Yuefeng Li: Conceptualization, Writing-review & editing, Methodology, Supervision; Xiaofei Liu: Writing, Data Curation, Formal Analysis; Qiwei Wang: Data Curation, Formal Analysis.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wang, C., Liu, Y., Li, Y. et al. Classification of coal bursting liability of some chinese coals using machine learning methods. Sci Rep 14, 14030 (2024). https://doi.org/10.1038/s41598-024-61801-0
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-024-61801-0
Keywords
This article is cited by
-
Factor of safety prediction for slope stability using PCA and BPNN in Guangdong’s H mining area
Scientific Reports (2025)
















