Automatic construction of risk transmission network about subway construction based on deep learning models

Liang, Yanxiang; Xu, Na; Chang, Hong; Qian, Shan; Liu, Yao

doi:10.1038/s41598-025-99561-0

Download PDF

Article
Open access
Published: 11 May 2025

Automatic construction of risk transmission network about subway construction based on deep learning models

Yanxiang Liang¹,
Na Xu¹,
Hong Chang²,
Shan Qian¹ &
…
Yao Liu¹

Scientific Reports volume 15, Article number: 16383 (2025) Cite this article

1685 Accesses
1 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Safety risks management is a critical part during the subway construction. However, conventional methods for risk identification heavily rely on experience from experts and fail to effectively identify the relationship between risk factors and events embedded in accident texts, which fail to provide substantial guidance for subway safety risks management. With a dataset comprising 562 occurrences of subway construction accidents, this study devised a domain-specific entity recognition model for identifying safety hazards during the subway construction. The model was constructed by a Bidirectional Long Short-Term Memory Network with Conditional Random Fields (BiLSTM-CRF). Additionally, a domain-specific entity causal relation extraction model employing Convolutional Neural Networks (CNN) was also developed in thsi model. The constructed models automatically extract safety risk factors, safety events, and their causal relationships from the texts about subway accidents. The precision, recall, and F₁ scores of Metro Construction Safety Risk Named Entity Recognition Model (MCSR-NER-Model) all exceeded 77%. Its performance in the specialized domain named entity recognition (NER) with a limited volume of textual data is satisfactory. The Metro Construction Safety Risk Domain Entity Causal Relationship Extraction Model (MCSR-CE-Model) achieved an impressive accuracy, recall, and F₁ score of 98.96%, exhibiting excellent performance. Moreover, the extracted entities were normalized and domain dictionary was developed. Based on the processed entities and relationships processed by the domain dictionary, 533 domain entity causal relation triplets were obtained, facilitating the establishment of the directed and unweighted complex network and case database about the risks of subway construction. This research successfully converted accident texts into a causal chain structure of “safety risk factors to risk events,” providing detailed categorization of safety risks and events. Concurrently, it revealed the interrelationships and historical statistical patterns among various safety risk factors and categories of risk events through the complex safety risks network. The construction of the database facilitated project managers in conducting management decisions about safety risks.

Integrated multimethod analysis of miners’ safety behavior and risk interaction for practical applications

Article Open access 06 October 2025

A text mining-based approach for comprehensive understanding of Chinese railway operational equipment failure reports

Article Open access 30 July 2025

Study on risk assessment of tunnel construction across mined-out region based on combined weight-two-dimensional cloud model

Article Open access 28 February 2025

Introduction

Metro infrastructure construction is a dynamic systems engineering with complex phenomena and chaotic characteristics¹. A variety of elements such as intricate geological and hydrological settings can result in formidable challenges of construction and organizational coordination^2,3. Safety hazards associated with subway construction are intricate, veiled, and dynamic, which may lead to financial losses, personal harm, ecological disruption, project setbacks and diminished structural integrity^4,5. Traditionally, the identification and analysis of safety risks in context relied on the industry specialists, academics, and seasoned project leaders⁶. With the accumulation of historical data, the distinctive spatiotemporal attributes, highly nonlinear aspects, and intricate interconnections presented formidable hurdles for the comprehensive analysis^7,8. The perception, analysis and inference of risks from security experts are also inevitably affected by cognitive bias and individual subjectivity⁹. To solve the problems mentioned above, this study conducted automatic risks identification of accident data through data mining technology based on the experience of experts, explored safety hazards and related rules in development and evolution. This study can are significant to make up for individual subjective limitations of industry staff, which can improve the safety risks management level.

In the realm of subway construction safety risks management, the analysis of accident cases has been widely utilized to facilitate the administration and enhancement of engineering safety, thereby serving as a potent resource for mitigating similar hazardous situations and risk incidents^10,11,12. At present, researchers mostly focus on individual risk events or specific types of subway accidents. Relevant researches are all about the descriptive statistics for risk incidents in specific country or region^13,14,15. However, risks do not arise suddenly and in isolation¹⁶. A multitude of interrelations exists among risk events, which is often overlooked by singular case analysis^17,18. In a specific event, risk factors lead to the occurrence of original risk events, secondary events and derivative events in turn, forming a complete causal chain of risk transmission. The combination of multiple risk chains formed a risk network^19,20. By mining and analyzing sets of causal chains in risk transmission, the interactions among various risk factors and events can be revealed in comprehensive, multi-category accident investigations. This approach effectively circumvents the limitations inherent in singular risk factor and event analysis, contributing to the continual improvement the level of a more systematic, comprehensive safety risks management.

Therefore, BiLSTM-CRF and CNN models were introduced in this study for entity recognition and relationship extraction in domain accident texts. BiLSTM was used to capture long-term dependencies in sentences. CRF play the role of addressing sequence labeling tasks and enhances named entity recognition. CNN can be used to reduce parameter count lowers computational costs of model, enabling parameter sharing and sparse connections. Consequently, the automatic extraction and transformation from accident reports of subway construction to causal chains with the structure of ‘safety risk factors - risk events’, addressing the deficiency of risk identification overly reliant on expert experience. This contributes enhanced the efficiency of domain-specific safety risks recognition. By constructing a set of causal chains for metro construction safety risks, it unveiled the intricate impact relationships and risk transmission pathways among various risk factors and events in Chinese subway construction spanning nearly two decades from a multi-case perspective. This approach resolved the problems of limited study cases and singular categories, mitigating the individual subjectivity and limitations associated with conventional analysis.

For accident texts of subway construction, we developed and trained a model for entity recognition in the domain of safety risks in subway construction by using a Bidirectional Long Short-Term Memory Network combined with Conditional Random Fields (BiLSTM-CRF). It can facilitate the automatic extraction of safety risk factors and domain entities in risk events from accident texts. We established and trained a causal relationship extraction model based on Convolutional Neural Networks (CNN) to extract causal relationships among domain entities. It also can automatically construct a causal chain of “subway construction safety risks factors-risk events,” thereby revealing the universal laws of interaction among risk factors and risk events. In order to further clarify the research, the following assumptions were proposed: The BiLSTM-CRF and CNN models could accurately extract safety risk factors and causal relationships from textual data. The event text data represented the authentic safety risks scenarios in subway construction. Although all risk factors were not covered, its comprehensiveness was sufficient to support the objectives of this research.

Literature review

Construction safety risks identification based on text mining

The identification of safety risks constitutes is a complex system engineering task. Conventional approaches to safety risks identification primarily concentrated on individual risk factors and specific types of accidents. As for the safety risks identification in subway construction, the conventional focused on particular construction procedures and stages, employing expert surveys and interviews, brainstorming sessions and literature research as methods for risk recognition. These approaches heavily rely on experience, which are susceptible to individual cognitive limitations and subjective factors. Fang et al. inferred that nearby pipelines and existed buildings are the primary risk factors during the subway construction based on process control and situational surveys²¹. Meanwhile, Zhang et al. investigated the interaction between safety risks management performance and the perceived significance of each risk factor by conducting surveys and semi-structured interviews with subway construction workers in the Southeast region²². Shi et al. (2024) utilized text mining techniques and DEMATEL-ISM method to identify and evaluate safety risk factors²³. Researchers proposed the subway construction safety risks identification and early warning systems based on construction drawings, Internet of Things, BIM, and other technological tools, progressively broadening the scope and categories of risk identification. Li et al. introduced the BIM-based subway construction safety risks identification and early warning system. It utilized engineering parameter information to achieve safety risks identification²⁴. Guo et al. creatively combined BIM with D-S evidence theory to enhance risk management capabilities for complex underground projects²⁵. However, the majority of risk identification data sources come from numerical data collected by construction machinery and image data obtained through optical equipment. It leads to the limited reports about emerging hazard patterns and subtle differences inferred from unstructured and semi-structured textual data such as subway accident reports and records. Furthermore, the constraint in data categories makes it difficult for risk identification to encompass all risk factors and events.

Subway construction safety management based on natural language processing

Natural Language Processing (NLP) is utilized to facilitate the comprehension of human language systems in computer. It is primarily applied in tasks such as text classification, information recommendation, and information extraction²⁶. At present, NER mainly has three mainstream methods in safety management about subway construction: rule-based, statistical machine learning method, and deep learning method²⁷. Tang et al. utilized text mining to extract risk data from texts, guiding on-site management of subway construction²⁸. Huo et al. utilized text mining to extract key features related to subway accidents from raw data, developing a new causal path selection model²⁹. By interrupting causal propagation along these paths, construction safety can be enhanced. Rules are created by experts and scholars in professional fields to meet their own research needs. Li et al. performed entity recognition from human factors, management, and risks by BP neural network model³⁰. The recognition results were used to predict potential accident types and propose safety management measures during the construction. Machine learning method requires high text standardization. In addition, it only can operate on limited data volumes and generally exhibit moderate effectiveness in entity recognition. Deep learning methods exhibited excellent performance in NER recognition, leading to further improvements in entity identification accuracy and efficiency³¹. Zhou et al. developed a double deep Q-network deep reinforcement learning model to predict subway construction safety risks, which is conducive to enhancing safety management at subway construction sites³². There remains a scarcity of research utilizing deep learning techniques for the NLP mining and analyzing of accident investigation reports, safety records, and other accident texts about subway construction in scholarly literature. With data mining remaining predominant, the entity recognition of safety risks in subway construction is still at an early stage.

Research methods

Research framework

The knowledge structure framework of safety risks in subway construction is depicted in Fig. 1. This framework can be segmented into four sections: corpus construction, entity recognition model, relationship extraction model, and the construction of a subway construction safety risks network. To ensure the representativeness of the results, knowledge extraction was conducted on 562 accident texts among 20 years. The BiLSTM-CRF model was utilized for entity recognition in safety risks about subway construction, while a convolutional neural network model was employed to extract causal relationships among domain entities. In order to get a more comprehensive analysis of safety risk factors and risk events, this research normalized safety risks factors and risk event entities based on the named entity recognition of domain entities, constructing a domain synonym dictionary. Ultimately, this research identified 533 causal relationship triplets in the domain of subway construction safety risks, which served as the basis for constructing a directed unweighted complex network and case database for safety risks in subway construction.

BiLSTM-CRF model

The research employed the BiLSTM-CRF as the deep learning framework for named entity recognition in safety risks of subway construction. BiLSTM is a type of recurrent neural network that processes information from both directions of a sequence to capture context effectively. The CRF is a discriminative probabilistic undirected graph model, which represents the conditional probability distribution of one set of random variables under another given distribution of random variables³³. The BiLSTM model effectively captures long-range dependencies by processing sequence information bidirectionally, thereby enhancing the comprehensive utilization and understanding of contextual information. CRF is well-suited for sequence labeling tasks, ensuring the consistency and contextual relevance of predicted entity labels. The BiLSTM-CRF model is adept at handling text characterized by sequential patterns, necessitating the capture of long-range dependencies and ensuring coherent labeling within context. Its performance shines in named entity recognition, where contextual comprehension is pivotal for precise predictions. In this research, the output entities in accident texts were labeled and got the optimal global label sequence with the constraints of CRF. This method considers the influence of label results from other characters during the output of labels, effectively enhancing the recognition effectiveness of this model. Initially, the subway construction accident text was converted into a character vector representation on a per-character basis, denoted as $\:\{{\text{x}}_{1}$, $\:{\text{x}}_{2}$, …, $\:{\text{x}}_{\text{n}}\}$($\:{\text{x}}_{\text{t}}$∈[1,n]), serving as the input data for this model. The word vector features utilized the 100-dimensional “Chinese word vector library” trained from Wikipedia, encompassing 16,991 characters, to effectively express character features. Subsequently, the data was input into the LSTM neural network in a forward and backward sequences to obtain forward and backward hidden vectors ($\:\overrightarrow{{\text{h}}_{\text{t}}}$ and $\:\overleftarrow{{\text{h}}_{\text{t}}}$) containing semantic information about the accidents in subway construction. The obtained two vectors were concatenated to form the final output vector $\:{\text{h}}_{\text{t}}$, serving as the input for the CRF layer. Finally, the CRF model was employed to obtain the predicted labels for named entities in safety risks of subway construction, which are displayed through the output layer.

Convolutional neural network model

The research utilized a deep learning framework based on CNN for extracting entity causal relationships in safety risks of subway construction. The CNN model consists of embedding layer, convolutional layers, pooling layer, and fully connected layer. The embedding layer represents word vectors through embedding word features and position features. The convolutional layer captures the overall semantic information of sentences³⁴. The pooling layer compresses the results of the convolutional layer using max-pooling to extract significant features, control overfitting, and obtain the feature vector of a sentence³⁵. The fully connected layer integrates highly abstracted features obtained through multiple convolutions to produce output probabilities for various classification, that is, relationship classification results³⁶. The training process of this model is depicted in Fig. 2.

Model evaluation criteria

The recognition performance of the Metro Construction Safety Risk Named Entity Recognition Model (MCSR-NER-Model) and the Metro Construction Safety Risk Domain Entity Causal Relationship Extraction Model (MCSR-CE-Model) were evaluated based on three evaluation metrics: Precision (P), Recall (R), and F₁ Score³⁷. The introduction and calculation formulas for each metric are displayed as follows.

Precision represents the proportion between the number of correctly identified entities and the total number of identified entities.

$$\:P=\frac{TP}{TP+FP}\times\:100\text{\%}$$

Recall indicates the proportion between the number of correctly identified entities and the total number of pre-labeled entities.

$$\:R=\frac{TP}{TP+FN}\times\:100\text{\%}$$

The F₁ Score is the weighted geometric mean of precision (P) and recall (R).

$$\:{\text{F}}_{1}=\frac{2\text{P}\text{R}}{\text{P}+\text{R}}\times\:100\text{\%}$$

In the above formulas, TP represents the number of samples predicted as positive class that are actually positive, FN represents the number of samples predicted as negative class that are actually positive, and FP represents the number of samples predicted as positive class that are actually negative.

Experiment and results

Data acquisition

The data set utilized in this research are the 562 accident texts collected from March 2001 to November 2021, including 130 reports of accidents about subway construction and 432 accident bulletins published by the Ministry of Housing and Urban-Rural Development and news media (Table 1). The dataset consists of encompassing 1821 informative sentences. In contrast to free text, these accident investigation reports contain extensive descriptions of safety risk factors, accident circumstances, outcomes, impacts, and responsibilities, which can facilitate the comprehensive analysis and causal chain delineation of accidents. It also can promote the risk event data mining, and structured documentation. The bulletins and notices published by the Ministry of Housing and Urban-Rural Development succinctly described subway accident causes, risk events name, risk outcomes, and their consequences. It can help to clarify the critical information about subway accidents, such as safety risk factors and accident outcomes.

Table 1 The information of data acquisition.

Subjects

Abstract

Similar content being viewed by others

Integrated multimethod analysis of miners’ safety behavior and risk interaction for practical applications

A text mining-based approach for comprehensive understanding of Chinese railway operational equipment failure reports

Study on risk assessment of tunnel construction across mined-out region based on combined weight-two-dimensional cloud model

Introduction

Literature review

Construction safety risks identification based on text mining

Subway construction safety management based on natural language processing

Research methods

Research framework

BiLSTM-CRF model

Convolutional neural network model

Model evaluation criteria

Experiment and results

Data acquisition

Text preprocessing

Domain named entity identification

Text sequence annotation

Training data structure and environment configuration

Analysis of training results

Domain entity discovery

Causality extraction about domain entities

Relationship types identification

Text sequence annotation

Training data structure and hardware construction

Training results analysis

Construction of domain dictionary

Normalization of domain entities

Construction of domain dictionary

Results and applications

Subway construction safety risks complex network construction

Construction of subway construction accident database

Recommendations for mitigating subway construction risks

Conclusion

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethical statement

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links