Improved railway track faults detection using Mel-frequency cepstral coefficient and constant-Q transform features

Shafique, Rahman; Kanwal, Khadija; Chunduri, Venkata; Choi, Gyu Sang; Ashraf, Imran

doi:10.1038/s41598-025-14763-w

Download PDF

Article
Open access
Published: 22 August 2025

Improved railway track faults detection using Mel-frequency cepstral coefficient and constant-Q transform features

Rahman Shafique¹^na1,
Khadija Kanwal²^na1,
Venkata Chunduri³,
Gyu Sang Choi¹ &
…
Imran Ashraf¹

Scientific Reports volume 15, Article number: 30914 (2025) Cite this article

1970 Accesses
1 Citations
Metrics details

Subjects

Abstract

Regular inspection of the health of railway tracks is crucial to maintaining reliable and safe train operations. Some factors including cracks, rail discontinuity, ballast issues, burn wheels, super-elevation, loose nuts and bolts, and misalignment developed on the railways due to pre-emptive investigations, non-maintenance, and delay in detection pose grave threats and danger to the safe operation of railway transportation. In the past, manual inspection was performed for the rail track by a rail cart which is both prone to error and inefficient due to human biases and error. Several train accidents are reported in Pakistan; it is important to automate these techniques to avoid such train accidents for the safety of countless lives. This study aims to enhance railway track fault detection using an automatic rail track fault detection technique with acoustic analysis. Moreover, the proposed method contributes to making the dataset large by using the CTGAN technique. Results show that acoustic data may help to determine the railway track faults effectively and logistic regression is used to perform the classification for railway track faults with an accuracy of 100%.

Prognostics of unsupported railway sleepers and their severity diagnostics using machine learning

Article Open access 11 April 2022

Automatic 3D railroad alignment detection using modified Hough transform

Article Open access 13 December 2025

Railway infrastructure maintenance efficiency improvement using deep reinforcement learning integrated with digital twin based on track geometry and component defects

Article Open access 10 February 2023

Introduction

The railway network is a highly essential transportation conduit in various developing nations, such as Pakistan, and is utilized to satisfy public transit demands. The railway structure is crucial for trade and supply networks¹. The railway market has gotten stronger, opening up new opportunities for the country’s public and economy. According to², the railway industry’s annual report from 2016 to 2018 showed a growth rate ranging from 1.3% to 2.4%. As a result, high-performance railway operations are essential to ensure that the railway runs continuously and that passengers are safe. The railway system is getting more burdened and complex as the number of train passengers grows. According to³, mechanical forces and environmental factors accelerate the deterioration of train rails. The railway tracks are crucial components of the railway network. Rail track inspection helps decrease accidents, injuries, and deaths⁴. From 2013 to 2020, the registered train accidents were 127 due to rail track faults according to the annual reports in Pakistan⁴. People including students, tourists, and commuters use trains for traveling in Pakistan. From 2012 to 2017, a total of 757 train accidents have been reported⁵. Additionally, 22 goods and 16 passenger trains were derailed in 2014, and 37 goods and 37 passenger train accidents were reported in 2015. In 2019, 11434 railway accidents were recorded, causing 937 casualties and 7730 injuries⁶. However, the train accident ratio is higher in under-developing countries⁷. The railway network contributes to the Pakistani economy as it has a huge network in the North-South corridor that links the seaport of Karachi with the country’s main production centers and population⁸. In 2020, 152 train accidents are reported causing 19 deaths⁹. In 2021, 32 casualties and 64 injuries were recorded in railway accidents⁹.

Timely detection and proper inspection of faults may protect several human lives and reduce the financial losses for railway systems¹⁰. However, the maintenance and inspection of railway tracks is a time-consuming and expensive activity. Several non-destructive evaluation (NDE) methods for railway track inspection have been applied including detection using phased array technology¹¹. Eddy current testing¹², guided wave detection¹³, ultrasonic testing¹⁴, and other techniques have been the focus of rail track inspections. However, there has been a recent surge in enthusiasm for employing machine learning models, the Internet of Things (IoT), and deep learning networks in these inspections. These advanced technologies aim to enhance the speed, precision, uniqueness, and overall success of non-destructive evaluation (NDE) approaches. The integration of acoustic transducers¹⁵ and high-speed cameras¹⁶ with machine learning classifiers is becoming common to modernize traditional inspection methods. In particular, hand-crafted feature engineering (HCFE) has been utilized in audio and image-based machine learning applications. Nevertheless, HCFE demands domain-specific expertise, extensive problem-solving, and system modifications to optimize performance (PS18). Moreover, railway track classification and inspection have three main stages. Firstly, preprocessing of ‘wav’ files is performed to eliminate the undesired sounds. Secondly, feature extraction is performed with spectrograms. Thirdly, the classification method is trained to detect rail track faults.

Maintaining a reliable and safe rail network demands uninterrupted train operations, which entails extensive monitoring of hundreds of thousands of kilometers of track. This endeavor requires substantial investments of time and money. Timely and sufficient maintenance of railway tracks is crucial; any failures can disrupt train services, leading to potential financial and human consequences¹⁷. Crack identification is very important to run the system rapidly and efficiently. In Pakistan, track inspection is currently conducted using a railway cart, where human specialists manually assess the track to identify areas requiring repairs. Recognizing the critical importance of track inspection, this study introduces and integrates an intelligent automated system for analyzing the condition of train tracks. In summary, the study offers the following contributions:

This study investigates the use of various machine learning and deep learning models for autonomously evaluating railway tracks, focusing on distinguishing between three distinct track conditions: wheel burn, superelevation, and standard track.
A significant dataset is also produced for studies with the acoustic signals from an ECM-X7BMP microphone that was collected over one year.
The Mel-frequency cepstrum coefficients (MFCC) and Constant-Q transform (CQT) characteristics of audio signals are combined with various classifiers to automatically detect track problems. Conditional GAN (CTGAN) is also used to create an equal amount of samples for each error.

The sections of this paper are grouped as follows: Section 2 provides a summary of several forms of fractures seen in railway tracks, as well as major studies on identifying defects in rail tracks. Section 3 describes data collecting procedures, equipment, and suggested study strategy. Section 4 presents the findings and comments, while Section 5 provides the conclusion.

Related work

Track inspection is an essential task that has been adopted periodically to control the conditions of rail tracks and avoid train accidents. Geometric inspection and structural inspection are two main classes for the inspections of rail tracks¹⁸. Structural checks are performed to detect structural faults such as wheel burn, superelevation, or other structural issues. Geometric inspections are used to identify geometric anomalies such as rail misalignment and other comparable degradation. Furthermore, geometric anomalies are caused by structural flaws, which can lead to train accidents. The authors explained various geometric and structural flaws in¹⁹.

Researchers worked on the detection of geometric defects with an SVM model in²⁰. The RAS problem-solving competition 2015 dataset was used for experimentation. The study considered some severe geometric defects which may increase the geometric defects. To detect structural defects, a structural inspection is performed using shallow machine learning methods in this study. SVM was used in this study which also worked on a novel parameter called positive and un-labeled learning performance (PULP). Moreover, PULP was applied to check the performance of models on different datasets comprising faulty results. In²¹, experimentation was performed to detect faults on railway tracks. Both Support SVM and CNN were utilized in this study for analyzing an image-based dataset. Rail fasteners are classified as missing, good, or broken. This technique showed improved accuracy in detecting defects in rail fasteners and ties.

The study²² investigated fault detection using traditional acoustic-based systems, enhancing performance and reducing train accidents through deep learning methodologies. Additionally, the research concentrated on LSTM, 2D convolutional, and 1D convolutional approaches. Various types of faults, such as wheel burn, superelevation, and normal tracks, were identified in this study. Experimental analysis was conducted on a real acoustic dataset to detect rail track faults.LSTM model shows improved results with 99.7% accuracy. In the study²³, local binary pattern (LLBP) was employed on railway images to classify track fasteners. Gabor filters²⁴, SVM²⁵, and edge detection²⁶ methods were utilized to identify fasteners in railway images. Faster Region-based CNN was utilized for detecting rail track faults in²⁷. CNN and ResNet-50 were applied in study²⁸ to detect structural defects and damages, particularly related to broken rail fasteners. The study utilized Haar-like feature sets, including geometric features for fasteners, achieving a 94% accuracy with CNN and 94.4% accuracy with ResNet-50. Additionally, various classification methods, including SVM, GNB, KNN, RF, Adaboost, and Gradient Boosting Decision Trees (GBDT), were tested and evaluated to detect and analyze missing clamps in the fastening structure in²⁹.

The categorization of railway cracks with acoustic-emission waves based on a multi-branch CNN is discussed in³⁰. The railway fastener defects are identified from images using CNN³¹, residual network³², GAN, faster region-CNN³³, and point cloud deep learning (PCDL)³⁴. Dynamic stiffness for rail pads was anticipated with machine learning techniques including KNN, multi-linear regression, regression tree, gradient boosting, RF, SVM, and MLP³⁵. Feature extraction approaches have also been investigated for rail track fault detection.

The study³⁶ introduced tree-based classification approaches such as RF and DT which performed a comparison of deep learning techniques for rail track inspections. The authors proposed a new RF-based approach that is used to combine LMD, TFD, and TD feature extraction for the detection of track slab deformation. In³⁷, an automated inspection technique based on IoT is presented for rail track fault detection. Acoustic data is used to rail track fault classification including wheel burn, crash sleeper, loose nuts, and bolts, low joint, creep, and point and crossing. The experimental results showed that acoustic data may successfully support selective track defects and localized these defects in real time. This method achieved a 98.4% accuracy with MLP.

Materials and methods

This section discusses dataset collection and strategy, feature extraction techniques, machine learning methods for classification, and the recommended approach.

Dataset collection

The dataset holds crucial importance in the automated identification of faulty railroad tracks. Illustrated in Figure 1, the mechanical cart provided by officials at the Rahim Yar Khan station of Pakistan Railways Khanpur district was utilized for collecting this dataset. A setup was arranged on-site at Khanpur’s train station to gather the necessary data. Positioned at a safe maximum distance of 1.75 inches from the point of contact between the wheel and the track, two microphones were installed. These microphones were affixed to the right and left sides of the cart for data collection purposes. The propulsion of the mechanical cart was facilitated by a generator, which operated at an average speed of 35 km/h to drive the cart’s engine.

The audio data collection did not specify the geographic location. Two ECM-X7BMP Unidirectional electric condenser microphones, equipped with 3-pole locking small plugs, were mounted on the left and right wheels of the railway cart. These microphones possess an output impedance of 1.2 k and a sensitivity of 44.0 3 dB. Additional specifications of the microphones are detailed in Table 1.

Table 1 Important parameters of Sony ECM-X7BMP microphone.

Subjects

Abstract

Similar content being viewed by others

Prognostics of unsupported railway sleepers and their severity diagnostics using machine learning

Automatic 3D railroad alignment detection using modified Hough transform

Railway infrastructure maintenance efficiency improvement using deep reinforcement learning integrated with digital twin based on track geometry and component defects

Introduction

Related work

Materials and methods

Dataset collection

Proposed methodology for track fault detection

Obtaining implementation code

Experiment details

Mel-Frequency Cepstral Coefficients(MFCC)

Constant-Q-transform

Composite travel generative adversarial network

Supervised machine learning models

Decision tree

Support vector machine

Random forest

Logistic regression

Naive Bayes

K Nearest Neighbors

Ensemble classifier

Deep learning models

Long short-term memory

Convolutional neural network

Recurrent neural network

Gated recurrent unit

Results and discussion

Results of machine learning classifiers using MFCC features

Results of machine learning classifiers using CQT features

Results of classifiers using hybrid features

Results of machine learning classifiers using MFCC features with CTGAN

Results of machine learning classifiers using CQT features with CTGAN

Results of machine learning classifiers using Hybrid features with CTGAN

Results of deep learning classifiers using MFCC and CQT features

Results of deep learning classifiers using MFCC and CQT with CTGAN

Results of deep learning classifiers using hybrid features and hybrid features with CTGAN

K-fold cross-validation results

Comparison With existing studies

Conclusions and future work

Data availability

Code availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethical approval

Consent to participate

Consent for publication

Conflicts of interest

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links