Abstract
In response to the monitoring needs of conveyor roller faults in open-pit mines, this paper proposes a fiber optic sensing roller fault diagnosis method based on dynamic model driving. The study first analyzed the coupling mechanism between fiber optic scattering light field and mechanical vibration, and constructed a non-invasive real-time monitoring system. To address the inherent vibration issues caused by equipment structural damage in open-pit mines under extreme weather conditions, a diagnostic method driven by dynamic models is proposed. By constructing a dynamic model of the conveyor system, the mathematical model of the vibration amplitude obtained from theoretical calculations is mixed and analyzed with the light converted vibration signals collected on site. Short time Fourier transform (STFT) is used for signal conversion, and Mel algorithm is used for preprocessing. By combining an improved GoogLeNet LSTM hybrid deep learning model, the preprocessed signals were trained for classification and recognition, and a database of roller fault features was successfully constructed. The experimental results show that the improved DY GougLeNet LSTM model has a testing accuracy of 96.7% in roller fault diagnosis, which is effectively improved compared to the basic model (CNN) with an accuracy of 83.7%. The relevant achievements provide reliable technical support for unmanned operation of open-pit mine conveying systems.
Introduction
Existing studies1,2 have demonstrated that mechanical faults are typically preceded by characteristic precursors in acoustic vibration signals. This study focuses on fault monitoring of conveyor idlers, establishing a non-invasive real-time monitoring system through mechanistic analysis of the coupling between fiber-optic scattering light fields and mechanical vibrations.
Depending on the scattering generation mechanisms3,4, optical wave propagation in fibers simultaneously produces three distinct scattering phenomena with different central frequencies: Rayleigh scattering, Brillouin scattering, and Raman scattering, as shown in Fig. 1. Rayleigh scattering, as an elastic scattering phenomenon, maintains identical frequency to the incident light while demonstrating the highest scattering intensity. Its distinct correlation with external vibrations has established Rayleigh scattering as the predominant mechanism for distributed vibration sensing applications.
In recent years, accurate and effective identification of fiber optic sensing events has attracted significant research attention globally5,6. Lin et al.7 proposed a recognition framework for fiber optic vibration signals based on wavelet packet Shannon entropy for feature extraction. Combined with a radial basis function neural network, their method achieved an average recognition rate of 82.67% for three vibration events: climbing, walking, and knocking. Ma et al.8 developed a classification algorithm using Mel-frequency cepstral coefficients (MFCCs) and support vector data description (SVDD), achieving 86.67% average accuracy for rain, trampling, and climbing events. Sun et al.9 introduced a Hilbert-Huang transform algorithm with complementary ensemble empirical mode decomposition, attaining 85% recognition accuracy for four typical fiber optic vibration signals. Xu et al.10 analyzed idler fault signals under locked, bearingless, and fractured conditions. Jia11 designed an idler fault detection system for major failure modes, while Roy et al.12 proposed a hybrid cepstrum method combining MFCCs and inverted MFCCs with long short-term memory (LSTM) networks to diagnose normal idlers, rolling element faults, and eccentric rotation faults. Although these methods significantly improve time efficiency in feature extraction, their limited capability to precisely characterize time–frequency information hinders effective differentiation of complex fiber optic vibration events.
To address the efficiency degradation caused by high-dimensional manual feature vectors, this study proposes a deep learning framework based on adaptive feature extraction. Leveraging signal two-dimensionalization preprocessing, we exploit GoogLeNet13,14,15—an optimized convolutional neural network (CNN) with multi-scale hierarchical architecture—to effectively capture both local and global spatial features while reducing computational complexity. The proposed method converts one-dimensional time-series signals from Mel-distributed fiber optic vibration monitoring systems into two-dimensional image representations, preserving complete temporal dependencies and signal integrity.
As demonstrated in prior studies16,17,18,19, GoogLeNet excels at adaptively extracting spatial correlation features through convolutional and pooling operations, whereas LSTM networks specialize in capturing temporal correlation features via precise memory cell control. Unlike conventional deep learning models, the integrated GoogLeNet-LSTM architecture jointly learns spatial–temporal patterns from two-dimensional inputs, enabling automated extraction of time-domain, frequency-domain, and spatial-domain characteristics for enhanced classification.
For practical validation, vibration signals were collected from belt conveyor idlers under typical failure scenarios in open-pit mines, including dust-induced overheating, bearing wear, and shaft fracture. After preprocessing with Mel filtering, the signals were used to train a GoogLeNet-LSTM classifier and establish an idler fault database. To address structural damage-induced intrinsic vibrations in harsh mining environments—where altered system stiffness and damping generate anomalous vibrations—we further propose a dynamics model-driven fault diagnosis method. This approach integrates a conveyor system dynamics model into the fiber optic vibration processing framework. Field-acquired optical vibration signals are compared with theoretically predicted values, and qualified data (within predefined dynamic amplitude thresholds) undergo short-time Fourier transform (STFT) to generate time–frequency images. These images are processed via Mel filtering and input into the trained database for final fault diagnosis, as detailed in Fig. 2.
Considering the differences in structure and load force in the conveyor conveying area, the conveyor is divided into zones 1 to N. Considering the special circumstances such as rain cover noise and fiber optic damage in the on-site working conditions, which may interfere with fault identification. To this end, the theoretical vibration signal amplitude M solved by the dynamic model will be compared with the actual measured S on site. When 0.8Sm < Mm < 1.2Sm is satisfied, it approximately proves that there are no obvious problems in the on-site measurement, and further signal feature processing and analysis can be carried out. Otherwise, the characteristic signals of the area will be extracted from the system and alerted to on-site workers for review through an alarm, as shown in Fig. 3.
Fiber optic vibration sensing-based fault location analysis in conveyor idlers
Assuming the frequency of the laser pulse signal is \(\nu\) with a pulse duration of W, the backward Rayleigh scattering light at the input end of the sensing fiber after propagating through the entire sensing fiber of length l can be expressed as20,21:
In the equation, \(a_{i}\) is the amplitude of the i-th scatterer, \(c\) represents the speed of light, \(\alpha\) represents the attenuation coefficient of the sensing fiber, \(\tau_{i}\) represents the total round-trip time of the i-th scatterer, and n represents the effective refractive index of the fiber, where22,23,24:
Based on Eq. (2), the optical power of the backward Rayleigh scattering light can be specifically calculated as:
In Eq. (3), \(\varphi_{i,j}\) denotes the phase difference between the i-th and j-th scattered light waves. The first term on the right-hand side (RHS) represents the summation of power contributions from all independent scattered light waves within the sensing fiber, which remains unaffected by the source frequency and external vibration events. The second term on the RHS corresponds to the interference-derived power from backward Rayleigh scattered waves within the pulse width. When the distance between the i-th scattering event and the input end of the sensing fiber is denoted as d, the following relationship holds25,26:
Therefore, the phase difference formed between the i-th and j-th scattered light waves can be calculated as:
When a vibration event with amplitude A acts on the sensing fiber, the strain effect and elasto-optic effect induce simultaneous changes in both the fiber length and refractive index at the affected position. Assuming the vibration occurs near the i-th and j-th scattering light waves, the phase difference \(\Phi_{i,j}\) between them can be expressed as27,28,29:
According to Eq. (6), the phase variation of backward Rayleigh scattered light is directly proportional to the amplitude of vibration events applied to the sensing fiber. Therefore, the specific vibration location can be demodulated through a differential processing algorithm.
Dynamic modeling of belt conveyor systems
Through holistic mechanical analysis of the conveyor system, it is evident that the idler frames are longitudinally distributed along the belt with intricate mass distribution and complex kinematic characteristics. Therefore, formulating dynamic equations for the conveyor system using Lagrangian equations proves particularly effective. The dynamic model of the conveyor system is illustrated in Fig. 4. The complete Lagrangian equations can generally be expressed as follows30,31,32,33,34:
where:
\(F_{j} (t)\) is the external excitation force, j = 1,2,3,…;
\(\partial x_{j}\) is the generalized displacement, j = 1,2,3,…;
\(\partial \dot{x}_{j}\) is the generalized velocity, j = 1,2,3,…;
\(T\) is the kinetic energy of the system;
\(V\) is the potential energy of the system;
\(D\) is the energy dissipation function of the system.
m1 ~ m4: Masses of the support frame, lower roller, left roller, and right roller, respectively (kg).
x1 ~ x4: Displacements of the support frame, lower roller, left roller, and right roller, respectively (m).
k11,c11 and k12,c12: Stiffness and damping coefficients between the left/right sides of the support frame and the ground, respectively.
k21,c21 and k22,c22: Stiffness and damping coefficients between the left/right sides of the lower roller and the support frame, respectively.
k31,c31 and k41,c41: Stiffness and damping coefficients between the lower roller and the left/right rollers, respectively.
k32,c32 and k42,c42: Stiffness and damping coefficients between the left/right rollers and the support frame, respectively.
J: Moment of inertia of the support frame (kg·m2/s2).
a: Horizontal displacement between the center of the support frame and the lower support point (m).
b: Thickness of the support frame (m).
c: Horizontal displacement from the center of the support frame (point O) to points B and D (m).
θ: Small angular displacement of the support frame (rad).
α: Angle between the left roller and the horizontal plane (rad).
β: Angle between the right roller and the horizontal plane (rad).
Ft1,Ft2: Left and right support forces acting on the support frame (N).
F2,F3,F4: Forces acting on the lower roller, left roller, and right roller, respectively (N).
Using the energy method, the kinetic energy of the conveyor system can be determined as:
In the equation,\(\dot{x}_{i} (i = 1\sim 10)\) represents the velocity parameters of each mass block.
During actual operation of the conveyor system, uneven material distribution—particularly at the chute discharge port—can induce significant lateral vibrations. Considering the relatively small vibration amplitude of the support frame, it is assumed that \(\sin \theta \approx \theta\). Consequently, the corresponding displacements and velocities at the two ends beneath the support are as follows:
According to the relative form of Newton’s second law, it can be concluded that:
Furthermore, based on the positional relationship between the left and right idlers, it can be deduced that their respective displacements and velocities are:
Signal feature extraction and recognition methodology
Research on two-dimensional feature extraction method based on Mel spectrogram
To enhance the effectiveness and real-time performance of feature vector construction in adaptive feature extraction for deep learning models, this paper proposes a two-dimensional transformation method based on Mel spectrograms. This approach enables deep learning models to efficiently extract time-domain, frequency-domain, and spatial-domain features from sensing signals in real-time.
By directly applying the Short-Time Fourier Transform (STFT) to a one-dimensional time-series sensing signal, its corresponding time–frequency spectrogram can be obtained. This spectrogram can then be directly fed into a two-dimensional convolutional neural network (2D-CNN) for recognition and classification. If the sensing signal acquired by a distributed optical fiber vibration sensing (DOVS) system is denoted as \(x(t)\), its STFT-processed spectrum \(y(f,\tau )\) can be expressed as35,36,37:
In the equation, \(h(t - \tau )\) represents the Hamming window. When the length of the Hamming window function is set to 2048 with a step size of 1024, performing the Short-Time Fourier Transform (STFT) directly on the sensing signal yields a two-dimensional time–frequency spectrum of dimensions 1025 × 28 × 3. According to the classification principles of deep learning models, directly inputting this 1025 × 28 × 3 two-dimensional time–frequency spectrum into a 2D convolutional neural network (CNN) for processing would significantly degrade the training efficiency and computational performance of recognition and classification tasks, especially under standard host computer configurations with limited.
To effectively address the issue of excessive dimensionality in the time–frequency diagrams generated by the STFT of sensing signals, this paper employs the Mel time–frequency spectrum method to accurately characterize the frequency energy distribution of the sensing signals across different time scales. This approach not only reduces the dimensionality of the time–frequency representation but also preserves the salient time–frequency features of the original signal38,39.
First, the Short-Time Fourier Transform (STFT) is performed on the sensing signal to obtain its corresponding time–frequency spectrum. Subsequently, this spectrum is multiplied by a predefined Mel filter bank to derive a reduced time–frequency spectrum matrix. Finally, a logarithmic operation is applied to the reduced matrix to obtain the final Mel time–frequency spectrum matrix40, as illustrated in Fig. 5.
If the frequency of the sensing signal in the Mel scale is denoted as \(f_{Mel}\),its corresponding relationship with the linear frequency f is expressed as:
According to the conversion principle of the Mel time–frequency spectrum, the Mel spectrum is generated by applying the Mel filter bank to amplify the low-to-mid frequency components and attenuate the mid-to-high frequency components in the original time–frequency representation. This ensures the preservation of critical time–frequency information while significantly reducing dimensionality. If the Mel filter bank contains M equal-height triangular filter functions, denoted as \(H_{Mel} (f)\), then:
In the equation,\(f_{c} (m)\) represents the center frequency of the m-th equal-height triangular filter, where \(1 \le m \le M\). When M = 28 is set, the resulting Mel time–frequency spectrum after the transformation has dimensions of 28 × 28 × 3. Therefore, compared to the time–frequency spectrum obtained via STFT, the Mel spectrum significantly enhances both the training efficiency and recognition performance of deep learning models.
Construction of a hybrid deep learning model based on GoogLeNet-LSTM
The GoogLeNet architecture proposed in this paper draws inspiration from Inception v3, incorporating three types of convolutional kernels: 1 × 1, 5 × 5, and 7 × 7. Specifically, the 5 × 5 kernel is decomposed into two cascaded 3 × 3 kernels, and the 7 × 7 kernel is implemented using three stacked 3 × 3 kernels. To further reduce model parameters, each 3 × 3 convolutional layer is itself factorized into a combination of two 1 × 3 kernels and one 3 × 1 kernel41,42. The final design stacks ten such Inception modules, omits the conventional auxiliary classifier components, retains average pooling functionality43,44,45, and forms the complete GoogLeNet architecture as illustrated in Fig. 6.
Building on the adaptive feature extraction capabilities of GoogLeNet and LSTM neural networks in signal processing, this paper proposes an enhanced GoogLeNet-LSTM hybrid deep learning model specifically designed for distributed optical fiber vibration sensing systems46,47,48. This architecture enables adaptive simultaneous extraction of both spatial and temporal features embedded in sensing signals49,50, as illustrated in Fig. 7 below.
Data acquisition and experimental validation
A field-deployed experimental test platform was established, where a fiber-optic vibration monitoring host is housed within the laboratory. Vibration-sensitive optical fibers are deployed bilaterally along the conveyor line, enabling the host to acquire substantial field-operational acoustic-vibration data. This data is subsequently uploaded to a mobile server, where embedded algorithmic modules perform real-time feature extraction and pattern recognition on the acquired data, as illustrated in Fig. 8.
During on-site data collection, four types of support rollers were selected and installed in different preset positions for data collection and training, as shown in Fig. 9.
As shown in Fig. 10(a–d), the time-domain waveform plots obtained through on-site vibration optical fiber data collection and processing demonstrate distinct characteristics across roller conditions: normal support rollers exhibit stable waveform fluctuations with amplitudes near 0.04mm; Dust-induced overheating rollers show localized energy surges featuring periodic concentrations peaking near 0.06mm; Bearing-worn rollers demonstrate shaft-drum separation phenomena with more pronounced periodic fluctuations than overheated rollers, reaching maximum amplitudes near 0.08mm; While shaft-fractured rollers manifest periodic energy oscillations with peak vibration amplitudes near 0.15mm.
As illustrated in Fig. 11(a–d), the time–frequency images derived from STFT and Mel transformation of four support roller types reveal distinct signatures: normal rollers concentrate energy primarily in low-frequency ranges with negligible high-frequency fluctuations; dust-induced overheating rollers exhibit broadband energy fluctuations during 9–10s intervals with significant energy accumulation near 600Hz; bearing-worn rollers demonstrate peak energy within the 300–800Hz frequency band accompanied by a secondary 1500Hz energy zone; while shaft-fractured rollers display pervasive broadband oscillations throughout 0 ~ 8s periods, exhibiting maximum energy intensity centered near 600Hz.
The experimental configuration is set as follows: sampling rate at 16 kHz, STFT feature extraction parameters optimized via orthogonal experiments (Hanning window length 1024 points, overlap 512 points, FFT points 1024, Mel filter bank 40 channels, retaining the first 13 cepstral coefficients), training employs the Adam optimizer (initial learning rate 0.001 with cosine decay strategy), 40 training epochs, batch size 64, and cross-validation performed every 30 iterations. Data preprocessing includes Z-score normalization and random sample shuffling to ensure model generalization capability.
As shown in Figs. 12, 13, 14, 15, the training iteration graphs of three models—CNN, LSTM, GoogLeNet-LSTM, and the Dynamically Driven GoogLeNet-LSTM (DY-GoogLeNet-LSTM)—demonstrate distinct performance characteristics. The CNN algorithm shows slow improvement in iteration accuracy, stabilizing only around the 15th iteration round despite significant fluctuations in the training set. The LSTM model achieves stabilization by the 10th iteration round, yet its training set consistently underperforms the validation set. The GoogLeNet-LSTM model stabilizes at the 3rd iteration round while maintaining a training set that consistently outperforms the validation set. The DY-GoogLeNet-LSTM model stabilizes by the 5th iteration round with its training set consistently approaching the validation set’s performance.
Figure 16 presents post-test confusion matrices, revealing substantial improvements: the LSTM model’s accuracy rises from 83.7% to 85.7%, the enhanced GoogLeNet-LSTM model reaches 95.3%, and the further optimized DY-GoogLeNet-LSTM achieves 96.7% accuracy. These results validate the effectiveness of the improved model in acoustic vibration fault detection, providing a reliable technical solution for conveyor fault diagnosis.
Conclusions
Based on theoretical research, model construction, and experimental validation, this study forms the following conclusions regarding the proposed fiber-optic sensing fault diagnosis method integrating dynamics model-driven and improved hybrid deep learning for fault identification of open-pit mine conveyor rollers under extreme operating conditions:
(1) Normal rollers exhibit stable vibration amplitudes of 0.04 mm, whereas dust-induced overheating, bearing-worn, and axle fracture faults show peak amplitudes increasing to 0.06 mm, 0.08 mm, and 0.15 mm respectively, with axle fracture amplitude reaching 3.75 times that of normal conditions.
(2) Normal rollers concentrate energy in low-frequency bands, while axle fractures demonstrate broadband energy oscillations across 0–8 s with significant energy density intensification near 600 Hz. Bearing-worn states exhibit characteristic energy bands within 300 ~ 800 Hz and 1500 Hz ranges, and dust-induced overheating displays transient broadband energy aggregation during 9–10 s intervals.
(3) The proposed DY-GoogLeNet-LSTM model achieves a test accuracy of 96.7%, outperforming the baseline CNN model (83.7%) by 13%. By leveraging dynamics model-driven strategies, it effectively suppresses the fluctuation error observed in the CNN model at the 30th training iteration, enabling stable and efficient fault feature extraction and classification.
Data availability
The datasets used and analysed during the current study available from the corresponding author on reasonable request will be made available on request.
References
Chen, N. N. et al. Scraper conveyor shape sensing technology based on orthogonal optical fiber strain. Meas. Sci. Technol. 34(12), 125103 (2023).
Hou, Y. R. et al. Belt conveyor speed detection based on fiber-optic Sagnac interferometer vibration sensor. Laser Phy. 34(3), 035102 (2024).
Xie, M. et al. Research on roller monitoring technology based on distributed fiber optic sensing system. Sci. Rep. 14(1), 10048 (2024).
Wu, H. J. et al. Signal processing in smart fiber-optic distributed acoustic sensor. Acta Opt. Sin. 44(1), 0106009 (2024).
Zeng, M. Y. et al. Reconstruction of vehicle-induced vibration on concrete pavement using distributed fiber optic. IEEE Trans. Intell. Transp. Syst. 23(12), 24305–24317 (2022).
Du, S. B. et al. A pipeline inspection gauge positioning method based on distributed fiber optic vibration sensing. IEEE Sens. J. 24(24), 40855–40865 (2024).
Lin, X. H. et al. Advances in intelligent identification of fiber-optic vibration signals in oil and gas pipelines. J. Pipeline Sci. Eng. 4(4), 100184 (2024).
Ma, D. Y. et al. Advances in machine-learning techniques for distributed fiber-optic sensing performance enhancement. Laser Optoelectron. Prog. 62(3), 0300002 (2025).
Sun, Z. S. et al. Intelligent sensing analysis using Mel-time-frequency-imaging and deep learning for distributed fiber-optic vibration detection. IEEE Sens. J. 22(22), 21933–21941 (2022).
Xu, H. Y., Kou, Q. K. & Feng, X. Y. Fiber optic vibration signal recognition based on IMFCC and CDET. IEEE Sens. J. 24(19), 30490–30499 (2024).
Jia, B. Y. et al. Online sensing method for transmission line conductor ice cover based on fiber optic sensing information fusion and continuous wavelet decomposition. Opt. Quantum Electron. 56(9), 1418 (2024).
Roy, S. S. et al. Multimodel Phishing URL Detection Using LSTM, Bidirectional LSTM, and GRU Models. Future Internet 14(11), 340 (2022).
Al-Qudah, S. & Yang, M. J. Effective hybrid structure health monitoring through parametric study of GoogLeNet. Ai 5(3), 1558–1574 (2024).
Suder, J. & Marciniak, T. Foreign object debris detection system using GoogLeNet. Prz. Elektrotech. 99(11), 249–252 (2023).
Sunnetci, K. M. et al. Face mask detection using GoogLeNet CNN-based SVM classifiers. Gazi Univ. J. Sci. 36(2), 645–658 (2023).
Kaya, Y. et al. A new approach to COVID-19 detection from x-ray images using angle transformation with GoogleNet and LSTM. Meas. Sci. Technol. 33(12), 124011 (2022).
Jang, S. J. & Jang, S. Y. Short-term crack in sewer forecasting method based on CNN-LSTM hybrid neural network model. J. Korean Geosynth. Soc. 21(2), 11–19 (2022).
Salini, R., Latha, G. & Khilar, R. Plant disease detection with modified deep joint segmentation and combined GoogleNet-IRNN. J. Phytopathol. 172(3), e13313 (2024).
Reddy, B. R. & Kumar, R. L. Classification of health care products using hybrid CNN-LSTM model. Soft Comput. 27(13), 9199–9216 (2023).
Nakamura, A. et al. Inter-modal interferometry using two-mode region of single-mode fibers for vibration sensing. J. Lightwave Technol. 42(18), 6431–6436 (2024).
Jiang, X. et al. GNSS-over-fiber sensing system for high precision 3D nodal displacement and vibration detection. IEEE Photonics Technol. Lett. 35(8), 402–405 (2023).
Zhang, C. L. & Wang, P. Fuzzy logic system assisted sensing resource allocation for optical fiber sensing and communication integrated network. Sensors 22(20), 7708 (2022).
Gui, X. et al. Distributed optical fiber sensing and applications based on large-scale fiber bragg grating array: Review. J. Lightwave Technol. 41(13), 4187–4200 (2023).
Jin, R. et al. Study on dual-wavelength wide-spectrum erbium-doped fiber ring laser for optical fiber interferometric sensing system. J. Lightwave Technol. 41(17), 5796–5804 (2023).
Liu, L. et al. Multimode fibre tapers based in-line Mach-Zehnder fibre-optic interferometer for curvature and vibration sensing. J. Mod. Opt. 71(16–18), 581–591 (2024).
Zhao, Y. D. et al. Distributed optical fiber vibration sensing system with high spatial resolution and large bandwidth. Acta Opt. Sin. 42(19), 1906004 (2022).
Zhang, B. et al. Study on the anti-vibration performance of a metal tube armored fiber grating sensing probe. Opt. Fiber Technol. 68, 102826 (2022).
Travers, P. et al. Distributed strain sensing inside a fiber coil under vibration. J. Lightwave Technol. 40(18), 6280–6287 (2022).
Ge, Z. et al. High-accuracy event classification of distributed optical fiber vibration sensing based on time-space analysis. Sensors 22(5), 2053 (2022).
Sun, X. X., Xiao, H. & Meng, W. J. Dynamics modeling and simulation of steel cord conveyor belt based on dynamic elastic modulus. Proc. Inst. Mech. Eng. Part C-J. Mech. Eng. Sci. 238(14), 6910–6922 (2024).
Tupkar, R. et al. Optimizing belt tension and stretch dynamics: A modeling approach for medium-duty conveyor systems. Eng. Res. Express 7(2), 025413 (2025).
Hao, N. N. et al. Research on dynamic characteristics of the turnover system of rail belt conveyor. Meas. Sci. Technol. 35(2), 025116 (2024).
Marasova, D., Andrejiova, M. & Grincova, A. Dynamic model of impact energy absorption by a conveyor belt in interaction with the support system. Energies 15(1), 64 (2022).
Yang, C. Y. et al. Low-order dynamical model and distributed coordinated model predictive control for multi-stage belt conveyor systems. J. Process Control 124, 83–91 (2023).
Xiong, W. H. et al. Multichannel feature fusion network-based technique for heart sound signal classification and recognition. Expert Syst. Appl. 273, 126839 (2025).
Liu, S. et al. Study on weak sound signal separation and pattern recognition under strong background noise in marine engineering. J. Low Freq. Noise Vib. Act. Control 43(2), 595–608 (2024).
Chen, X. & Zeng, Z. Y. Bird sound recognition based on adaptive frequency cepstral coefficient and improved support vector machine using a hunter-prey optimizer. Math. Biosci. Eng. 20(11), 19438–19453 (2023).
Ahsan, M. & Salah, M. M. Similarity index of the STFT-based health diagnosis of variable speed rotating machines. Intell. Syst. Appl. 20, 200270 (2023).
Peng, P. Z. et al. Seizure prediction in EEG signals using STFT and domain adaptation. Front Neurosci. 15, 825434 (2022).
Liu, M. R. et al. A sound source localization method based on multi-scale cross-STFT complex-valued convolutional neural network. Sens. Rev. 45(3), 374–386 (2025).
Liang, B. S. et al. F-LSTM: FPGA-Based Heterogeneous Computing Framework for Deploying LSTM-Based Algorithms. Electronics 12(5), 1139 (2023).
Dang, Z. Y. et al. CA-LSTM: An improved LSTM trajectory prediction method based on infrared UAV target detection. Electronics 12(19), 4081 (2023).
Sun, J. Q. et al. AC-LSTM: Anomaly state perception of infrared point targets based on CNN plus LSTM. Remote Sens. 14(13), 3221 (2022).
Mishra, A. K., Kumar, R. & Bal, D. P. ESG volatility prediction using GARCH and LSTM models. Financ. Internet Q. 19(4), 97–114 (2023).
Alshingiti, Z. et al. A deep learning-based phishing detection system using CNN, LSTM, and LSTM-CNN. Electronics 12(1), 232 (2023).
Alsumaidaee, Y. et al. Detection of corona faults in switchgear by using 1D-CNN, LSTM, and 1D-CNN-LSTM methods. Sensors 23(6), 3108 (2023).
Ryu, G. A. et al. SCE-LSTM: Sparse critical event-driven LSTM model with selective memorization for agricultural time-series prediction. Agriculture 13(11), 2044 (2023).
Wei, W. et al. LSTM-SN: Complex text classifying with LSTM fusion social network. J Supercomput. 79(9), 9558–9583 (2023).
Patel, N. et al. F-LSTM: Federated learning-based LSTM framework for cryptocurrency price prediction. Electron. Res. Arch. 31(10), 6525–6551 (2023).
Murugesan, R., Mishra, E. & Krishnan, A. H. Forecasting agricultural commodities prices using deep learning-based models: Basic LSTM, bi-LSTM, stacked LSTM, CNN LSTM, and convolutional LSTM. Int. J. Sustain. Agric. Manag. Inform. 8(3), 242–277 (2022).
Acknowledgements
The authors would like to thank Letpub for the English language editing.
Funding
This work were supported by “Tianchi Talent” Introduction Plan、Postdoctoral Program: 410713 and Natural Science Foundation of Xinjiang Autonomous Region 2024D01A101、2024D01A102.
Author information
Authors and Affiliations
Contributions
Hongyu Zhang: Writing—original draft, Methodology, Investigation,Conceptualization. Zhiyong Yang: Writing—review & editing, Validation, Supervision, Methodology, Conceptualization. Fuli Wu:Validation, Methodology. Shihong Liang: Writing—review & edit-ing, Validation, Supervision, Methodology, Conceptualization. Guilin Hu: Writing—review & editing,Validation, Supervision, Conceptualization. Defeng Hou: Supervision, Methodology, Conceptualization. Wenyu Ding and Fucheng Yang: Writingreview & editing, Validation.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Zhang, H., Yang, Z., Wu, F. et al. Research on fault diagnosis method of fiber optic sensing roller driven by dynamic model. Sci Rep 15, 37146 (2025). https://doi.org/10.1038/s41598-025-21122-2
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-21122-2















