A foreign object detection dataset and network for electrified railway catenary systems

Li, Fengqi; Cao, Jinhao; Yang, Haolin; Diao, Li; Wang, Yanhua; Yu, Shiyu; Yan, Xiaohong; Xu, Fengqiang

doi:10.1038/s41598-026-39129-8

Download PDF

Article
Open access
Published: 14 February 2026

A foreign object detection dataset and network for electrified railway catenary systems

Fengqi Li¹,
Jinhao Cao¹,
Haolin Yang¹,
Li Diao¹,
Yanhua Wang¹,
Shiyu Yu¹,
Xiaohong Yan¹ &
…
Fengqiang Xu¹

Scientific Reports , Article number: (2026) Cite this article

73 Accesses
Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

Abstract

Intrusion of foreign objects into the Electrified Railway Catenary System can lead to power failures, train service interruptions, and even casualties, making accurate detection essential for safe operation. Due to the scarcity of railway datasets, this study constructs a Railway Catenary Foreign Object Dataset to support model training and evaluation. Existing detection methods often struggle with complex railway environments, diverse object morphologies, and varying scales. To address these challenges, we propose a Railway Catenary Foreign Object Detection Network. It leverages the hierarchical architecture and window-based attention mechanism of Swin Transformer for multi-scale semantic feature extraction and global relational modeling, effectively distinguishing foreground from background. A Multi-branch Fusion Feature Pyramid Network is designed to deeply fuse low- and high-level features across scales, improving detection of objects of different sizes. Additionally, a Regional Receptive Field-Enhanced Edge Module expands the receptive field and enhances edge extraction for elongated foreign objects. Extensive experiments on the constructed dataset demonstrate the effectiveness of the proposed approach, achieving an Average Precision of 60.2%, with 53.8% for small object detection.

RailFOD23: A dataset for foreign object detection on railroad transmission lines

Article Open access 16 January 2024

Multi-scale attention network (MSAN) for track circuits fault diagnosis

Article Open access 17 April 2024

A cost effective real time rail track monitoring system leveraging multi sensor fusion and multi objective optimization

Article Open access 28 August 2025

Data availability

The constructed Railway Catenary Foreign Object Dataset is available on GitHub at https://github.com/dashgfuidng/CPSSFOdataset.git

References

Bocciolone, M., Bucca, G., Collina, A. & Comolli, L. Pantograph–catenary monitoring by means of fibre bragg grating sensors: Results from tests in an underground line. Mech. Syst. Signal Process. 41, 226–238. https://doi.org/10.1016/j.ymssp.2013.06.030 (2013).
Google Scholar
Hofler, H., Dambacher, M., Dimopoulos, N. & Jetter, V. Monitoring and inspecting overhead wires and supporting structures. In IEEE Intelligent Vehicles Symposium512–517, 2004. https://doi.org/10.1109/IVS.2004.1336436 (IEEE 2004).
Cao, Z. et al. Railway intrusion detection based on machine vision: A survey, challenges, and perspectives. IEEE Trans. Intell. Transp. Syst. 25, 6427–6448 (2024).
Google Scholar
Wu, X., Yuan, P., Peng, Q., Ngo, C.-W. & He, J.-Y. Detection of bird nests in overhead catenary system images for high-speed rail. Pattern Recogn. 51, 242–254. https://doi.org/10.1016/j.patcog.2015.09.010 (2016).
Google Scholar
Qiu, S. et al. Automated detection of railway defective fasteners based on yolov8-fam and synthetic data using style transfer. Autom. Constr. 162, 105363. https://doi.org/10.1016/j.autcon.2024.105363 (2024).
Google Scholar
Bai, T., Yang, J., Xu, G. & Yao, D. An optimized railway fastener detection method based on modified faster r-cnn. Measurement 182, 109742. https://doi.org/10.1016/j.measurement.2021.109742 (2021).
Google Scholar
Lian, L. et al. Rvsa-3d: voxel-based fully sparse attention 3d object detection for rail transit obstacle perception. Pattern Recogn. 112324 (2025).
Lian, L. et al. Rae3d: Multiscale aggregation-enhanced 3d object detection for rail transit obstacle perception. IEEE Trans. Ind. Inf. (2025).
Chen, Z. et al. Foreign object detection method for railway catenary based on a scarce image generation model and lightweight perception architecture. IEEE Trans. Circ. Syst. Video Technol. (2025).
Vaswani, A. et al. Attention is all you need. In Guyon, I. et al. (eds.) Advances in Neural Information Processing Systems, vol. 30 (Curran Associates, Inc., 2017).
Dosovitskiy, A. et al. An image is worth 16x16 words: Transformers for image recognition at scale. In International Conference on Learning Representations (2021).
Zhao, Z., Kang, J., Sun, Z., Ye, T. & Wu, B. A real-time and high-accuracy railway obstacle detection method using lightweight cnn and improved transformer. Measurement 238, 115380. https://doi.org/10.1016/j.measurement.2024.115380 (2024).
Google Scholar
Liu, Z. et al. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 10012–10022 (2021).
Chen, Z., Yang, J., Feng, Z. & Zhu, H. Railfod23: A dataset for foreign object detection on railroad transmission lines. Sci. Data 11, 72 (2024).
Google Scholar
Zendel, O. et al. Railsem19: A dataset for semantic rail scene understanding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 0–0 (2019).
Yue, Z., Xie, J., Zhao, Q. & Meng, D. Semi-supervised video deraining with dynamical rain generator. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 642–652, https://doi.org/10.1109/CVPR46437.2021.00070 (2021).
DeVries, T. Improved regularization of convolutional neural networks with cutout. arXiv preprint arXiv:1708.04552 (2017).
Cai, Z. & Vasconcelos, N. Cascade r-cnn: Delving into high quality object detection. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 6154–6162. https://doi.org/10.1109/CVPR.2018.00644 (2018).
Wang, P. et al. Understanding convolution for semantic segmentation. In 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), 1451–1460, https://doi.org/10.1109/WACV.2018.00163 (2018).
Lin, T.-Y. et al. Feature pyramid networks for object detection. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 936–944, https://doi.org/10.1109/CVPR.2017.106 (2017).
Carion, N. et al. End-to-end object detection with transformers. In European conference on computer vision, 213–229 (Springer, 2020).
Liu, S. et al. Dab-detr: Dynamic anchor boxes are better queries for detr. In The Twelfth International Conference on Learning Representations(ICLR) (2022).
SZhang, H. et al. Dino: Detr with improved denoising anchor boxes for end-to-end object detection. In The Twelfth International Conference on Learning Representations(ICLR) (2023).
Sun, P. et al. Sparse r-cnn: End-to-end object detection with learnable proposals. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 14449–14458, https://doi.org/10.1109/CVPR46437.2021.01422 (2021).
Ge, Z., Liu, S., Wang, F., Li, Z. & Sun, J. Yolox: Exceeding yolo series in 2021. ArXiv abs/2107.08430 (2021).
Varghese, R. & M., S. Yolov8: A novel object detection algorithm with enhanced performance and robustness. In 2024 International Conference on Advances in Data Engineering and Intelligent Computing Systems (ADICS), 1–6, https://doi.org/10.1109/ADICS58448.2024.10533619 (2024).
Chen, S., Sun, P., Song, Y. & Luo, P. Diffusiondet: Diffusion model for object detection. In 2023 IEEE/CVF International Conference on Computer Vision (ICCV), 19773–19786, https://doi.org/10.1109/ICCV51070.2023.01816 (2023).
Wang, A. et al. Yolov10: Real-time end-to-end object detection. In NeurIPS 2024 (2024).
Lv, Y., Chen, W., Chen, B., Zhang, Z. & Wang, J. Rt-detr: Detrs beat yolos on real-time object detection. In Proc. of the IEEE/CVF Int. Conf. on Computer Vision Workshops (ICCVW), 2420–2430 (2023).

Download references

Acknowledgements

Thanks for the hard work of the editors and the constructive suggestions of the anonymous reviewers.

Funding

This research was supported in part by the National Natural Science Foundation of China (No.62276042), by the Artificial Intelligence Scientific and Technological Innovation Program of Liaoning Province (No.2023JH26/10100008), and by the Science and Technology Plan Joint Program of Liaoning Province (No.2025-MSLH-138, No.2025 BSLH-100, No.2025-BSLH-101).

Author information

Authors and Affiliations

School of Railway Intelligent Engineering, Dalian Jiaotong University, Dalian, 116028, China
Fengqi Li, Jinhao Cao, Haolin Yang, Li Diao, Yanhua Wang, Shiyu Yu, Xiaohong Yan & Fengqiang Xu

Authors

Fengqi Li
View author publications
Search author on:PubMed Google Scholar
Jinhao Cao
View author publications
Search author on:PubMed Google Scholar
Haolin Yang
View author publications
Search author on:PubMed Google Scholar
Li Diao
View author publications
Search author on:PubMed Google Scholar
Yanhua Wang
View author publications
Search author on:PubMed Google Scholar
Shiyu Yu
View author publications
Search author on:PubMed Google Scholar
Xiaohong Yan
View author publications
Search author on:PubMed Google Scholar
Fengqiang Xu
View author publications
Search author on:PubMed Google Scholar

Contributions

Li contributed to the conceptualization, methodology, project administration, and writing—original draft. Cao was responsible for investigation, methodology, software, visualization, and writing—original draft. Yang contributed to data curation, software, and visualization. Diao performed formal analysis, investigation, and validation. Wang contributed to formal analysis, resources, and validation. Yu was responsible for data curation, resources, and writing—review & editing. Yan contributed to data curation and writing—review & editing. Xu provided funding acquisition, supervision, and writing—review & editing. All authors reviewed the manuscript.

Corresponding author

Correspondence to Fengqiang Xu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Li, F., Cao, J., Yang, H. et al. A foreign object detection dataset and network for electrified railway catenary systems. Sci Rep (2026). https://doi.org/10.1038/s41598-026-39129-8

Download citation

Received: 10 November 2025
Accepted: 03 February 2026
Published: 14 February 2026
DOI: https://doi.org/10.1038/s41598-026-39129-8