Abstract
Intrusion of foreign objects into the Electrified Railway Catenary System can lead to power failures, train service interruptions, and even casualties, making accurate detection essential for safe operation. Due to the scarcity of railway datasets, this study constructs a Railway Catenary Foreign Object Dataset to support model training and evaluation. Existing detection methods often struggle with complex railway environments, diverse object morphologies, and varying scales. To address these challenges, we propose a Railway Catenary Foreign Object Detection Network. It leverages the hierarchical architecture and window-based attention mechanism of Swin Transformer for multi-scale semantic feature extraction and global relational modeling, effectively distinguishing foreground from background. A Multi-branch Fusion Feature Pyramid Network is designed to deeply fuse low- and high-level features across scales, improving detection of objects of different sizes. Additionally, a Regional Receptive Field-Enhanced Edge Module expands the receptive field and enhances edge extraction for elongated foreign objects. Extensive experiments on the constructed dataset demonstrate the effectiveness of the proposed approach, achieving an Average Precision of 60.2%, with 53.8% for small object detection.
Similar content being viewed by others
Data availability
The constructed Railway Catenary Foreign Object Dataset is available on GitHub at https://github.com/dashgfuidng/CPSSFOdataset.git
References
Bocciolone, M., Bucca, G., Collina, A. & Comolli, L. Pantograph–catenary monitoring by means of fibre bragg grating sensors: Results from tests in an underground line. Mech. Syst. Signal Process. 41, 226–238. https://doi.org/10.1016/j.ymssp.2013.06.030 (2013).
Hofler, H., Dambacher, M., Dimopoulos, N. & Jetter, V. Monitoring and inspecting overhead wires and supporting structures. In IEEE Intelligent Vehicles Symposium512–517, 2004. https://doi.org/10.1109/IVS.2004.1336436 (IEEE 2004).
Cao, Z. et al. Railway intrusion detection based on machine vision: A survey, challenges, and perspectives. IEEE Trans. Intell. Transp. Syst. 25, 6427–6448 (2024).
Wu, X., Yuan, P., Peng, Q., Ngo, C.-W. & He, J.-Y. Detection of bird nests in overhead catenary system images for high-speed rail. Pattern Recogn. 51, 242–254. https://doi.org/10.1016/j.patcog.2015.09.010 (2016).
Qiu, S. et al. Automated detection of railway defective fasteners based on yolov8-fam and synthetic data using style transfer. Autom. Constr. 162, 105363. https://doi.org/10.1016/j.autcon.2024.105363 (2024).
Bai, T., Yang, J., Xu, G. & Yao, D. An optimized railway fastener detection method based on modified faster r-cnn. Measurement 182, 109742. https://doi.org/10.1016/j.measurement.2021.109742 (2021).
Lian, L. et al. Rvsa-3d: voxel-based fully sparse attention 3d object detection for rail transit obstacle perception. Pattern Recogn. 112324 (2025).
Lian, L. et al. Rae3d: Multiscale aggregation-enhanced 3d object detection for rail transit obstacle perception. IEEE Trans. Ind. Inf. (2025).
Chen, Z. et al. Foreign object detection method for railway catenary based on a scarce image generation model and lightweight perception architecture. IEEE Trans. Circ. Syst. Video Technol. (2025).
Vaswani, A. et al. Attention is all you need. In Guyon, I. et al. (eds.) Advances in Neural Information Processing Systems, vol. 30 (Curran Associates, Inc., 2017).
Dosovitskiy, A. et al. An image is worth 16x16 words: Transformers for image recognition at scale. In International Conference on Learning Representations (2021).
Zhao, Z., Kang, J., Sun, Z., Ye, T. & Wu, B. A real-time and high-accuracy railway obstacle detection method using lightweight cnn and improved transformer. Measurement 238, 115380. https://doi.org/10.1016/j.measurement.2024.115380 (2024).
Liu, Z. et al. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 10012–10022 (2021).
Chen, Z., Yang, J., Feng, Z. & Zhu, H. Railfod23: A dataset for foreign object detection on railroad transmission lines. Sci. Data 11, 72 (2024).
Zendel, O. et al. Railsem19: A dataset for semantic rail scene understanding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 0–0 (2019).
Yue, Z., Xie, J., Zhao, Q. & Meng, D. Semi-supervised video deraining with dynamical rain generator. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 642–652, https://doi.org/10.1109/CVPR46437.2021.00070 (2021).
DeVries, T. Improved regularization of convolutional neural networks with cutout. arXiv preprint arXiv:1708.04552 (2017).
Cai, Z. & Vasconcelos, N. Cascade r-cnn: Delving into high quality object detection. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 6154–6162. https://doi.org/10.1109/CVPR.2018.00644 (2018).
Wang, P. et al. Understanding convolution for semantic segmentation. In 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), 1451–1460, https://doi.org/10.1109/WACV.2018.00163 (2018).
Lin, T.-Y. et al. Feature pyramid networks for object detection. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 936–944, https://doi.org/10.1109/CVPR.2017.106 (2017).
Carion, N. et al. End-to-end object detection with transformers. In European conference on computer vision, 213–229 (Springer, 2020).
Liu, S. et al. Dab-detr: Dynamic anchor boxes are better queries for detr. In The Twelfth International Conference on Learning Representations(ICLR) (2022).
SZhang, H. et al. Dino: Detr with improved denoising anchor boxes for end-to-end object detection. In The Twelfth International Conference on Learning Representations(ICLR) (2023).
Sun, P. et al. Sparse r-cnn: End-to-end object detection with learnable proposals. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 14449–14458, https://doi.org/10.1109/CVPR46437.2021.01422 (2021).
Ge, Z., Liu, S., Wang, F., Li, Z. & Sun, J. Yolox: Exceeding yolo series in 2021. ArXiv abs/2107.08430 (2021).
Varghese, R. & M., S. Yolov8: A novel object detection algorithm with enhanced performance and robustness. In 2024 International Conference on Advances in Data Engineering and Intelligent Computing Systems (ADICS), 1–6, https://doi.org/10.1109/ADICS58448.2024.10533619 (2024).
Chen, S., Sun, P., Song, Y. & Luo, P. Diffusiondet: Diffusion model for object detection. In 2023 IEEE/CVF International Conference on Computer Vision (ICCV), 19773–19786, https://doi.org/10.1109/ICCV51070.2023.01816 (2023).
Wang, A. et al. Yolov10: Real-time end-to-end object detection. In NeurIPS 2024 (2024).
Lv, Y., Chen, W., Chen, B., Zhang, Z. & Wang, J. Rt-detr: Detrs beat yolos on real-time object detection. In Proc. of the IEEE/CVF Int. Conf. on Computer Vision Workshops (ICCVW), 2420–2430 (2023).
Acknowledgements
Thanks for the hard work of the editors and the constructive suggestions of the anonymous reviewers.
Funding
This research was supported in part by the National Natural Science Foundation of China (No.62276042), by the Artificial Intelligence Scientific and Technological Innovation Program of Liaoning Province (No.2023JH26/10100008), and by the Science and Technology Plan Joint Program of Liaoning Province (No.2025-MSLH-138, No.2025 BSLH-100, No.2025-BSLH-101).
Author information
Authors and Affiliations
Contributions
Li contributed to the conceptualization, methodology, project administration, and writing—original draft. Cao was responsible for investigation, methodology, software, visualization, and writing—original draft. Yang contributed to data curation, software, and visualization. Diao performed formal analysis, investigation, and validation. Wang contributed to formal analysis, resources, and validation. Yu was responsible for data curation, resources, and writing—review & editing. Yan contributed to data curation and writing—review & editing. Xu provided funding acquisition, supervision, and writing—review & editing. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Li, F., Cao, J., Yang, H. et al. A foreign object detection dataset and network for electrified railway catenary systems. Sci Rep (2026). https://doi.org/10.1038/s41598-026-39129-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-39129-8


