Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Reports
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific reports
  3. articles
  4. article
Hierarchical depth aware YOLO for efficient metal surface defect detection
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 04 April 2026

Hierarchical depth aware YOLO for efficient metal surface defect detection

  • Qiyuan Qin1,
  • Anis Salwa Mohd Khairuddin1,
  • Noorhayati Idros1,3,
  • Haichuan Liu1,
  • Liming Fan1,
  • Zuoming Yang1,
  • JIAYI LI1 &
  • …
  • Chenjinhang Zhu2 

Scientific Reports (2026) Cite this article

  • 1193 Accesses

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Engineering
  • Mathematics and computing

Abstract

Accurate and real-time metal surface defect detection under complex backgrounds and large appearance variations remains a critical challenge in intelligent manufacturing. Existing lightweight detectors often suffer from suboptimal performance due to uniformly applied feature refinement strategies across different network depths, which limits their ability to balance fine-grained representation and computational efficiency. To address this issue, we propose a hierarchical depth-aware refinement framework, termed HDR-YOLO, which explicitly aligns feature enhancement mechanisms with the distinct roles of shallow and deep representations. Specifically, a Query-Focused Convolution (QFC) block is introduced in shallow layers to enhance high-resolution texture and edge information, while a Query-Based Fusion (QBF) block is employed in deeper layers to improve global semantic modeling through adaptive feature interaction. The proposed design enables more effective detection of small-scale defects and irregular fine-grained patterns. Extensive experiments on the NEU-DET and GC10-DET datasets demonstrate that HDR-YOLO improves mAP@0.5 by 3.92% and 7.67%, respectively, over the baseline, while maintaining competitive inference efficiency. These results validate that depth-aware refinement is an effective strategy for enhancing lightweight defect detection under real-time industrial constraints.

Similar content being viewed by others

Research on a lightweight model for laser-cut diamond defect detection based on multi-module collaborative optimization

Article Open access 09 May 2026

YOLO-DCF: dual distillation and context-aware fusion for defect detection

Article Open access 03 April 2026

A high precision and lightweight method for steel surface defect detection based on improved YOLOv5

Article Open access 02 December 2025

Data availability

The datasets used in this study are publicly available. The NEU-DET dataset can be obtained from its original source, and the GC10-DET dataset is available from the corresponding public repository. The source code and additional data generated during the current study are available from the corresponding author upon reasonable request.

References

  1. Maleki, E. et al. Surface post-treatments for metal additive manufacturing: Progress, challenges, and opportunities. Addit. Manuf. 37, 101619 (2021).

    Google Scholar 

  2. Ragab, M. G. et al. A comprehensive systematic review of yolo for medical object detection (2018 to 2023). IEEE Access 12, 57815–57836 (2024).

    Google Scholar 

  3. Wang, Q. et al. A casting surface dataset and benchmark for subtle and confusable defect detection in complex contexts. IEEE Sens. J. 24, 16721–16733 (2024).

    Google Scholar 

  4. Mery, D. Aluminum casting inspection using deep object detection methods and simulated ellipsoidal defects. Mach. Vision Appl. 32, 72 (2021).

    Google Scholar 

  5. Karimi, N., Mishra, M. & Lourenço, P. B. Automated surface crack detection in historical constructions with various materials using deep learning-based yolo network. Int. J. Archit. Herit. 19, 581–597 (2025).

    Google Scholar 

  6. Ren, Z., Fang, F., Yan, N. & Wu, Y. State of the art in defect detection based on machine vision. Int. J. Precis. Eng. Manuf. Green Technol. 9, 661–691 (2022).

    Google Scholar 

  7. Tian, J. H. et al. An improved yolov5n algorithm for detecting surface defects in industrial components. Sci. Rep. 15, 9756 (2025).

    Google Scholar 

  8. Maruschak, P., Konovalenko, I. & Osadtsa, Y. Surface defects of rolled metal products recognised by a deep neural network under different illuminance levels and low-amplitude vibration. Int. J. Adv. Manuf. Technol. 139, 449–464 (2025).

    Google Scholar 

  9. Hütten, N. et al. Deep learning for automated visual inspection in manufacturing and maintenance: A survey of open-access papers. Appl. Syst. Innov. 7, 11 (2024).

    Google Scholar 

  10. Alzubaidi, L. et al. Review of deep learning: concepts, cnn architectures, challenges, applications, future directions. J. Big Data 8, 53 (2021).

    Google Scholar 

  11. Agarwal, A. & Patni, K. Lung cancer detection and classification based on alexnet cnn. In Proceedings of the 6th International Conference on Communication and Electronics Systems (ICCES) (2021).

  12. Shah, S. R. et al. Comparing inception v3, vgg16, vgg19, cnn, and resnet50: A case study on early detection of a rice disease. Agronomy 13, 1633 (2023).

    Google Scholar 

  13. Xu, W., Fu, Y.-L. & Zhu, D. Resnet and its application to medical image processing: Research progress and challenges. Comput. Methods Programs Biomed 240, 107660 (2023).

    Google Scholar 

  14. Xie, X. et al. Oriented r-cnn for object detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) (2021).

  15. Xu, X. et al. Crack detection and comparison study based on faster R-CNN and mask R-CNN. Sensors 22, 1215 (2022).

    Google Scholar 

  16. Bi, X., Hu, J., Xiao, B., Li, W. & Gao, X. Iemask R-CNN: Information-enhanced mask R-CNN. IEEE Trans. Big Data 9, 688–700 (2022).

    Google Scholar 

  17. Arwidiyarti, D. Single shot multibox detector (SSD) in object detection: A review. IJACI: Int. J. Adv. Comput. Inform. 1, 118–127 (2025).

    Google Scholar 

  18. Jiang, P., Ergu, D., Liu, F., Cai, Y. & Ma, B. A review of yolo algorithm developments. Proc. Comput. Sci. 199, 1066–1073 (2022).

    Google Scholar 

  19. Cheng, T. et al. Yolo-world: Real-time open-vocabulary object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 16901–16911 (2024).

  20. Qu, Y. et al. Optimization algorithm for steel surface defect detection based on pp-yoloe. Electronics 12, 4161 (2023).

    Google Scholar 

  21. Wang, C. et al. Gold-yolo: Efficient object detector via gather-and-distribute mechanism. Adv. Neural Inform. Process. Syst. 36, 51094–51112 (2023).

    Google Scholar 

  22. Terven, J., Córdova-Esparza, D. M. & Romero-González, J. A. A comprehensive review of yolo architectures in computer vision: From yolov1 to yolov8 and yolo-nas. Mach. Learn. Knowled. Extract. 5, 1680–1716 (2023).

    Google Scholar 

  23. Hussain, M. Yolo-v1 to yolo-v8, the rise of yolo and its complementary nature toward digital manufacturing and industrial defect detection. Machines 11, 677 (2023).

    Google Scholar 

  24. He, L. H., Zhou, Y. Z., Liu, L., Cao, W. & Ma, J. H. Research on object detection and recognition in remote sensing images based on yolov11. Sci. Rep. 15, 14032 (2025).

    Google Scholar 

  25. Choromanski, K., Likhosherstov, V., Dohan, D. et al. Rethinking attention with performers. In Proceedings of the International Conference on Learning Representations (ICLR) (2021).

  26. Sinha, D. & El-Sharkawy, M. Thin mobilenet: An enhanced mobilenet architecture. In Proceedings of the IEEE 10th Annual Ubiquitous Computing, Electronics and Mobile Communication Conference (UEMCON), 280–285 (2019).

  27. Rahimzadeh, M. & Attar, A. A modified deep convolutional neural network for detecting covid-19 and pneumonia from chest x-ray images based on the concatenation of xception and resnet50v2. Inform. Med. Unlocked 19, 100360 (2020).

    Google Scholar 

  28. Wang, Y., Wang, H. & Xin, Z. Efficient detection model of steel strip surface defects based on yolo-v7. IEEE Access 10, 133936–133944 (2022).

    Google Scholar 

  29. Wang, K., Teng, Z. & Zou, T. Metal defect detection based on yolov5. In Journal of Physics: Conference Series 2218, 012050 (2022).

  30. Jocher, G. et al. ultralytics/yolov5: v3.0 (2020). Zenodo.

  31. Sohan, M., Thotakura, T. S. R. & Reddy, C. V. R. A review on yolov8 and its advancements. In Proceedings of the International Conference on Data Intelligence and Cognitive Informatics (2024).

  32. Wang, A. et al. Yolov10: Real-time end-to-end object detection. Adv. Neural Inform. Process. Syst. 37, 107984–108011 (2024).

    Google Scholar 

  33. Sun, Y., Yan, H., Shang, Z. & Yang, M. Mch-yolov12: Research on surface defect detection algorithm for aluminum profiles based on improved yolov12. Sensors 25, 5389 (2025).

    Google Scholar 

  34. Xue, R., Hua, S. & Xu, H. Feci-rtdetr: A lightweight unmanned aerial vehicle infrared small target detector algorithm based on rt-detr. IEEE Access (2025).

  35. Vaswani, A. et al. Attention is all you need. Adv. Neural Inform. Process. Syst. 30 (2017).

  36. Gu, A. & Dao, T. Mamba: Linear-time sequence modeling with selective state spaces. In First Conference on Language Modeling (2024).

Download references

Acknowledgement

This research work was supported by Universiti Malaya under Project Number UMREG025-2025.

Funding

This research work was supported by Universiti Malaya under Project Number UMREG025-2025.

Author information

Authors and Affiliations

  1. Department of Electrical Engineering, Faculty of Engineering, Universiti Malaya, 50603, Kuala Lumpur, Malaysia

    Qiyuan Qin, Anis Salwa Mohd Khairuddin, Noorhayati Idros, Haichuan Liu, Liming Fan, Zuoming Yang & JIAYI LI

  2. Inner Mongolia Guangju New Materials Co., Ltd., Wuhai Low Carbon Industrial Park, Hainan District, Wuhai, 016000, China

    Chenjinhang Zhu

  3. Centre of Printable Electronics, Institute of Advanced Studies, Universiti Malaya, 50603, Kuala Lumpur, Malaysia

    Noorhayati Idros

Authors
  1. Qiyuan Qin
    View author publications

    Search author on:PubMed Google Scholar

  2. Anis Salwa Mohd Khairuddin
    View author publications

    Search author on:PubMed Google Scholar

  3. Noorhayati Idros
    View author publications

    Search author on:PubMed Google Scholar

  4. Haichuan Liu
    View author publications

    Search author on:PubMed Google Scholar

  5. Liming Fan
    View author publications

    Search author on:PubMed Google Scholar

  6. Zuoming Yang
    View author publications

    Search author on:PubMed Google Scholar

  7. JIAYI LI
    View author publications

    Search author on:PubMed Google Scholar

  8. Chenjinhang Zhu
    View author publications

    Search author on:PubMed Google Scholar

Contributions

Q.Q. conceived and designed the study, developed the methodology, conducted all experiments, analyzed the results, and wrote the manuscript. A.S.B.M.K. supervised the research and provided critical feedback on the manuscript. N.B.I., H.L., L.F., Z.Y., J.L., and C.Z. provided guidance and advice on the research design and data interpretation. All authors reviewed the manuscript.

Corresponding author

Correspondence to Anis Salwa Mohd Khairuddin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Qin, Q., Khairuddin, A.S.M., Idros, N. et al. Hierarchical depth aware YOLO for efficient metal surface defect detection. Sci Rep (2026). https://doi.org/10.1038/s41598-026-46074-z

Download citation

  • Received: 22 January 2026

  • Accepted: 24 March 2026

  • Published: 04 April 2026

  • DOI: https://doi.org/10.1038/s41598-026-46074-z

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • About Scientific Reports
  • Contact
  • Journal policies
  • Guide to referees
  • Calls for Papers
  • Editor's Choice
  • Journal highlights
  • Open Access Fees and Funding

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Reports (Sci Rep)

ISSN 2045-2322 (online)

nature.com footer links

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics