Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Reports
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific reports
  3. articles
  4. article
Multiscale diffusion-enhanced attention network for steel surface defect detection in Polysilicon Production
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 16 January 2026

Multiscale diffusion-enhanced attention network for steel surface defect detection in Polysilicon Production

  • Yiwei Duan1,
  • Lizhen He1,
  • Zhisheng Wang1,
  • Jinhai Sa1,
  • Jiawen Yang2,
  • Xiaolong Chen3,
  • Bingdong Shi4,
  • Yangyang Zhang4 &
  • …
  • Jiawen Sun5,6 

Scientific Reports , Article number:  (2026) Cite this article

  • 379 Accesses

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Energy science and technology
  • Engineering
  • Mathematics and computing

Abstract

Surface defect detection on steel components is crucial for quality control in polysilicon production. However, this task remains challenging due to tiny defect sizes, irregular geometries, complex backgrounds, and low contrast. To address these issues, we propose MSEOD-DDFusionNet (Multi-Scale and Effective Object-Detection Diffusion Fusion Network), a novel multi-scale diffusion-enhanced attention network. The network integrates four specialized modules: MTECAAttention (Multi-Scale Texture Enhancement Channel-Aware Attention) for lossless multi-scale feature fusion, ODConv (Omni-Dimensional Dynamic Convolution) for dynamic adaptation to irregular geometries, LMDP (Local Multi-Scale Discriminative Perception) for selective noise suppression and micro-defect amplification, and DDFusion (Diffusion-Driven Feature Fusion) for scene-aware noise modeling. Pruning further reduces computational complexity while improving accuracy. Extensive experiments on the specialized DDTE dataset and public benchmarks demonstrate state-of-the-art performance. Our model achieves 82.6% \(\hbox {mAP}_{50}\) and 61.6% \(\hbox {mAP}_{50-95}\) on DDTE, while maintaining a high inference speed of 193.5 FPS with only 8.46M parameters. It also shows excellent generalization across NEU-DET, GC10-DET, and cross-domain tasks, providing an efficient and accurate solution for industrial defect inspection.

Similar content being viewed by others

Steel surface defect detection algorithm based on improved YOLOv10

Article Open access 25 September 2025

DSAT: a dynamic sparse attention transformer for steel surface defect detection with hierarchical feature fusion

Article Open access 09 August 2025

Surface defect detection of hot rolled steel based on multi-scale feature fusion and attention mechanism residual block

Article Open access 01 April 2024

Data availability

The specific industrial dataset is subject to privacy restrictions and is not publicly available. To ensure reproducibility and enable further application, we provide the complete implementation, including code and pre-trained models, at: https://github.com/jiunian158/DDTE_Anonymous_Subset/blob/main/DDTE_Anonymous_Subset%20(3).zip

References

  1. Wang, Z., Zhou, W. & Li, Y. MSAF-YOLO: An Efficient Multi-Scale Attention Fusion Network for high-precision steel surface defect detection. Measurement 118640, https://doi.org/10.1016/j.measurement.2025.118640 (2025).

  2. Liu, R. et al. A lightweight model based on multi-scale feature fusion for ultrasonic welding surface defect detection. Eng. Appl. Artif. Intell. 161, 112208. https://doi.org/10.1016/j.engappai.2025.112208 (2025).

    Google Scholar 

  3. Wang, S. et al. Research on steel surface defect detection system based on YOLOv5s-SE-CA model and BEMD image enhancement. Nondestruct. Test. Eval. 1–20, https://doi.org/10.1080/10589759.2024.2393205 (2024).

  4. Pan, Y. & Zhang, L. Dual attention deep learning network for automatic steel surface defect segmentation.. Comput.-Aided Civ. Inf. 37, 1468–1487. https://doi.org/10.1111/mice.12792 (2022).

    Google Scholar 

  5. Hosseini, S. M., Ebrahimi, A., Mosavi, M. R. & Shahhoseini, HSh. A novel hybrid CNN-CBAM-GRU method for intrusion detection in modern networks. Results Eng. 28, 107103. https://doi.org/10.1016/j.rineng.2025.107103 (2025).

    Google Scholar 

  6. Yeung, C.-C. & Lam, K.-M. Efficient fused-attention model for steel surface defect detection. IEEE Trans. Instrum. Meas. 71, 1–11. https://doi.org/10.1109/TIM.2022.3176239 (2022).

    Google Scholar 

  7. Ziadlou, G., Emami, S. & Asadi-Gangraj, E. Network configuration distributed production scheduling problem: A constraint programming approach. Comput. Ind. Eng. 188, 109916. https://doi.org/10.1016/j.cie.2024.109916 (2024).

    Google Scholar 

  8. Wen, J., Zheng, Y., Zhang, Y. & Yu, W. Enhanced dual-channel feature fusion approach for rolling bearing fault diagnosis. Nondestruct. Test. Eval. 40, 3309–3337. https://doi.org/10.1080/10589759.2025.2507761 (2025).

    Google Scholar 

  9. Ye, Z. & Yu, J. AKSNet: A novel convolutional neural network with adaptive kernel width and sparse regularization for machinery fault diagnosis. J. Manuf. Syst. 59, 467–480. https://doi.org/10.1016/j.jmsy.2021.03.022 (2021).

    Google Scholar 

  10. Xu, X., Li, X., Ming, W. & Chen, M. A novel multi-scale CNN and attention mechanism method with multi-sensor signal for remaining useful life prediction. Comput. Ind. Eng. 169, 108204. https://doi.org/10.1016/j.cie.2022.108204 (2022).

    Google Scholar 

  11. Zhou, H. et al. Ast-gnn: An attention-based spatio-temporal graph neural network for interaction-aware pedestrian trajectory prediction. Neurocomputing 445, 298–308. https://doi.org/10.1016/j.neucom.2021.03.024 (2021).

    Google Scholar 

  12. Rombach, R., Blattmann, A., Lorenz, D., Esser, P. & Ommer, B. High-resolution image synthesis with latent diffusion models. In Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. 10684–10695, https://doi.org/10.48550/arXiv.2112.10752 (2022).

  13. Zhai, C., Wang, L. & Yuan, J. New Fusion Network with Dual-Branch Encoder and Triple-Branch Decoder for Remote Sensing Image Change Detection. Appl. Sci. 13, 6167. https://doi.org/10.3390/app13106167 (2023).

    Google Scholar 

  14. Fang, T. et al. Human-Guided Data Augmentation via Diffusion Model for Surface Defect Recognition Under Limited Data. IEEE Trans. Instrum. Meas. 74, 1–16. https://doi.org/10.1109/TIM.2025.3541684 (2025).

    Google Scholar 

  15. Lin, Z., Li, Z., Yu, J., Hu, M. & Wang, X. FFDDNet: Flexible focused defect detection network. IEEE Trans. Instrum. Meas. 74, 5019812. https://doi.org/10.1109/TIM.2025.3551459 (2025).

    Google Scholar 

  16. Wang, D., Shang, K., Wu, H. & Wang, C. Decoupled R-CNN: Sensitivity-specific detector for higher accurate localization. IEEE Trans. Circuits Syst. Video Technol. 32, 6324–6336. https://doi.org/10.1109/TCSVT.2022.3167114 (2022).

    Google Scholar 

  17. Fang, R. et al. FeatAug-DETR: Enriching one-to-many matching for DETRs with feature augmentation. IEEE Trans. Pattern Anal. Mach. Intell. 46, 6402–6415. https://doi.org/10.1109/TPAMI.2024.3381961 (2024).

    Google Scholar 

  18. Yang, K. et al. DETA: A point-based tracker with deformable transformer and task-aligned learning. IEEE Trans. Multimed. 25, 7545–7558. https://doi.org/10.1109/TMM.2022.3223213 (2023).

    Google Scholar 

  19. Zhang, M. et al. Oriented-DINO: Angle decoupling prediction and consistency optimizing for oriented detection transformer. IEEE Trans. Geosci. Remote Sens. 62, 5638315. https://doi.org/10.1109/TGRS.2024.3450200 (2024).

    Google Scholar 

  20. Li, Q., Shao, Y., Li, L., Li, J. & Hao, H. Weak surface defect detection for production-line plastic bottles with multi-view imaging system and LFF YOLO. Opt. Laser Eng. 181, 108369. https://doi.org/10.1016/j.optlaseng.2024.108369 (2024).

    Google Scholar 

  21. Zhu, C., Sun, Y., Zhang, H., Yuan, S. & Zhang, H. LE-YOLOv5: A lightweight and efficient neural network for steel surface defect detection. IEEE Access 12, 195242–195255. https://doi.org/10.1109/ACCESS.2024.3519161 (2024).

    Google Scholar 

  22. Liu, M., Chen, Y., Xie, J., He, L. & Zhang, Y. LF-YOLO: A lighter and faster yolo for weld defect detection of X-ray image. IEEE Sens. J. 23, 7430–7439. https://doi.org/10.1109/JSEN.2023.3247006 (2023).

    Google Scholar 

  23. Sun, J., Shen, X., Huang, H., Wang, Q. & Zhao, H. RT-SLAM: A Real-Time Visual SLAM System Integrating Enhanced RT-DETR and Optical Flow Techniques. IEEE Internet Things J. 12, 12803–12814. https://doi.org/10.1109/JIOT.2024.3522490 (2024).

    Google Scholar 

Download references

Funding

This research is funded by the Autonomous Region Science and Technology Plan Project “Research and Application of Artificial Intelligence Technology for Silicon-Based New Material Manufacturing” (Project No. 2023B01033).

Author information

Authors and Affiliations

  1. School of Software Engineering, Xinjiang University, Urumqi, 830046, China

    Yiwei Duan, Lizhen He, Zhisheng Wang & Jinhai Sa

  2. School of Public Administration, Public Administration Major, Xi’an University of Architecture and Technology, Xi’an, 710055, China

    Jiawen Yang

  3. Technical Department, Xinjiang Binghua Technology Co., Ltd., Urumqi, 830000, China

    Xiaolong Chen

  4. Smart Manufacturing Division, Xinte Energy Co. Ltd., Urumqi, 830011, China

    Bingdong Shi & Yangyang Zhang

  5. School of Computer Science and Technology, Xinjiang University, Urumqi, 830046, China

    Jiawen Sun

  6. State Grid Xinjiang Electric Power Company Economic Research Institute, Urumqi, 830002, China

    Jiawen Sun

Authors
  1. Yiwei Duan
    View author publications

    Search author on:PubMed Google Scholar

  2. Lizhen He
    View author publications

    Search author on:PubMed Google Scholar

  3. Zhisheng Wang
    View author publications

    Search author on:PubMed Google Scholar

  4. Jinhai Sa
    View author publications

    Search author on:PubMed Google Scholar

  5. Jiawen Yang
    View author publications

    Search author on:PubMed Google Scholar

  6. Xiaolong Chen
    View author publications

    Search author on:PubMed Google Scholar

  7. Bingdong Shi
    View author publications

    Search author on:PubMed Google Scholar

  8. Yangyang Zhang
    View author publications

    Search author on:PubMed Google Scholar

  9. Jiawen Sun
    View author publications

    Search author on:PubMed Google Scholar

Contributions

Yiwei Duan contributed to the study conception and design. Material preparation, data collection, and analysis were performed by Others. The first draft of the manuscript was written by Yiwei Duan; all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Jinhai Sa.

Ethics declarations

Competing interests

Each named author has substantially contributed to conducting the underlying research and drafting this manuscript. Additionally, to the best of our knowledge, the named authors have no conflict of interest, financial or otherwise.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Duan, Y., He, L., Wang, Z. et al. Multiscale diffusion-enhanced attention network for steel surface defect detection in Polysilicon Production. Sci Rep (2026). https://doi.org/10.1038/s41598-026-35913-8

Download citation

  • Received: 30 July 2025

  • Accepted: 08 January 2026

  • Published: 16 January 2026

  • DOI: https://doi.org/10.1038/s41598-026-35913-8

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • Surface defect detection
  • Polysilicon production
  • Industrial defect inspection
Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on Twitter
  • Sign up for alerts
  • RSS feed

About the journal

  • About Scientific Reports
  • Contact
  • Journal policies
  • Guide to referees
  • Calls for Papers
  • Editor's Choice
  • Journal highlights
  • Open Access Fees and Funding

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Reports (Sci Rep)

ISSN 2045-2322 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics