Abstract
Remote sensing imagery presents unique challenges for object detection due to wide fields of view, complex backgrounds, and the dense distribution of small targets, often rendering traditional methods ineffective. To address these limitations, we introduce GSS-YOLO, a lightweight network tailored for remote sensing environments. Our architecture integrates a Spatial Information Aggregation (SIA) module within a Cross-Stage Partial Network (C3) to optimize both detection accuracy and processing efficiency. Furthermore, we incorporate Spatial Pyramid Dilated Convolution (SPD-Conv) to enhance adaptability to low-resolution inputs, and embed a Global Context-Aware Module (GCAM) prior to the detection head to refine multi-scale feature representation. Evaluations on the USOD, VisDrone2019 and DIOR datasets demonstrate that GSS-YOLO achieves superior precision, recall, and robustness across both color and grayscale imagery, all while maintaining a lightweight architecture. Validated by ablation studies, this approach provides an efficient and robust solution for small target detection in complex remote sensing scenarios.
Similar content being viewed by others
Data availability
The data used in this study are available upon request from the corresponding author via email.
References
Zhang, M. et al. IRSAM: Advancing segment anything model for infrared small target detection p. 233–249 (Springer, 2024).
Yang, R. et al. KPE-YOLOv5: An improved small target detection algorithm based on YOLOv5. Electronics 12 (4), 817 (2023).
Wang, H., Qian, H. & Feng, S. GAN-STD: small target detection based on generative adversarial network. J. Real-Time Image Proc. 21 (3), 65 (2024).
Tong, X. et al. MSAFFNet: A multiscale label-supervised attention feature fusion network for infrared small target detection. IEEE Trans. Geosci. Remote Sens. 61, 1–16 (2023).
Dai, Y. et al. One-stage cascade refinement networks for infrared small target detection. IEEE Trans. Geosci. Remote Sens. 61, 1–17 (2023).
Dai, Y. et al. Attentional local contrast networks for infrared small target detection. IEEE Trans. Geosci. Remote Sens. 59 (11), 9813–9824 (2021).
Jing, R. et al. Sunflower-YOLO: Detection of sunflower capitula in UAV remote sensing images. Eur. J. Agron. 160, 127332 (2024).
Wu, Z. et al. Cbgs-yolo: A lightweight network for detecting small targets in remote sensing images based on a double attention mechanism. Remote Sens. 17 (1), 109 (2024).
Hui, Y., Wang, J. & Li, B. UAV remote sensing small target recognition algorithm for YOLOV7 based on dense residual super-resolution and anchor frame adaptive regression strategy. J. King Saud University-Computer Inform. Sci. 36 (1), 101863 (2024).
Zhang, W. et al. LS-YOLO: A novel model for detecting multiscale landslides with remote sensing images. IEEE J. Sel. Top. Appl. Earth Observations Remote Sens. 17, 4952–4965 (2024).
Hu, J. et al. CM-YOLO: Typical object detection method in remote sensing cloud and mist scene images. Remote Sens. 17 (1), 125 (2025).
Hui, Y. et al. SEB-YOLO: An improved YOLOv5 model for remote sensing small target detection. Sensors 24 (7), 2193 (2024).
Bian, D. et al. A refined methodology for small object detection: Multi-scale feature extraction and cross-stage feature fusion network. Digit. Signal Proc., : p. 105297. (2025).
Xia, Y. et al. Behavior detection Algorithm of Caged White-feather broiler based on multi-scale detail feature fusion and object relation inference. IEEE. pp. 1002–1006. (2023).
Zhang, D. et al. Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation (IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025).
Li, H. et al. VDG: vision-only dynamic gaussian for driving simulation (IEEE Robotics and Automation Letters, 2025).
Tang, B. et al. R2PLoc: A region-to-point UAV visual geo-localization framework leveraging hierarchical semantic representation (IEEE Transactions on Geoscience and Remote Sensing, 2025).
Liang, L. et al. DBMLLA: Double-branch Mamba-like linear attention network for hyperspectral image classification (IEEE Transactions on Geoscience and Remote Sensing, 2025).
Liang, L. et al. LKMA: Learnable Kernel and Mamba with Spatial-Spectral Attention Fusion for Hyperspectral Image Classification (IEEE Transactions on Geoscience and Remote Sensing, 2025).
Zhang, Z. et al. Multireceptive field: An adaptive path aggregation graph neural framework for hyperspectral image classification. Expert Syst. Appl. 217, 119508 (2023).
Yao, D. et al. Deep hybrid: multi-graph neural network collaboration for hyperspectral image classification. Def. Technol. 23, 164–176 (2023).
Zhang, T. et al. CCSFuse: Collaborative Compensation and Selective Fusion for UAV-Based RGB-IR Object Detection. IEEE Trans. Geosci. Remote Sens. 64, 1–14 (2025).
Zhuo, Z. et al. TAF-YOLO: A Small-Object Detection Network for UAV Aerial Imagery via Visible and Infrared Adaptive Fusion. Remote Sens. 17 (24), 3936 (2025).
Weng, T. & Niu, X. LMDENet: A Lightweight RGB-IR Object Detection Network for Low-Light Remote Sensing Images. Sensors 26 (4), 1130 (2026).
He, M. et al. Misaligned RGB-infrared object detection via adaptive dual-discrepancy calibration. Remote Sens. 15 (19), 4887 (2023).
Zhang, Y. et al. FFCA-YOLO for Small Object Detection in Remote Sensing Images. IEEE Trans. Geosci. Remote Sens. 62 ({}), 1–15 (2024).
Du, D. et al. VisDrone-DET2019: The vision meets drone object detection in image challenge results. pp. 0–0. (2019).
Li, K. et al. Object detection in optical remote sensing images: A survey and a new benchmark. ISPRS J. photogrammetry remote Sens. 159, 296–307 (2020).
Chen, J. et al. LKPF-YOLO: A Small Target Ship Detection Method for Marine Wide-Area Remote Sensing Images (IEEE Transactions on Aerospace and Electronic Systems, 2024).
Li, Y. & Zhang, Y. Robust infrared small target detection using local steering kernel reconstruction. Pattern Recogn. 77, 113–125 (2018).
Wang, J. et al. Remote sensing small object detection based on multi-contextual information aggregation. IEEE J. Sel. Top. Appl. Earth Observations Remote Sens., (2025).
Wang, C., Yeh, I. & Mark Liao, H. Yolov9: Learning what you want to learn using programmable gradient information p. 1–21 (Springer, 2024).
Wang, A. et al. Yolov10: Real-time end-to-end object detection. Adv. Neural. Inf. Process. Syst. 37, 107984–108011 (2024).
Khanam, R. & Hussain, M. Yolov11: An overview of the key architectural enhancements. arXiv preprint arXiv:2410.17725, (2024).
Funding
This work was supported by Jilin Province Youth Growth Science and Technology Program Project (No. 20220508041RC).
Author information
Authors and Affiliations
Contributions
Conceptualization, methodology, software, validation, Z.W.; formal analysis, investigation, resources, N.L.; writing—original draft preparation, Z.W.; writing—review and editing, D.W.; supervision, Z.W.; project administration, Z.T.; funding acquisition, D.W. All authors have read and agreed to the published version of the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Wu, Z., Li, N., Tian, Z. et al. A lightweight feature fusion network for weak and small target detection in remote sensing. Sci Rep (2026). https://doi.org/10.1038/s41598-026-43560-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-43560-2


