Fig. 2
From: A multi-scale small object detection algorithm SMA-YOLO for UAV remote sensing images

Structure of NSSA, where \(S'\) represents \(S \times S\), and \(H'\) and \(W'\) represent \(H/S\) and \(W/S\), respectively. The figure illustrates the case where the sparse coefficient \(S\) is 2, which divides the input features into \(S' = 4\) non-overlapping tensor blocks, shown in different colors. When the sparse coefficient \(S\) is 1, the input is treated as a single block, so NSSA performs global attention.
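
The block arithmetic in the caption can be illustrated with a minimal sketch. It only tracks which spatial positions fall into which block and is not the authors' implementation. The function name `partition_blocks` is hypothetical, and the strided grouping rule (a position's block is chosen by `row % s` and `col % s`) is an assumption: the caption gives the number and size of the blocks but not how positions are assigned to them.

```python
def partition_blocks(height, width, s):
    """Split an H x W grid of positions into s * s non-overlapping blocks.

    Assumption: positions are grouped by strided sampling, so the pair
    (row % s, col % s) selects the block. The caption does not state
    whether the blocks are strided or contiguous.
    """
    if height % s or width % s:
        raise ValueError("height and width must be divisible by s")
    blocks = [[] for _ in range(s * s)]
    for row in range(height):
        for col in range(width):
            blocks[(row % s) * s + (col % s)].append((row, col))
    return blocks


# S = 2 on a 4 x 4 feature map: S' = 4 blocks, each with H' * W' = 2 * 2 positions.
blocks = partition_blocks(4, 4, 2)
print(len(blocks))     # 4
print(len(blocks[0]))  # 4

# S = 1: a single block holding every position, i.e. the global-attention case.
print(len(partition_blocks(4, 4, 1)))  # 1
```

Under this assumption, attention would be computed independently within each block, so each block attends over \(H' \times W'\) positions instead of \(H \times W\).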