Fig. 3
From: Frame points attention convolution for deep learning on point cloud

Regular form and equivalent form of two-dimensional convolution on regular data. (a) is the conventional form of two-dimensional convolution. It defines multiple filters with the same shape as the local area to extract features from the image. (b) is the equivalent form, which defines the weights according to the input and output channels of each pixel. The filters in (a) are equivalent to the weights in (a). The two forms can be converted to each other.