Figure 8

Layers of the feature population block, which expands the number of channels from 3 to C1 using a convolution layer with a weight kernel of size K1 and a stride S1. The default stride S1 is 2 × 2 pixels, meaning the kernel K1 moves two pixels at a time in each direction. All dimensions are given in pixels.
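
As a rough illustration of how the kernel size K1 and stride S1 determine the output dimensions of this block, the following minimal sketch applies the standard convolution output-size formula. The function name `conv_output_shape`, the `padding` parameter, and the example values (a 224 × 224 input, C1 = 64, K1 = 7 × 7, padding of 3) are assumptions chosen for illustration; they are not taken from the figure.

```python
def conv_output_shape(height, width, c1, k1, s1=(2, 2), padding=(0, 0)):
    """Return (channels, out_height, out_width) for a single strided convolution.

    height, width: input size in pixels (the input is assumed to have 3 channels).
    c1: number of output channels (C1).
    k1: kernel size (K1) as (rows, cols), in pixels.
    s1: stride (S1) as (rows, cols), in pixels; defaults to 2 x 2 as in the caption.
    padding: zero padding per side as (rows, cols); hypothetical, not given in the caption.
    """
    out_h = (height + 2 * padding[0] - k1[0]) // s1[0] + 1
    out_w = (width + 2 * padding[1] - k1[1]) // s1[1] + 1
    return c1, out_h, out_w


# Illustrative values only: 224 x 224 input, C1 = 64, K1 = 7 x 7, padding = 3.
shape = conv_output_shape(224, 224, c1=64, k1=(7, 7), padding=(3, 3))
print(shape)  # (64, 112, 112)
```

With the default 2 × 2 stride, each spatial dimension is roughly halved while the channel count grows from 3 to C1.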