Fig. 3
From: Real time weed identification with enhanced mobilevit model for mobile devices

MobileViT Efficient Channel Attention (ECA) block structure. k is size of local interaction in one-dimensional convolution, H is the height of the input data, W is the width of the input data, C is the number of channels.