Fig. 5: The main architecture of GH-UNet.
From: GH-UNet: group-wise hybrid convolution-VIT for robust medical image segmentation

The GH-UNet architecture comprises (a) the overall structure and its key components: (b) a hybrid convolution-VIT encoder, (c) an MSGA block, (d) a CSG block, and (e) a GDG block. In the diagram, concatenation denotes channel-wise concatenation, addition indicates element-wise addition, and multiplication refers to standard matrix multiplication.
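The three operations named in the caption can be illustrated with a minimal sketch. It is not taken from the GH-UNet implementation: the helper names (`concat_channels`, `add_elementwise`, `matmul`) and the toy values are hypothetical, and feature maps are assumed to be nested lists shaped `[channels][height][width]` so that the example runs without any tensor library.

```python
# Illustrative only: toy feature maps as nested lists shaped [channels][height][width].
# All names below are hypothetical and do not come from the GH-UNet code.

def concat_channels(a, b):
    """Channel-wise concatenation: append the channels of b after those of a."""
    return a + b

def add_elementwise(a, b):
    """Element-wise addition of two feature maps with identical shapes."""
    return [[[x + y for x, y in zip(row_a, row_b)]
             for row_a, row_b in zip(ch_a, ch_b)]
            for ch_a, ch_b in zip(a, b)]

def matmul(m, n):
    """Standard matrix product of a (p x q) matrix and a (q x r) matrix."""
    return [[sum(m[i][k] * n[k][j] for k in range(len(n)))
             for j in range(len(n[0]))]
            for i in range(len(m))]

x = [[[1, 2], [3, 4]]]        # 1 channel, 2 x 2
y = [[[10, 20], [30, 40]]]    # 1 channel, 2 x 2

cat = concat_channels(x, y)   # 2 channels, 2 x 2
total = add_elementwise(x, y) # 1 channel, 2 x 2
prod = matmul(x[0], y[0])     # 2 x 2 matrix product, not an element-wise product
```

Concatenation increases the channel count, addition keeps the shape unchanged, and matrix multiplication combines rows with columns, which is why it differs from an element-wise product.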