Fig. 3
From: A Mamba based vision transformer for fine grained image segmentation of mural figures

The fundamental architecture of the P-SCconv decoder. Multiscale feature fusion and upsampling decoding are achieved through four distinct SCconv structures.