Fig. 4
From: Enhancing object pose estimation for RGB images in cluttered scenes

The rotation network consists of two modules: the Multi-head Self-attention (MHSA) module and the Iterative Refinement module.