Fig. 2 | Scientific Reports

Fig. 2

From: Origin centric and part based pose decomposition for 3D human pose estimation

Fig. 2

Overview of the network architecture. The proposed network is designed with L stacked loops. Within each loop, the Origin-centric Part Transformer Block and the Spatial Transformer Encoder are arranged in a parallel structure to facilitate feature fusion. Subsequently, these components are integrated with the Temporal Transformer Encoder in a serial structure to enable feature transmission.

Back to article page