Table 5 Ablation study on the two improvement modules.

From: An efficient and accurate 2D human pose estimation method using VTTransPose network

| Model | Backbone | Params (Mb) | Memory (batch size = 4) (Mb) | AP (COCO val, GT bbox) |
| --- | --- | --- | --- | --- |
| TransPose-H-S | HRNet-S-W32 | 8 | 3503 | 76.1 |
| TransPose-H-S + twin attention | HRNet-S-W32 | 8 | 1953 | 76.3 |
| VTTransPose | HRNet-S-W32 + V block | 6 | 2007 | 76.5 |
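
The relative effect of each module can be read off the table with a little arithmetic. The following is a minimal sketch that uses only the values reported in Table 5; the names `rows` and `relative_to_baseline` are illustrative and do not come from the paper.

```python
# Values copied from Table 5 (units as reported there).
# The dictionary layout and the function name are assumptions for illustration.
rows = {
    "TransPose-H-S": {"params": 8, "memory": 3503, "ap": 76.1},
    "TransPose-H-S + twin attention": {"params": 8, "memory": 1953, "ap": 76.3},
    "VTTransPose": {"params": 6, "memory": 2007, "ap": 76.5},
}


def relative_to_baseline(rows, baseline="TransPose-H-S"):
    """Return percentage reductions and AP gain of each row versus the baseline."""
    base = rows[baseline]
    out = {}
    for name, r in rows.items():
        if name == baseline:
            continue
        out[name] = {
            # Percentage drop in parameter count relative to the baseline.
            "param_reduction_pct": round(
                100 * (base["params"] - r["params"]) / base["params"], 1
            ),
            # Percentage drop in memory use relative to the baseline.
            "memory_reduction_pct": round(
                100 * (base["memory"] - r["memory"]) / base["memory"], 1
            ),
            # Absolute AP difference in points.
            "ap_gain": round(r["ap"] - base["ap"], 1),
        }
    return out


summary = relative_to_baseline(rows)
for name, stats in summary.items():
    print(name, stats)
```

Under these numbers, twin attention alone accounts for roughly a 44.2% memory reduction with a 0.2-point AP gain, while the full VTTransPose configuration reduces memory by roughly 42.7% and parameters by 25% with a 0.4-point AP gain.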