Table 4 Comparisons on the COCO test-dev set.

From: An efficient and accurate 2D human pose estimation method using VTTransPose network

Method

Input size

#Params (M)

FLOPs (G)

AP

AP0.5

AP0.75

APM

APL

G-RMI28

353 × 257

42.6

57

64.9

85.5

71.3

62.3

70.0

Integral29

256 × 256

45.0

11.0

67.8

88.2

74.8

63.9

74.0

CPN7

384 × 288

58.8

29.2

72.1

91.4

80.0

68.7

77.2

RMPE30

320 × 256

28.1

26.7

72.3

89.2

79.1

68.0

78.6

SimpleBaseline27

384 × 288

68.6

35.6

73.7

91.9

81.1

70.3

80.0

HRNet-W328

384 × 288

28.5

16.0

74.9

92.5

82.8

71.3

80.9

HRNet-W488

256 × 192

63.6

14.6

74.2

92.4

82.4

70.9

79.7

HRNet-W488

384 × 288

63.6

32.9

75.5

92.5

83.3

71.9

81.5

DarkPose31

384 × 288

63.6

32.9

76.2

92.5

83.6

72.5

82.4

TokenPose16

256 × 192

13.5

5.7

74.0

91.9

81.5

70.6

79.8

DistilPose-S18

256 × 192

5.4

2.38

71.0

91.0

78.9

67.5

76.8

DistilPose-L18

256 × 192

21.3

10.33

73.7

91.6

81.1

70.2

79.6

GTPose-B19

256 × 192

13.5

74.5

92.2

82.2

70.7

79.8

TransPose-H-A417

256 × 192

17.3

17.5

74.7

91.9

82.2

71.4

80.7

TransPose-H-A617

256 × 192

17.5

21.8

75.0

92.2

82.3

71.3

81.1

TransPose-H–S17

256 × 192

8.0

10.2

73.4

91.6

81.1

70.1

79.3

VTTranspose (ours)

256 × 192

6.0

5.4

73.6

91.4

81.1

70.1

79.6