Table 4 Comparison with other methods on the MSCOCO validation set.

From: Curvelet-enhanced transformer architecture for blurred action fine-grained detection

| Methods | Backbone | Frame size | Parameters (M) | AP | AP50 | AP75 |
|---|---|---|---|---|---|---|
| TransPose | TransPose-H-A4 | 256 × 192 | 17.3 | 0.753 | – | – |
| SimCC | ResNet-50 | 256 × 192 | 25.7 | 0.708 | – | – |
| HRNet | HRNet-W32 | 256 × 192 | 28.5 | 0.734 | 0.895 | 0.807 |
| PRTR | ResNet-50 | 384 × 288 | 41.5 | 0.682 | 0.882 | 0.752 |
| EBA | ResNet-18 | 256 × 256 | 17.0 | 0.713 | 0.915 | 0.781 |
| RIFormer | HRFormer-B | 256 × 192 | 43.2 | 0.756 | 0.908 | 0.828 |
| BCIR | ResNet-50 | 256 × 192 | 34.0 | 0.675 | 0.872 | 0.740 |
| AECA | ResNet-18 | 384 × 288 | 19.0 | 0.745 | 0.925 | 0.814 |
| MCTN | DETR | 256 × 192 | 24.5 | 0.759 | 0.926 | 0.822 |
| MCTN | RT-DETRv3 | 256 × 192 | 32.8 | 0.767 | 0.938 | 0.836 |
| MCTN | DETR | 384 × 288 | 34.6 | 0.761 | 0.922 | 0.828 |
| MCTN | RT-DETRv3 | 384 × 288 | 40.4 | 0.766 | 0.941 | 0.833 |