Table 5 Comparison of Polysemantic Transformer (PoT) experiments at various stages with different k \(\times\) k grids.

From: End to end polysemantic cooperative mixed task trainer for UAV target detection

Polysemantic Transformer (PoT) k \(\times\) k grids

#Params

FLOPs

Acc.

ResNet-2

ResNet-3

ResNet-4

ResNet-5

3\(\times\)3 (*3)

3\(\times\)3 (*4)

3\(\times\)3 (*5)

3\(\times\)3 (*3)

21.3M

3.5G

33.63

3\(\times\)3 (*3)

5\(\times\)5 (*4)

3\(\times\)3 (*5)

3\(\times\)3 (*3)

22.1M

3.5G

33.15

3\(\times\)3 (*3)

3\(\times\)3 (*4)

5\(\times\)5 (*5)

3\(\times\)3 (*3)

23.2M

3.8G

33.01

3\(\times\)3 (*3)

3\(\times\)3 (*4)

5\(\times\)5 (*5)

5\(\times\)5 (*3)

24.1M

3.9G

32.18

5\(\times\)5 (*3)

5\(\times\)5 (*4)

5\(\times\)5 (*5)

5\(\times\)5 (*3)

25.4M

4.1G

31.79

  1. (Experimented on VisDrone 2019 dataset).