Fig. 4
From: End to end polysemantic cooperative mixed task trainer for UAV target detection

(a) Ordinary self-attention module and (b) our Polysemantic Transformer (PoT) structure. \(+\) and \(\odot\) denote element-by-element summation and local matrix multiplication, respectively.