Fig. 4
From: RandMScan: accelerating parallel scan via matrix computation and random-jump strategy

Comparison of memory movement across scan implementations on the SOFA accelerator for an input of \(2^{20}\) elements. Both the total memory transferred and the memory movement per element are shown.