Fig. 1
From: Decentralized queue control with delay shifting in edge-IoT using reinforcement learning

Training dynamics of the delay shift agent: average reward and 95% CI over 200 episodes.
From: Decentralized queue control with delay shifting in edge-IoT using reinforcement learning

Training dynamics of the delay shift agent: average reward and 95% CI over 200 episodes.