Extended Data Table 4 Reward components

From: Magnetic control of tokamak plasmas through deep reinforcement learning

  1. Description of reward components. All of these return an actual and a target value, and many allow time-varying target values. See Extended Data Table 3 for where and how they are used.