Nature

Extended Data Table 4 Reward components

From: Magnetic control of tokamak plasmas through deep reinforcement learning

Description of reward components. All of these return an actual and a target value, and many allow time-varying target values. See Extended Data Table 3 for where and how they are used.
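As a reading aid, the interface described in the caption (each component returns an actual and a target value, and the target may vary in time) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the class `RewardComponent`, the `measure` callable, the state key `"ip"`, the `ramp` function, and all numeric values are hypothetical and chosen only to show the shape of the interface.

```python
from dataclasses import dataclass
from typing import Callable, Tuple, Union

# A target is either a fixed number or a function of time (time-varying target).
Target = Union[float, Callable[[float], float]]


@dataclass
class RewardComponent:
    """Hypothetical reward component returning an (actual, target) pair."""
    name: str
    measure: Callable[[dict], float]  # assumed: extracts the actual value from a state dict
    target: Target

    def __call__(self, state: dict, t: float) -> Tuple[float, float]:
        actual = self.measure(state)
        # Evaluate the target at time t if it is time-varying, else use it as-is.
        target = self.target(t) if callable(self.target) else self.target
        return actual, target


def ramp(t: float) -> float:
    """Illustrative time-varying target: linear ramp from 100 to 150 over t in [0, 1]."""
    return 100.0 + 50.0 * min(max(t, 0.0), 1.0)


# Hypothetical component with a time-varying target (arbitrary units).
plasma_current = RewardComponent("plasma_current", lambda s: s["ip"], ramp)
actual, target = plasma_current({"ip": 120.0}, t=0.5)

# Hypothetical component with a fixed target.
fixed = RewardComponent("fixed_example", lambda s: s["ip"], 110.0)
fixed_actual, fixed_target = fixed({"ip": 120.0}, t=0.5)
```

Returning the pair rather than a single scalar leaves the choice of how to compare actual and target, and how to combine components, to a later stage; Extended Data Table 3 is where the source describes how the components are used.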
