Table 1 Statistics of individual task success rate with easy-to-hard task ordering

In LRL, we assess the performance of all tasks (row-wise) once the agent completes training on each one-time feeding task (column-wise). In multi-task reinforcement learning, the agent is evaluated after simultaneous training on all tasks (row-wise). Each datum is based on at least five trials, with average values reported for evaluation. The metrics ‘forgetting’ and ‘forward transfer’ are used to assess the specific characteristics of the LRL agent. ‘Forgetting’, in the range [−1, 1] (equation (2)), measures the extent of knowledge retention, with lower values indicating better performance. ‘Forward transfer’, in the range [0, 1] (equation (3)), evaluates how well earlier task knowledge supports subsequent tasks, where higher values denote better performance. NA, not available.

Quick links

Search