Table 5 Initial return-to-go (RTG) values set for depletion experiment one on the MuJoCo tasks.

From: Offline reinforcement learning combining generalized advantage estimation and modality decomposition interaction

| Dataset | Initial RTG |
| --- | --- |
| halfcheetah | 6000 |
| hopper | 3600 |
| walker2d | 5000 |
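
As a minimal sketch of how the initial RTG values listed above might be used, the snippet below stores them as a lookup and rolls the target forward over one episode. It assumes the common return-conditioned convention in which each observed reward is subtracted from the running RTG; the table does not state the paper's exact update rule, and the names `INITIAL_RTG` and `rtg_schedule` are hypothetical.

```python
# Initial return-to-go (RTG) targets from Table 5, keyed by dataset name.
INITIAL_RTG = {"halfcheetah": 6000.0, "hopper": 3600.0, "walker2d": 5000.0}


def rtg_schedule(dataset, rewards):
    """Return the RTG value conditioning the policy at each step of a rollout.

    Assumption: rtg[t + 1] = rtg[t] - reward[t], the usual convention for
    return-conditioned policies. The paper's own rule may differ.
    """
    rtg = INITIAL_RTG[dataset]
    schedule = [rtg]
    for reward in rewards:
        rtg -= reward  # spend the observed reward from the remaining target
        schedule.append(rtg)
    return schedule
```

For example, `rtg_schedule("hopper", [100.0, 50.0])` starts from the hopper target of 3600 and lowers it by each reward in turn.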