Scientific Reports

Table 5 Initial RTG (return-to-go) values set for depletion experiment one on the MuJoCo tasks.

From: Offline reinforcement learning combining generalized advantage estimation and modality decomposition interaction

Dataset	Initial RTG
halfcheetah	6000
hopper	3600
walker2d	5000
