Scientific Reports

Table 7 Results of the ablation experiment: average normalized scores achieved on the halfcheetah, hopper and walker2d tasks across three dataset levels.

From: Offline reinforcement learning combining generalized advantage estimation and modality decomposition interaction

Dataset	Method
	(a)	(b)	(c)	(d)	(e)	(f)	CGM
halfcheetah	48.2	61.5	53.1	48.6	5.94	64.1	64.6
hopper	70.3	88	63.9	77.3	4.07	89.53	91.9
walker2d	67.5	84	80.2	70.6	2.41	91.12	89.2
Time (h)	1.94	2.48	2.73	2.53	1.48	3.65	3.20

Significance values are in bold.
