Table 1 Generalization performance evaluation in Test Scenario-4

Method	No distractor		1 distractor		3 distractors		6 distractors		10 distractors
	SR(%)	RPL	SR(%)	RPL	SR(%)	RPL	SR(%)	RPL	SR(%)	RPL
NBV	35.6	25.5	18.8	51.8	10.9	102.4	5.3	119.2	-	-
EI (STF)	54.8	15.8	43.7	30.1	35.6	49.8	26.4	101.4	17.1	125.8
EI (SI)	69.5	15.7	62.0	19.7	56.0	22.6	50.0	26.3	41.7	30.1
EI (EA)	77.9	14.3	70.6	15.4	64.8	16.0	57.4	16.7	49.6	17.4
EI (EA+SI)	81.4	12.1	75.8	12.8	71.7	13.2	66.5	13.7	60.4	14.3
CEL (Rules)	91.1	8.1	89.5	8.6	85.6	8.9	81.2	9.5	76.9	10.1

The values in bold indicate the best-performing results in every setting. SR is the success rate, and RPL is the relative path length. EL(EA + SI), EL(EA), EL(SI), and EL(STF) represent four common embodied learning (EL) paradigms. EA+SI is a policy learning method that uses the semantic feature cognition (SFC) and the sequential cognitive memory (SCM) to process observations. EA is a policy learning method that uses the SFC to process observations. SI is a transformer-based method that uses the SFC and the SCM to generate semantic tokens. STF is an adaptive embodied learning method based on the student-teacher framework. NBV is a region-based viewpoint optimization computational method.

Quick links

Search