Fig. 7: Graphic depicting policy rollout.

Here, the process of a control policy π generating trajectories of action at, state xt, and observation ot is visualized.

Here, the process of a control policy π generating trajectories of action at, state xt, and observation ot is visualized.