Figure 10
From: Safe reinforcement learning under temporal logic with reward design and quantum action selection

The generated optimal policy.