Algorithm 2
From: Reinforcement learning-based optimal control for stochastic opinion dynamics

Model-Free LSPI for Unknown Stochastic Dynamics.