Extended Data Fig. 4: Modulation of path replay by experience and expectations.
From: Differential replay of reward and punishment paths predicts approach and avoidance

(a) Replay strength (y axis) during planning as predicted by a model containing path type (reward or loss), path experience (x axis), and path transition probability (darker lines indicate higher transition probability). Evidence of rewarding path replay increased when rewarding paths had not been experienced for longer, whereas the opposite was true for punishing paths. This was most prominent when rewarding paths were more likely to be transitioned to. (b) Replay strength (y axis) during planning was not significantly predicted by a model containing path probability (x axis) and choice (approach or avoid).