Extended Data Fig. 10: Transient VP neurons show evidence of reward prediction error encoding during learning, in contrast to persistent VP and GPe neurons. | Nature Neuroscience

Extended Data Fig. 10: Transient VP neurons show evidence of reward prediction error encoding during learning, in contrast to persistent VP and GPe neurons.

From: Dissociable roles of ventral pallidum neurons in the basal ganglia reinforcement learning network

Extended Data Fig. 10

a, Transient VP average population responses in the OTR-D and OTR-P blocks. In the OTR-P block, only trials in which the outcome was given are presented. Abscissa, time (-2 – 4 s). Vertical dashed lines at t = 0 and t = 2 s indicate the times of cue onset and outcome delivery, respectively; ordinate, firing rate in Hz, normalized by the mean discharge rate during the last 3 s of the ITI period. N = 29 neurons. Shaded regions represent SEMs. b, Transient VP, persistent VP and GPe average population responses in the first and last two trials of the NOV-D block. Top: reward trials; middle: neutral trials; bottom: aversive trials. N = 27, 12 and 94 neurons for the transient VP, persistent VP and GPe, respectively. Same conventions as in (a). c, Transient VP, persistent VP and GPe average population responses in the first and last trials of the NOV-P block, for trials in which the reward outcome was given (upper row) and in which it was omitted (lower row). N = 26, 12 and 93 neurons for the transient VP, persistent VP and GPe, respectively. Same conventions as in (a).

Back to article page