Fig. 3: TS dopamine release is consistent with encoding APE.
From: Dopaminergic action prediction errors serve as a value-free teaching signal

a, dLight recording in the VS. Each trace is the average of 200 trials. b, Example VS dopamine response size to contralateral cue binned every 40 trials. c, Average change in contralateral cue-aligned VS dopamine response. The solid orange trace represents the mean (n = 7 mice). The light orange trace is the mean predicted RPE response from 100 model agents. a.u., arbitrary units. d, Size of contralateral cue-aligned dopamine response in VS in the first and last session of training (n = 7 mice), P = 0.016 (paired two-sided t-test), Cohen’s d = −1.25. e–g, As a–c but for TS recordings (n = 6 mice). h, As d but for the TS (n = 9 mice), P = 0.006 (paired two-sided t-test), Cohen’s d = 1.19. i, Modelled responses for APE at the time of correct contralateral choice if the previous choice for that stimulus was ipsilateral or contralateral. j, As i but for an example average (mean) TS dopamine response. k, Regression coefficients. One-sided t-test against zero, corrected using the Bonferroni method for multiple comparisons. VS: n = 7 mice; P = 0.005, 1.0, 1.0, 1.0, 1.0; Cohen’s d = 2.23, 0.37, 0.23, 0.17, 0.13 (left to right). TS: n = 6 mice; P = 0.04, 0.20, 0.20, 0.47, 0.63; Cohen’s d = −1.72, −1.13, −1.13, −0.84, −0.75 (left to right). l,m, As i,j but for the VS response. n, Task design. WN, white noise. o, Modelled APE and RPE signals following the state change. p, Example TS dopamine responses to the contralateral choice in response to the normal or the white noise cue. q, TS dopamine response to the contralateral action before and after the introduction of the novel state (n = 6 mice), P = 0.01 (paired two-sided t-test), Cohen’s d = −1.81. r, As p but for a VS recording aligned to cue. s, As q but for VS recording aligned to cue (n = 7 mice), P = 0.02 (Wilcoxon signed-rank test), Cohen’s d = 1.04. Error bars represent s.e.m.
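
The paired comparisons and Bonferroni-corrected tests named in this caption can be illustrated with a minimal sketch. It is not the authors' analysis code. The function names, the session values and the raw P values below are hypothetical, and the effect size is computed as the mean of the paired differences divided by their standard deviation, which is one common convention for Cohen's d in paired designs; the caption does not state which variant was used.

```python
import math
import statistics


def paired_t_and_cohens_d(first, last):
    """Return (t statistic, Cohen's d) for paired samples.

    Assumption: d is mean(diff) / sd(diff), one common paired-design
    convention; the caption does not say which variant was used.
    """
    if len(first) != len(last) or len(first) < 2:
        raise ValueError("need two equal-length samples with at least 2 pairs")
    diffs = [b - a for a, b in zip(first, last)]
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample standard deviation (n - 1)
    d = mean_diff / sd_diff
    t = d * math.sqrt(len(diffs))  # paired t statistic, df = n - 1
    return t, d


def bonferroni(p_values):
    """Multiply each P value by the number of tests, capping at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


# Hypothetical response sizes for 7 animals (not data from the paper).
first_session = [1.0, 1.2, 0.9, 1.1, 1.3, 1.0, 0.8]
last_session = [0.6, 0.9, 0.7, 0.6, 1.0, 0.9, 0.5]
t_stat, cohens_d = paired_t_and_cohens_d(first_session, last_session)

# Hypothetical uncorrected P values for five regression coefficients.
adjusted = bonferroni([0.001, 0.2, 0.4, 0.5, 0.9])
```

Converting the t statistic to a P value requires the t distribution with n − 1 degrees of freedom, which is left to a statistics library here. Capping at 1 in the Bonferroni step is why corrected P values can be reported as exactly 1.0.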