Supplementary Figure 8: Learning phase in the probabilistic task: experimental data and model comparison
From: Nicotinic receptors in the ventral tegmental area promote uncertainty-seeking

(a,b) Evolution of the proportion of choices of the three rewarded locations in the uncertain setting across the learning sessions, for WT (a) and β2KO (b) mice. (c,d) Difference in Bayesian information criterion (BIC), relative to the standard RL model, for models including an expected uncertainty bonus (“uncertainty”), an adaptive learning rate (“adaptive LR”), or an unexpected uncertainty bonus, for WT (c) and β2KO (d) mice. (e,f) Fits of the winning models to the experimental data shown in (a,b): the expected uncertainty model for WT mice and the standard RL model for β2KO mice.
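
For readers unfamiliar with the comparison in panels (c,d), the sketch below shows how a BIC difference against a baseline model is typically computed, using the textbook definition BIC = k ln(n) - 2 ln(L). It is only an illustration: the function names, parameter counts, trial count, and log-likelihood values are hypothetical and are not taken from the paper, and the authors' exact fitting procedure may differ.

```python
import math


def bic(log_likelihood: float, n_params: int, n_obs: int) -> float:
    """Textbook Bayesian information criterion: k * ln(n) - 2 * ln(L)."""
    return n_params * math.log(n_obs) - 2.0 * log_likelihood


def delta_bic(candidate: tuple, baseline: tuple, n_obs: int) -> float:
    """BIC of a candidate model minus BIC of the baseline model.

    Each model is given as (log_likelihood, n_params).
    A negative value means the candidate is preferred over the baseline.
    """
    cand_ll, cand_k = candidate
    base_ll, base_k = baseline
    return bic(cand_ll, cand_k, n_obs) - bic(base_ll, base_k, n_obs)


# Hypothetical numbers, for illustration only (not data from the study).
n_trials = 100
standard_rl = (-100.0, 2)        # (log-likelihood, number of free parameters)
uncertainty_bonus = (-95.0, 3)   # one extra parameter for the bonus term

diff = delta_bic(uncertainty_bonus, standard_rl, n_trials)
print(f"Delta BIC vs. standard RL: {diff:.2f}")
```

Under this sign convention, a model with an extra parameter is favoured only if its gain in log-likelihood outweighs the ln(n) penalty per added parameter.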