Muller et al. show that some neurons in the cortex learn faster from better-than-expected outcomes compared to worse-than-expected ones; others do the converse, resulting in simultaneous optimism and pessimism, as predicted by distributional reinforcement learning.
- Timothy H. Muller
- James L. Butler
- Steven W. Kennerley