Many population coding models of reinforcement learning assign a single global reward signal to the entire population. As the population size increases, however, this reward signal becomes less and less informative about the performance of any single neuron, which slows down learning. This computational modeling study shows that an additional term in the synaptic plasticity rule, reflecting the population response, speeds up learning.
- Robert Urbanczik
- Walter Senn
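
The dilution of the global reward signal described in the summary can be illustrated with a small toy simulation. This is only a sketch under assumptions that are not taken from the study: each neuron emits an independent random output of +1 or -1, the population decision is a simple majority vote, and the reward is +1 when that vote matches a fixed target of +1. The function name `reward_correlation` and its parameters are hypothetical. The sketch does not implement the population response term or any learning rule; it only shows how weakly a single neuron's output correlates with the shared reward as the population grows.

```python
import math
import random


def reward_correlation(n_neurons, n_trials=5000, seed=0):
    """Pearson correlation between one neuron's output and the global reward.

    Toy assumptions (not from the study): independent random +/-1 outputs,
    majority-vote population decision, fixed target decision of +1.
    """
    if n_neurons % 2 == 0:
        raise ValueError("use an odd population size so the vote has no ties")
    rng = random.Random(seed)
    outputs, rewards = [], []
    for _ in range(n_trials):
        bits = rng.getrandbits(n_neurons)   # one random bit per neuron
        ones = bin(bits).count("1")         # number of neurons voting +1
        vote = 2 * ones - n_neurons         # sum of the +/-1 outputs
        # Single global reward shared by every neuron in the population.
        rewards.append(1.0 if vote > 0 else -1.0)
        # Output of one arbitrary neuron (bit 0).
        outputs.append(1.0 if bits & 1 else -1.0)
    mean_o = sum(outputs) / n_trials
    mean_r = sum(rewards) / n_trials
    cov = sum((o - mean_o) * (r - mean_r) for o, r in zip(outputs, rewards))
    var_o = sum((o - mean_o) ** 2 for o in outputs)
    var_r = sum((r - mean_r) ** 2 for r in rewards)
    return cov / math.sqrt(var_o * var_r)


if __name__ == "__main__":
    for n in (1, 11, 101, 1001):
        print(n, round(reward_correlation(n), 3))
```

With a single neuron the reward is determined entirely by that neuron, so the correlation is 1. In this toy setting the correlation shrinks as neurons are added, roughly in proportion to one over the square root of the population size, which is one way to picture why a purely global reward gives each synapse a weaker learning signal in large populations.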