When searching for rewards in complex, unfamiliar environments, it is often impossible to explore all options. Wu et al. show how a combination of generalization and optimistic sampling guides efficient human exploration in complex environments.
- Charley M. Wu
- Eric Schulz
- Björn Meder