The authors use a series of self-finding games—wherein players must identify themselves when there are multiple potential candidates—to show that humans are near optimal at self-orienting, whereas popular reinforcement learning algorithms are not.
- Julian De Freitas
- Ahmet Kaan Uğuralp
- Tomer D. Ullman