Visual search requires recognizing an object “invariantly”, that is, despite changes in its appearance. Here, the authors show that humans can efficiently and invariantly search for objects in complex scenes, and they introduce a biologically inspired zero-shot model that captures human eye movements during search.
- Mengmi Zhang
- Jiashi Feng
- Gabriel Kreiman