Learning where to look for a hidden target

Leanne Chukoskie; Joseph Snider; Michael C Mozer; Richard J Krauzlis; Terrence J Sejnowski

doi:10.1073/pnas.1301216110

Learning where to look for a hidden target

Proc Natl Acad Sci U S A. 2013 Jun 18;110 Suppl 2(Suppl 2):10438-45. doi: 10.1073/pnas.1301216110. Epub 2013 Jun 10.

Authors

Leanne Chukoskie¹, Joseph Snider, Michael C Mozer, Richard J Krauzlis, Terrence J Sejnowski

Affiliation

¹ Institute for Neural Computation, University of California at San Diego, La Jolla, CA 92093, USA.

Abstract

Survival depends on successfully foraging for food, for which evolution has selected diverse behaviors in different species. Humans forage not only for food, but also for information. We decide where to look over 170,000 times per day, approximately three times per wakeful second. The frequency of these saccadic eye movements belies the complexity underlying each individual choice. Experience factors into the choice of where to look and can be invoked to rapidly redirect gaze in a context and task-appropriate manner. However, remarkably little is known about how individuals learn to direct their gaze given the current context and task. We designed a task in which participants search a novel scene for a target whose location was drawn stochastically on each trial from a fixed prior distribution. The target was invisible on a blank screen, and the participants were rewarded when they fixated the hidden target location. In just a few trials, participants rapidly found the hidden targets by looking near previously rewarded locations and avoiding previously unrewarded locations. Learning trajectories were well characterized by a simple reinforcement-learning (RL) model that maintained and continually updated a reward map of locations. The RL model made further predictions concerning sensitivity to recent experience that were confirmed by the data. The asymptotic performance of both the participants and the RL model approached optimal performance characterized by an ideal-observer theory. These two complementary levels of explanation show how experience in a novel environment drives visual search in humans and may extend to other forms of search such as animal foraging.

Keywords: ideal observer; oculomotor; reinforcement learning; saccades.

Publication types

Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Animals
Female
Humans
Male
Models, Biological*
Problem Solving / physiology*
Problem-Based Learning*
Visual Perception / physiology*

Grants and funding

Howard Hughes Medical Institute/United States