Learning to minimize efforts versus maximizing rewards: computational principles and neural correlates

Vasilisa Skvortsova; Stefano Palminteri; Mathias Pessiglione

doi:10.1523/JNEUROSCI.1350-14.2014

Learning to minimize efforts versus maximizing rewards: computational principles and neural correlates

J Neurosci. 2014 Nov 19;34(47):15621-30. doi: 10.1523/JNEUROSCI.1350-14.2014.

Authors

Vasilisa Skvortsova¹, Stefano Palminteri², Mathias Pessiglione³

Affiliations

¹ Motivation, Brain and Behavior Laboratory, Neuroimaging Research Center, Brain and Spine Institute, INSERM U975, CNRS UMR 7225, UPMC-P6 UMR S 1127, 7561 Paris Cedex 13, France.
² Motivation, Brain and Behavior Laboratory, Neuroimaging Research Center, Brain and Spine Institute, INSERM U975, CNRS UMR 7225, UPMC-P6 UMR S 1127, 7561 Paris Cedex 13, France, Laboratoire de Neurosciences Cognitives, INSERM U960, and Département d'Etudes Cognitives, Ecole Normale Supérieure, 7505, Paris, France.
³ Motivation, Brain and Behavior Laboratory, Neuroimaging Research Center, Brain and Spine Institute, INSERM U975, CNRS UMR 7225, UPMC-P6 UMR S 1127, 7561 Paris Cedex 13, France, [email protected].

Abstract

The mechanisms of reward maximization have been extensively studied at both the computational and neural levels. By contrast, little is known about how the brain learns to choose the options that minimize action cost. In principle, the brain could have evolved a general mechanism that applies the same learning rule to the different dimensions of choice options. To test this hypothesis, we scanned healthy human volunteers while they performed a probabilistic instrumental learning task that varied in both the physical effort and the monetary outcome associated with choice options. Behavioral data showed that the same computational rule, using prediction errors to update expectations, could account for both reward maximization and effort minimization. However, these learning-related variables were encoded in partially dissociable brain areas. In line with previous findings, the ventromedial prefrontal cortex was found to positively represent expected and actual rewards, regardless of effort. A separate network, encompassing the anterior insula, the dorsal anterior cingulate, and the posterior parietal cortex, correlated positively with expected and actual efforts. These findings suggest that the same computational rule is applied by distinct brain systems, depending on the choice dimension-cost or benefit-that has to be learned.

Keywords: computational modeling; effort; reinforcement learning; reward; ventromedial prefrontal cortex.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Adolescent
Adult
Choice Behavior
Computer Simulation
Cues
Female
Humans
Learning / physiology*
Magnetic Resonance Imaging
Male
Models, Neurological*
Physical Exertion / physiology*
Reward*
Young Adult