Relative salience signaling within a thalamo-orbitofrontal circuit governs learning rate

Curr Biol. 2021 Dec 6;31(23):5176-5191.e5. doi: 10.1016/j.cub.2021.09.037. Epub 2021 Oct 11.

Abstract

Learning to predict rewards is essential for the sustained fitness of animals. Contemporary views suggest that such learning is driven by a reward prediction error (RPE)-the difference between received and predicted rewards. The magnitude of learning induced by an RPE is proportional to the product of the RPE and a learning rate. Here we demonstrate using two-photon calcium imaging and optogenetics in mice that certain functionally distinct subpopulations of ventral/medial orbitofrontal cortex (vmOFC) neurons signal learning rate control. Consistent with learning rate control, trial-by-trial fluctuations in vmOFC activity positively correlate with behavioral updating when the RPE is positive, and negatively correlates with behavioral updating when the RPE is negative. Learning rate is affected by many variables including the salience of a reward. We found that the average reward response of these neurons signals the relative salience of a reward, because it decreases after reward prediction learning or the introduction of another highly salient aversive stimulus. The relative salience signaling in vmOFC is sculpted by medial thalamic inputs. These results support emerging theoretical views that prefrontal cortex encodes and controls learning parameters.

Keywords: associative learning; learning rate; medial thalamus; optogenetics; orbitofrontal cortex; reward prediction error; salience; trace conditioning; two-photon imaging.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Learning* / physiology
  • Mice
  • Neurons / physiology
  • Optogenetics
  • Prefrontal Cortex / physiology
  • Reward*