Reinforcement learning: Dopamine ramps with fuzzy value estimates

Curr Biol. 2022 Mar 14;32(5):R213-R215. doi: 10.1016/j.cub.2022.01.070.

Abstract

A new study in reinforcement learning theory shows that extending the temporal difference algorithm to unbiased learning under state uncertainty explains the observed ramping behaviour of dopamine neurons.

Publication types

  • Comment

MeSH terms

  • Dopamine*
  • Learning / physiology
  • Models, Neurological*
  • Reinforcement, Psychology
  • Uncertainty

Substances

  • Dopamine