A large body of evidence indicates that the phasic responses of midbrain dopamine neurons bear a remarkable similarity to a teaching signal used in machine learning, the temporal difference (TD) error. However, previous studies failed to observe a key prediction of this algorithm: when an agent associates a cue and a reward that are separated in time, the dopamine signal should shift gradually backward in time, from the moment of the reward to the moment of the cue, over multiple trials. Here we demonstrate that such a gradual shift occurs in mice, both in the activity of dopaminergic neurons and in dopamine release in the ventral striatum. Our results establish a long-sought link between dopaminergic activity and the TD learning algorithm, providing fundamental insights into how the brain associates cues and rewards that are separated in time.
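The prediction can be illustrated with a minimal TD(0) simulation. This is a sketch under assumed settings, not the model analyzed in the study: one state per time step between cue and reward, a pre-cue baseline state whose value is held at zero to reflect an unpredicted cue, and illustrative choices for the number of steps, learning rate and discount factor. Across trials, the time step at which the TD error peaks drifts backward from the reward toward the cue.

import numpy as np

# Minimal TD(0) sketch (illustrative parameters, not the study's fitted model).
# One state per time step; state 0 is a pre-cue baseline whose value stays 0
# because the cue arrives unpredictably; the reward arrives on the final step.
n_steps = 20                     # within-trial steps: step 1 = cue onset, step 20 = reward
alpha, gamma = 0.1, 0.95         # learning rate and discount factor (assumed values)
n_trials = 500

V = np.zeros(n_steps + 2)        # state values; V[0] = pre-cue baseline, V[-1] = terminal
rewards = np.zeros(n_steps + 1)
rewards[n_steps] = 1.0           # reward delivered on the final transition

peak_step = []
for trial in range(n_trials):
    delta = np.zeros(n_steps + 1)
    for t in range(n_steps + 1):
        # TD error for the transition out of state t
        delta[t] = rewards[t] + gamma * V[t + 1] - V[t]
        if t > 0:                # the unpredicted pre-cue state is not updated
            V[t] += alpha * delta[t]
    peak_step.append(int(np.argmax(delta)))   # step at which the TD error peaks this trial

# Early trials peak at the reward step (index 20); late trials peak at cue onset
# (index 0), with the peak drifting backward through intermediate steps in between.
print([peak_step[i] for i in (0, 30, 60, 120, 250, n_trials - 1)])

In this sketch the gradual drift arises because each state's value is pulled toward the discounted value of the next state one trial at a time, so the prediction error forms a wave that propagates backward from the reward to the cue over repeated trials.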