Nucleus accumbens dopamine release reflects Bayesian inference during instrumental learning.

Dopamine release in the nucleus accumbens has been hypothesized to signal the difference between observed and predicted reward, known as reward prediction error, suggesting a biological implementation for reinforcement learning. Rigorous tests of this hypothesis require assumptions about how the bra...

Full description

Saved in:
Bibliographic Details
Main Authors: Albert J Qü, Lung-Hao Tai, Christopher D Hall, Emilie M Tu, Maria K Eckstein, Karyna Mishchanchuk, Wan Chen Lin, Juliana B Chase, Andrew F MacAskill, Anne G E Collins, Samuel J Gershman, Linda Wilbrecht
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-07-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1013226
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Dopamine release in the nucleus accumbens has been hypothesized to signal the difference between observed and predicted reward, known as reward prediction error, suggesting a biological implementation for reinforcement learning. Rigorous tests of this hypothesis require assumptions about how the brain maps sensory signals to reward predictions, yet this mapping is still poorly understood. In particular, the mapping is non-trivial when sensory signals provide ambiguous information about the hidden state of the environment. Previous work using classical conditioning tasks has suggested that reward predictions are generated conditional on probabilistic beliefs about the hidden state, such that dopamine implicitly reflects these beliefs. Here we test this hypothesis in the context of an instrumental task (a two-armed bandit), where the hidden state switches stochastically. We measured choice behavior and recorded dLight signals that reflect dopamine release in the nucleus accumbens core. Model comparison among a wide set of cognitive models based on the behavioral data favored models that used Bayesian updating of probabilistic beliefs. These same models also quantitatively matched mesolimbic dLight measurements better than non-Bayesian alternatives. We conclude that probabilistic belief computation contributes to instrumental task performance in mice and is reflected in mesolimbic dopamine signaling.
ISSN:1553-734X
1553-7358