Mark Burrell (@mhburrell)

🚨Our preprint is online!🚨

www.biorxiv.org/content/10.1...

How do #dopamine neurons perform the key calculations in reinforcement #learning?

Read on to find out more! 🧵

19.09.2025 13:05 👍 197 🔁 71 💬 11 📌 4

Read more about it and see our smiling faces: https://www.mcb.harvard.edu/department/news/decoding-learning-how-cues-and-rewards-shape-behavior-and-dopamine-signals/ @harvardmcb.bsky.social (12/12)

18.03.2025 15:04 👍 6 🔁 1 💬 0 📌 0

This work is a product of a tremendous team: Lechen (Selina) Qian, Jay A Hennig (@jhennig.bsky.social), Sara Matias (@saramatias.bsky.social), Venki Murthy (@neurovenki.bsky.social), Sam Gershman (@gershbrain.bsky.social) and Naoshige Uchida (@naoshigeuchida.bsky.social)
(11/12)

18.03.2025 15:04 👍 4 🔁 1 💬 1 📌 0

We thank the reviewers for their comments, who helped us refine our explanations of the various models and why they succeed or fail.
(10/12)

18.03.2025 15:04 👍 2 🔁 0 💬 1 📌 0

In total, we show how TD learning can be used as a comprehensive explanation of the effects of contingency on associative learning and discuss how this guides our future study of how the brain learns causality. (9/12)

18.03.2025 15:04 👍 2 🔁 0 💬 1 📌 0

Finally, we showed a novel model that relies on retrospective contingency, ANCCR (doi:10.1126/science.abq6740), does not explain our results under any parameter combination. (8/12)

18.03.2025 15:04 👍 3 🔁 0 💬 1 📌 0

Moreover, working with @jhennig.bsky.social, we showed small RNNS develop similar state space representations and explain our results in the same manner. (7/12)

18.03.2025 15:04 👍 2 🔁 0 💬 1 📌 0

We sought to identify a TD model to explain these changes. While several classic implementations of TD did not working (e.g. CSC & microstimuli), with a state representation that incorporated the animal’s learned knowledge of the task structure, TD was able to explain all our results. (6/12)

18.03.2025 15:04 👍 2 🔁 0 💬 1 📌 0

Another group, received the same increase in the number of rewards, but new rewards were preceded by a novel cue. This important control reveals that a classic definition of contingency ∆𝑃 does not adequately describe the pattern of changes in dopamine and behavior. (5/12)

18.03.2025 15:04 👍 2 🔁 0 💬 1 📌 0

We performed a Pavlovian contingency degradation task to examine how behavior and dopamine activities are modulated in contingency learning. In this task, mice were first trained in a simple conditioning task. Then one group of mice received both cued and uncued rewards. (4/12)

18.03.2025 15:04 👍 2 🔁 0 💬 1 📌 0

Contingency, the degree to which a stimulus predicts an outcome, is a critical factor in shaping animal behavior during associative learning. But the neural mechanisms linking contingency to behavior have been elusive. (3/12)

18.03.2025 15:04 👍 2 🔁 0 💬 1 📌 0

In short, we found that we can explain the effects of contingency on both an animal’s behavior and ventral striatum dopamine response using temporal difference learning when equipped with appropriate state space representations. (2/12)

18.03.2025 15:04 👍 3 🔁 0 💬 1 📌 0

I’m happy to share our latest work (co-lead by Selina Qian) has today been published in its final form in @:natureneuro.bsky.social: Read here: https://www.nature.com/articles/s41593-025-01915-4
(1/12)

18.03.2025 15:04 👍 31 🔁 14 💬 2 📌 2

Mark Burrell

Latest posts by Mark Burrell @mhburrell