Hostname: page-component-8448b6f56d-c4f8m Total loading time: 0 Render date: 2024-04-18T05:02:09.569Z Has data issue: false hasContentIssue false

100 words on reward prediction error – 100 words

Published online by Cambridge University Press:  23 December 2019

Rights & Permissions [Opens in a new window]

Abstract

Type
Extras
Copyright
Copyright © The Author 2019 

Reward prediction error is like Marmite – you either love it or hate it. I hate it because it commits to a view of the brain that inherits from 20th-century behaviourism and reinforcement learning. When people say dopamine encodes reward prediction error, they are assuming that the brain is in the game of maximising reward. But it is not – the brain updates its beliefs and selects a preferred course of action. On this (planning as active inference) view, the available evidence suggests that dopamine encodes the precision of beliefs about policies – or, more simply, the confidence afforded (subpersonal) plans of action.

Submit a response

eLetters

No eLetters have been published for this article.