NYAS Conferences
New York Academy of Sciences
left end
Search
divider divider feedback right end
Annals of the New York Academy of Sciences Annals of the New York Academy of Sciences login

Main

Browse Volumes

Forthcoming Volumes

Annals PrePrints

Annals Extra

E-mail Alerts

Subscriptions & Orders

New Proposals

Author Guidelines

About Annals

Help

Get free Annals volume as a NYAS member: http://www.nyas.org/annalsreaderhw
Issue 1104 coverReward and Decision Making in Corticobasal Ganglia Networks Volume 1104 published June 2007
Ann. N.Y. Acad. Sci. 1104: 135–146 (2007). doi: 10.1196/annals.1390.005
Copyright © 2007 by the New York Academy of Sciences
description | purchase volume purchase this volume

This Volume
Table of Contents
Description
This Article
Full Text
Full Text (PDF)
All Versions of this Article:
annals.1390.005v1
1104/1/135    most recent
Services
Similar articles in this journal
Similar articles in PubMed
Alert me to new issues of the journal
Download to citation manager
Citing Articles
Citing Articles via HighWire
Citing Articles via Google Scholar
Google Scholar
Articles by PREUSCHOFF, K.
Articles by BOSSAERTS, P.
Search for Related Content
PubMed
PubMed Citation
Articles by PREUSCHOFF, K.
Articles by BOSSAERTS, P.

Original Articles

Adding Prediction Risk to the Theory of Reward Learning

KERSTIN PREUSCHOFFa AND PETER BOSSAERTSa

a Computation and Neural Systems, California Institute of Technology, Pasadena, California, USA

Key Words: reinforcement learning • learning rate • least squares learning • dopaminergic system • reward anticipation • prediction risk • uncertainty • adaptive encoding

Address for correspondence: Peter Bossaerts, m/c 228-77 California Institute of Technology, Pasadena, CA 91125, USA. Voice: +1-626-395-4028; fax: +1-626-405-9841.  pbs{at}rioja.caltech.edu

This article analyzes the simple Rescorla–Wagner learning rule from the vantage point of least squares learning theory. In particular, it suggests how measures of risk, such as prediction risk, can be used to adjust the learning constant in reinforcement learning. It argues that prediction risk is most effectively incorporated by scaling the prediction errors. This way, the learning rate needs adjusting only when the covariance between optimal predictions and past (scaled) prediction errors changes. Evidence is discussed that suggests that the dopaminergic system in the (human and nonhuman) primate brain encodes prediction risk, and that prediction errors are indeed scaled with prediction risk (adaptive encoding).




This article has been cited by other articles:


Home page
J. Neurosci.Home page
K. Preuschoff, S. R. Quartz, and P. Bossaerts
Human Insula Activation Reflects Risk Prediction Errors As Well As Risk
J. Neurosci., March 12, 2008; 28(11): 2745 - 2752.
[Abstract] [Full Text] [PDF]



footerLeft footerRight