Learning the value of information in an uncertain world
References
Ernst, M.O. & Banks, M.S. Humans integrate visual and haptic information in a statistically optimal fashion. Nature 415, 429–433 (2002).
Kording, K.P. & Wolpert, D.M. Bayesian integration in sensorimotor learning. Nature 427, 244–247 (2004).
Kahneman, D. & Tversky, A. Choices, Values and Frames (Cambridge University Press, Cambridge, 2000).
Montague, P.R., Dayan, P., Person, C. & Sejnowski, T.J. Bee foraging in uncertain environments using predictive hebbian learning. Nature 377, 725–728 (1995).
Samejima, K., Ueda, Y., Doya, K. & Kimura, M. Representation of action-specific reward values in the striatum. Science 310, 1337–1340 (2005).
Daw, N.D., O'Doherty, J.P., Dayan, P., Seymour, B. & Dolan, R.J. Cortical substrates for exploratory decisions in humans. Nature 441, 876–879 (2006).
Bayer, H.M. & Glimcher, P.W. Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron 47, 129–141 (2005).
Rescorla, R.A. & Wagner, A.R. in Classical Conditioning II: Current Research and Theory (eds. Black, A.H. & Prokasy, W.F.) 64–99 (Appleton-Century-Crofts, New York, 1972).
Sutton, R.S. & Barto, A.G. Reinforcement Learning: an Introduction (MIT Press, Cambridge, Massachusetts, 1998).
Dayan, P., Kakade, S. & Montague, P.R. Learning and selective attention. Nat. Neurosci. 3 Suppl., 1218–1223 (2000).
Pearce, J.M. & Hall, G. A model for Pavlovian learning: variations in the effectiveness of conditioned, but not of unconditioned, stimuli. Psychol. Rev. 87, 532–552 (1980).
Dickinson, A. & Mackintosh, N.J. Classical conditioning in animals. Annu. Rev. Psychol. 29, 587–612 (1978).
Cox, R.T. Probability, frequency and reasonable expectation. Am. J. Phys. 14, 1–13 (1946).
Kakade, S. & Dayan, P. Acquisition and extinction in autoshaping. Psychol. Rev. 109, 533–544 (2002).
Courville, A.C., Daw, N.D. & Touretzky, D.S. Bayesian theories of conditioning in a changing world. Trends Cogn. Sci. 10, 294–300 (2006).
Sugrue, L.P., Corrado, G.S. & Newsome, W.T. Matching behavior and the representation of value in the parietal cortex. Science 304, 1782–1787 (2004).
Kennerley, S.W., Walton, M.E., Behrens, T.E., Buckley, M.J. & Rushworth, M.F. Optimal decision making and the anterior cingulate cortex. Nat. Neurosci. 9, 940–947 (2006).
Gallistel, C.R., Mark, T.A., King, A.P. & Latham, P.E. The rat approximates an ideal detector of changes in rates of reward: implications for the law of effect. J. Exp. Psychol. Anim. Behav. Process. 27, 354–372 (2001).
Procyk, E., Tanaka, Y.L. & Joseph, J.P. Anterior cingulate activity during routine and nonroutine sequential behaviors in macaques. Nat. Neurosci. 3, 502–508 (2000).
Walton, M.E., Devlin, J.T. & Rushworth, M.F. Interactions between decision making and performance monitoring within prefrontal cortex. Nat. Neurosci. 7, 1259–1265 (2004).
Niki, H. & Watanabe, M. Prefrontal and cingulate unit activity during timing behavior in the monkey. Brain Res. 171, 213–224 (1979).
Ullsperger, M. & von Cramon, D.Y. Error monitoring using external feedback: specific roles of the habenular complex, the reward system and the cingulate motor area revealed by functional magnetic resonance imaging. J. Neurosci. 23, 4308–4314 (2003).
Brown, J.W. & Braver, T.S. Learned predictions of error likelihood in the anterior cingulate cortex. Science 307, 1118–1121 (2005).
Ito, S., Stuphorn, V., Brown, J.W. & Schall, J.D. Performance monitoring by the anterior cingulate cortex during saccade countermanding. Science 302, 120–122 (2003).
Matsumoto, K., Suzuki, W. & Tanaka, K. Neuronal correlates of goal-based motor selection in the prefrontal cortex. Science 301, 229–232 (2003).
Smith, S.M. et al. Advances in functional and structural MR image analysis and implementation as FSL. Neuroimage 23 Suppl. 1, S208–S219 (2004).
Koechlin, E., Ody, C. & Kouneiher, F. The architecture of cognitive control in the human prefrontal cortex. Science 302, 1181–1185 (2003).
Strick, P.L., Dum, R.P. & Picard, N. Motor areas on the medial wall of the hemisphere. Novartis Found. Symp. 218, 64–75; discussion 75–80, 104–108 (1998).
Van Hoesen, G.W., Morecraft, R.J. & Vogt, B.A. in Neurobiology of Cingulate Cortex and Limbic Thalamus (eds. Vogt, B.A. & Gabriel, M.) (Birkhauser, Boston, 1993).
McCoy, A.N. & Platt, M.L. Risk-sensitive neurons in macaque posterior cingulate cortex. Nat. Neurosci. 8, 1220–1227 (2005).
Fiorillo, C.D., Tobler, P.N. & Schultz, W. Discrete coding of reward probability and uncertainty by dopamine neurons. Science 299, 1898–1902 (2003).
Preuschoff, K., Bossaerts, P. & Quartz, S.R. Neural differentiation of expected reward and risk in human subcortical structures. Neuron 51, 381–390 (2006).
Aston-Jones, G. & Cohen, J.D. An integrative theory of locus coeruleus-norepinephrine function: adaptive gain and optimal performance. Annu. Rev. Neurosci. 28, 403–450 (2005).
Engle, R.F. Autoregressive conditional heteroscedasticity with estimates of the variance of UK inflation. Econometrica 50, 987–1008 (1982).
Waelti, P., Dickinson, A. & Schultz, W. Dopamine responses comply with basic assumptions of formal learning theory. Nature 412, 43–48 (2001).
O'Doherty, J. et al. Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science 304, 452–454 (2004).
Haruno, M. et al. A neural correlate of reward-based behavioral learning in caudate nucleus: a functional magnetic resonance imaging study of a stochastic decision task. J. Neurosci. 24, 1660–1665 (2004).
Tanaka, S.C. et al. Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops. Nat. Neurosci. 7, 887–893 (2004).
Kunishio, K. & Haber, S.N. Primate cingulostriatal projection: limbic striatal versus sensorimotor striatal input. J. Comp. Neurol. 350, 337–356 (1994).
Amiez, C., Joseph, J.P. & Procyk, E. Reward encoding in the monkey anterior cingulate cortex. Cereb. Cortex 16, 1040–1055 (2006).
Yoshida, W. & Ishii, S. Resolution of uncertainty in prefrontal cortex. Neuron 50, 781–789 (2006).
Fitzgerald, K.D. et al. Error-related hyperactivity of the anterior cingulate cortex in obsessive-compulsive disorder. Biol. Psychiatry 57, 287–294 (2005).
Critchley, H.D., Mathias, C.J. & Dolan, R.J. Neural activity in the human brain relating to uncertainty and arousal during anticipation. Neuron 29, 537–545 (2001).
Botvinick, M.M., Cohen, J.D. & Carter, C.S. Conflict monitoring and anterior cingulate cortex: an update. Trends Cogn. Sci. 8, 539–546 (2004).
Rushworth, M.F., Buckley, M.J., Behrens, T.E., Walton, M.E. & Bannerman, D.M. Functional organization of the medial frontal cortex. Curr. Opin. Neurobiol. 17, 220–227 (2007).
Hampton, A.N., Bossaerts, P. & O'Doherty, J.P. The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans. J. Neurosci. 26, 8360–8367 (2006).
Preuschoff, K. & Bossaerts, P. Adding prediction risk to the theory of reward learning. Ann. N.Y. Acad. Sci. 1104, 135–146 (2007).