Reward, motivation, and reinforcement learning.

PubWeight™: 4.28 | Rank: Top 1%

PMID 12383782

Published in Neuron on October 10, 2002

Authors

Peter Dayan (1), Bernard W Balleine

Author Affiliations

1: Gatsby Computational Neuroscience Unit, University College London, 17 Queen Square, WC1N 3AR, London, United Kingdom. dayan@gatsby.ucl.ac.uk

Articles citing this

(truncated to the top 100)

The debate over dopamine's role in reward: the case for incentive salience. Psychopharmacology (Berl) (2006) 9.18

A framework for studying the neurobiology of value-based decision making. Nat Rev Neurosci (2008) 8.12

Empathic neural responses are modulated by the perceived fairness of others. Nature (2006) 6.04

The neurobiology of decision: consensus and controversy. Neuron (2009) 4.77

Dissecting components of reward: 'liking', 'wanting', and learning. Curr Opin Pharmacol (2009) 4.46

Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nat Neurosci (2007) 3.96

Is avoiding an aversive outcome rewarding? Neural substrates of avoidance learning in the human brain. PLoS Biol (2006) 3.80

Reward-guided learning beyond dopamine in the nucleus accumbens: the integrative functions of cortico-basal ganglia networks. Eur J Neurosci (2008) 3.07

Rewards evoke learning of unconsciously processed visual stimuli in adult humans. Neuron (2009) 2.92

A unified framework for addiction: vulnerabilities in the decision process. Behav Brain Sci (2008) 2.83

Layered reward signalling through octopamine and dopamine in Drosophila. Nature (2012) 2.80

Decision making, movement planning and statistical decision theory. Trends Cogn Sci (2008) 2.67

Top-down versus bottom-up attentional control: a failed theoretical dichotomy. Trends Cogn Sci (2012) 2.51

Do patients with schizophrenia exhibit aberrant salience? Psychol Med (2008) 2.28

From reinforcement learning models to psychiatric and neurological disorders. Nat Neurosci (2011) 2.20

How humans integrate the prospects of pain and reward during choice. J Neurosci (2009) 2.14

Calculating utility: preclinical evidence for cost-benefit analysis by mesolimbic dopamine. Psychopharmacology (Berl) (2006) 2.03

Moment-to-moment tracking of state value in the amygdala. J Neurosci (2008) 2.02

From prediction error to incentive salience: mesolimbic computation of reward motivation. Eur J Neurosci (2012) 2.02

Goal representations and motivational drive in schizophrenia: the role of prefrontal-striatal interactions. Schizophr Bull (2010) 2.01

Amygdala central nucleus interacts with dorsolateral striatum to regulate the acquisition of habits. J Neurosci (2012) 2.00

Distinct error-correcting and incidental learning of location relative to landmarks and boundaries. Proc Natl Acad Sci U S A (2008) 1.96

The role of dopamine in the accumbens core in the expression of Pavlovian-conditioned responses. Eur J Neurosci (2012) 1.94

Striatal activity underlies novelty-based choice in humans. Neuron (2008) 1.92

Integrating hippocampus and striatum in decision-making. Curr Opin Neurobiol (2008) 1.87

Central and peripheral regulation of food intake and physical activity: pathways and genes. Obesity (Silver Spring) (2008) 1.71

Neurocomputational models of basal ganglia function in learning, memory and choice. Behav Brain Res (2008) 1.71

The neurobiology of psychopathic traits in youths. Nat Rev Neurosci (2013) 1.67

Dopamine reveals neural circuit mechanisms of fly memory. Trends Neurosci (2010) 1.66

Frontal cortex and the discovery of abstract action rules. Neuron (2010) 1.62

Dynamic computation of incentive salience: "wanting" what was never "liked". J Neurosci (2009) 1.55

Habits, action sequences and reinforcement learning. Eur J Neurosci (2012) 1.54

The human amygdala and orbital prefrontal cortex in behavioural regulation. Philos Trans R Soc Lond B Biol Sci (2007) 1.52

Distinct opioid circuits determine the palatability and the desirability of rewarding events. Proc Natl Acad Sci U S A (2009) 1.51

Behavioral functions of the mesolimbic dopaminergic system: an affective neuroethological perspective. Brain Res Rev (2007) 1.50

A neural computational model of incentive salience. PLoS Comput Biol (2009) 1.48

An "as soon as possible" effect in human intertemporal decision making: behavioral evidence and neural mechanisms. J Neurophysiol (2010) 1.44

Reward priority of visual target singletons modulates event-related potential signatures of attentional selection. Psychol Sci (2009) 1.44

The phenomenon of task-irrelevant perceptual learning. Vision Res (2009) 1.43

A neural systems analysis of the potentiation of feeding by conditioned stimuli. Physiol Behav (2005) 1.43

A single dose of nicotine enhances reward responsiveness in nonsmokers: implications for development of dependence. Biol Psychiatry (2007) 1.39

Curiosity Search: Producing Generalists by Encouraging Individuals to Continually Explore and Acquire Skills throughout Their Lifetime. PLoS One (2016) 1.38

Neural activity associated with the passive prediction of ambiguity and risk for aversive events. J Neurosci (2009) 1.37

Desperately driven and no brakes: developmental stress exposure and subsequent risk for substance abuse. Neurosci Biobehav Rev (2008) 1.37

An expanded view of energy homeostasis: neural integration of metabolic, cognitive, and emotional drives to eat. Physiol Behav (2009) 1.35

Intelligent sensing in dynamic environments using Markov decision process. Sensors (Basel) (2011) 1.32

Food reward, hyperphagia, and obesity. Am J Physiol Regul Integr Comp Physiol (2011) 1.31

The dopaminergic basis of human behaviors: A review of molecular imaging studies. Neurosci Biobehav Rev (2009) 1.30

Requirement of dopamine signaling in the amygdala and striatum for learning and maintenance of a conditioned avoidance response. Learn Mem (2011) 1.27

Neuronal correlates of instrumental learning in the dorsal striatum. J Neurophysiol (2009) 1.26

Differential effect of reward and punishment on procedural learning. J Neurosci (2009) 1.22

Poor decision-making by chronic marijuana users is associated with decreased functional responsiveness to negative consequences. Psychiatry Res (2010) 1.22

Acquisition and performance of goal-directed instrumental actions depends on ERK signaling in distinct regions of dorsal striatum in rats. J Neurosci (2010) 1.22

Stable encoding of task structure coexists with flexible coding of task events in sensorimotor striatum. J Neurophysiol (2009) 1.20

Neuroeconomics. Philos Trans R Soc Lond B Biol Sci (2004) 1.19

Tonic dopamine modulates exploitation of reward learning. Front Behav Neurosci (2010) 1.17

Molecular substrates of action control in cortico-striatal circuits. Prog Neurobiol (2011) 1.12

Endogenous calcium buffering capacity of substantia nigral dopamine neurons. J Neurophysiol (2009) 1.11

Neural coding of reward magnitude in the orbitofrontal cortex of the rat during a five-odor olfactory discrimination task. Learn Mem (2007) 1.10

Distinct basal ganglia circuits controlling behaviors guided by flexible and stable values. Neuron (2013) 1.09

Cue-elicited reward-seeking requires extracellular signal-regulated kinase activation in the nucleus accumbens. J Neurosci (2008) 1.08

Stress, genotype and norepinephrine in the prediction of mouse behavior using reinforcement learning. Nat Neurosci (2009) 1.08

Mechanisms of motivation-cognition interaction: challenges and opportunities. Cogn Affect Behav Neurosci (2014) 1.08

Conditioned associations and economic decision biases. Neuroimage (2010) 1.04

Identifying predictors, moderators, and mediators of antidepressant response in major depressive disorder: neuroimaging approaches. Am J Psychiatry (2015) 1.04

Silencing the critics: understanding the effects of cocaine sensitization on dorsolateral and ventral striatum in the context of an actor/critic model. Front Neurosci (2008) 1.03

Frontal theta overrides pavlovian learning biases. J Neurosci (2013) 1.02

Parallel basal ganglia circuits for voluntary and automatic behaviour to reach rewards. Brain (2015) 1.01

Putting desire on a budget: dopamine and energy expenditure, reconciling reward and resources. Front Integr Neurosci (2012) 1.00

Cognition in insects. Philos Trans R Soc Lond B Biol Sci (2012) 1.00

Goal-oriented searching mediated by ventral hippocampus early in trial-and-error learning. Nat Neurosci (2012) 1.00

Interactions between the Midbrain Superior Colliculus and the Basal Ganglia. Front Neuroanat (2010) 0.99

Why skill matters. Trends Cogn Sci (2013) 0.99

Computational models of reinforcement learning: the role of dopamine as a reward signal. Cogn Neurodyn (2010) 0.98

Decision utility, the brain, and pursuit of hedonic goals. Soc Cogn (2008) 0.98

Human dorsal striatum encodes prediction errors during observational learning of instrumental actions. J Cogn Neurosci (2011) 0.97

Considering PTSD from the perspective of brain processes: a psychological construction approach. J Trauma Stress (2011) 0.96

States of curiosity modulate hippocampus-dependent learning via the dopaminergic circuit. Neuron (2014) 0.96

Tectonigral projections in the primate: a pathway for pre-attentive sensory input to midbrain dopaminergic neurons. Eur J Neurosci (2009) 0.95

Integrating cortico-limbic-basal ganglia architectures for learning model-based and model-free navigation strategies. Front Behav Neurosci (2012) 0.95

Correlates of perceptual learning in an oculomotor decision variable. J Neurosci (2009) 0.95

Modelling individual differences in the form of Pavlovian conditioned approach responses: a dual learning systems approach with factored representations. PLoS Comput Biol (2014) 0.94

Cue-evoked dopamine release in the nucleus accumbens shell tracks reinforcer magnitude during intracranial self-stimulation. Neuroscience (2010) 0.94

Functional circuits and anatomical distribution of response properties in the primate amygdala. J Neurosci (2013) 0.92

Model-based hierarchical reinforcement learning and human action control. Philos Trans R Soc Lond B Biol Sci (2014) 0.92

Puppets, robots, critics, and actors within a taxonomy of attention for developmental disorders. J Int Neuropsychol Soc (2008) 0.92

Dopamine and extinction: a convergence of theory with fear and reward circuitry. Neurobiol Learn Mem (2013) 0.92

Modeling the violation of reward maximization and invariance in reinforcement schedules. PLoS Comput Biol (2008) 0.91

Dopaminergic Balance between Reward Maximization and Policy Complexity. Front Syst Neurosci (2011) 0.91

Representations of appetitive and aversive information in the primate orbitofrontal cortex. Ann N Y Acad Sci (2011) 0.90

Motivation and movement: the effect of monetary incentive on performance speed. Exp Brain Res (2011) 0.90

Individual differences in dopamine efflux in nucleus accumbens shell and core during instrumental learning. Learn Mem (2006) 0.88

Neural Mechanisms for Evaluating Environmental Variability in Caenorhabditis elegans. Neuron (2015) 0.88

Probabilistic reward- and punishment-based learning in opioid addiction: Experimental and computational data. Behav Brain Res (2015) 0.86

The effect of ratio and interval training on Pavlovian-instrumental transfer in mice. PLoS One (2012) 0.85

The cerebellum: a neural system for the study of reinforcement learning. Front Behav Neurosci (2011) 0.85

Perceptual learning to reduce sensory eye dominance beyond the focus of top-down visual attention. Vision Res (2011) 0.85

Dopaminergic enhancement of local food-seeking is under global homeostatic control. Eur J Neurosci (2011) 0.85

Vicarious reinforcement learning signals when instructing others. J Neurosci (2015) 0.85

Information content and reward processing in the human striatum during performance of a declarative memory task. Cogn Affect Behav Neurosci (2012) 0.84

Articles by these authors

Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning. Eur J Neurosci (2004) 5.62

The role of the dorsomedial striatum in instrumental conditioning. Eur J Neurosci (2005) 4.95

Double dissociation of basolateral and central amygdala lesions on the general and outcome-specific forms of pavlovian-instrumental transfer. J Neurosci (2005) 4.05

A specific role for posterior dorsolateral striatum in human habit learning. Eur J Neurosci (2009) 3.25

Reward-guided learning beyond dopamine in the nucleus accumbens: the integrative functions of cortico-basal ganglia networks. Eur J Neurosci (2008) 3.07

Inactivation of dorsolateral striatum enhances sensitivity to changes in the action-outcome contingency in instrumental conditioning. Behav Brain Res (2005) 2.91

Orbitofrontal cortex mediates outcome encoding in Pavlovian but not instrumental conditioning. J Neurosci (2007) 2.61

The role of prelimbic cortex in instrumental conditioning. Behav Brain Res (2003) 2.51

Blockade of NMDA receptors in the dorsomedial striatum prevents action-outcome learning in instrumental conditioning. Eur J Neurosci (2005) 2.41

Calculating consequences: brain systems that encode the causal effects of actions. J Neurosci (2008) 2.05

Amygdala central nucleus interacts with dorsolateral striatum to regulate the acquisition of habits. J Neurosci (2012) 2.00

Lesions of medial prefrontal cortex disrupt the acquisition but not the expression of goal-directed learning. J Neurosci (2005) 1.91

General and outcome-specific forms of Pavlovian-instrumental transfer: the effect of shifts in motivational state and inactivation of the ventral tegmental area. Eur J Neurosci (2007) 1.89

The general and outcome-specific forms of Pavlovian-instrumental transfer are differentially mediated by the nucleus accumbens core and shell. J Neurosci (2011) 1.74

Differential involvement of the basolateral amygdala and mediodorsal thalamus in instrumental action selection. J Neurosci (2008) 1.65

Instrumental and Pavlovian incentive processes have dissociable effects on components of a heterogeneous instrumental chain. J Exp Psychol Anim Behav Process (2003) 1.55

Habits, action sequences and reinforcement learning. Eur J Neurosci (2012) 1.54

Lesions of mediodorsal thalamus and anterior thalamic nuclei produce dissociable effects on instrumental conditioning in rats. Eur J Neurosci (2003) 1.49

Consolidation and reconsolidation of incentive learning in the amygdala. J Neurosci (2005) 1.43

At the limbic-motor interface: disconnection of basolateral amygdala from nucleus accumbens core and shell reveals dissociable components of incentive motivation. Eur J Neurosci (2010) 1.41

Differential dependence of Pavlovian incentive motivation and instrumental incentive learning processes on dopamine signaling. Learn Mem (2011) 1.36

Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex. J Neurosci (2009) 1.31

The contribution of orbitofrontal cortex to action selection. Ann N Y Acad Sci (2007) 1.30

Genetic control of instrumental conditioning by striatopallidal neuron-specific S1P receptor Gpr6. Nat Neurosci (2007) 1.30

Binge-like consumption of a palatable food accelerates habitual control of behavior and is dependent on activation of the dorsolateral striatum. J Neurosci (2014) 1.29

On habits and addiction: An associative analysis of compulsive drug seeking. Drug Discov Today Dis Models (2008) 1.28

Instrumental learning in hyperdopaminergic mice. Neurobiol Learn Mem (2006) 1.22

Actions, action sequences and habits: evidence that goal-directed and habitual action control are hierarchically organized. PLoS Comput Biol (2013) 1.22

Acquisition and performance of goal-directed instrumental actions depends on ERK signaling in distinct regions of dorsal striatum in rats. J Neurosci (2010) 1.22

Associative learning mechanisms underpinning the transition from recreational drug use to addiction. Ann N Y Acad Sci (2012) 1.20

Neural correlates of instrumental contingency learning: differential effects of action-reward conjunction and disjunction. J Neurosci (2011) 1.16

The thalamostriatal pathway and cholinergic control of goal-directed action: interlacing new with existing learning in the striatum. Neuron (2013) 1.14

Molecular substrates of action control in cortico-striatal circuits. Prog Neurobiol (2011) 1.12

μ-opioid receptor activation in the basolateral amygdala mediates the learning of increases but not decreases in the incentive value of a food reward. J Neurosci (2011) 1.12

The influence of Pavlovian cues on instrumental performance is mediated by CaMKII activity in the striatum. Eur J Neurosci (2007) 1.10

Extracellular dopamine levels in striatal subregions track shifts in motivation and response cost during instrumental conditioning. J Neurosci (2011) 1.09

Contributions of ERK signaling in the striatum to instrumental learning and performance. Behav Brain Res (2010) 1.03

Transient extracellular glutamate events in the basolateral amygdala track reward-seeking actions. J Neurosci (2012) 1.03

μ- and δ-opioid-related processes in the accumbens core and shell differentially mediate the influence of reward-guided and stimulus-guided decisions on choice. J Neurosci (2012) 1.01

Sensitivity to instrumental contingency degradation is mediated by the entorhinal cortex and its efferents via the dorsal hippocampus. J Neurosci (2002) 1.01

Incentive memory: evidence the basolateral amygdala encodes and the insular cortex retrieves outcome values to guide choice between goal-directed actions. J Neurosci (2013) 1.01

Striatal cholinergic interneurons display activity-related phosphorylation of ribosomal protein S6. PLoS One (2012) 0.98

Alcohol-Paired Contextual Cues Produce an Immediate and Selective Loss of Goal-directed Action in Rats. Front Integr Neurosci (2010) 0.96

Stimulus salience and retrospective revaluation. J Exp Psychol Anim Behav Process (2006) 0.95

Hierarchical and binary associations compete for behavioral control during instrumental biconditional discrimination. J Exp Psychol Anim Behav Process (2013) 0.93

The influence of amphetamine on sensory and conditioned reinforcement: evidence for the re-selection hypothesis of dopamine function. Front Integr Neurosci (2007) 0.91

Mediated conditioning versus retrospective revaluation in humans: the influence of physical and functional similarity of cues. Q J Exp Psychol (Hove) (2008) 0.91

The ventral striato-pallidal pathway mediates the effect of predictive learning on choice between goal-directed actions. J Neurosci (2013) 0.90

δ-opioid and dopaminergic processes in accumbens shell modulate the cholinergic control of predictive learning and choice. J Neurosci (2014) 0.90

Reduced heart rate variability in social anxiety disorder: associations with gender and symptom severity. PLoS One (2013) 0.90

The role of Pavlovian cues in alcohol seeking in dependent and nondependent rats. J Stud Alcohol (2005) 0.89

Resolution of conflict between goal-directed actions: outcome encoding and neural control processes. J Exp Psychol Anim Behav Process (2009) 0.88

The role of the anterior, mediodorsal, and parafascicular thalamus in instrumental conditioning. Front Syst Neurosci (2013) 0.88

Learning-related translocation of δ-opioid receptors on ventral striatal cholinergic interneurons mediates choice between goal-directed actions. J Neurosci (2013) 0.87

The role of opioid processes in reward and decision-making. Br J Pharmacol (2015) 0.87

The role of the amygdala-striatal pathway in the acquisition and performance of goal-directed instrumental actions. J Neurosci (2013) 0.86

Helplessness and escape performance: glutamate-adenosine interactions in the frontal cortex. Behav Neurosci (2003) 0.86

Oxytocin selectively moderates negative cognitive appraisals in high trait anxious males. Psychoneuroendocrinology (2012) 0.84

An assessment of factors contributing to instrumental performance for sexual reward in the rat. Q J Exp Psychol B (2002) 0.82

Current trends in decision making. Ann N Y Acad Sci (2007) 0.82

Motivational control of second-order conditioning. J Exp Psychol Anim Behav Process (2005) 0.79

Sexual experience interacts with steroid exposure to shape the partner preferences of rats. Horm Behav (2002) 0.79

Extracting functional equivalence from reversing contingencies. J Exp Psychol Anim Behav Process (2010) 0.79

The L-type calcium channel blocker nimodipine mitigates "learned helplessness" in rats. Pharmacol Biochem Behav (2003) 0.78

δ-Opioid receptors in the accumbens shell mediate the influence of both excitatory and inhibitory predictions on choice. Br J Pharmacol (2014) 0.77

Inhibitory sensory preconditioning detected with a sodium depletion procedure. Q J Exp Psychol (Hove) (2008) 0.75