When we choose one item from several alternatives, we consider their values from amount of reward and the amount of work needed to obtain them. To study the neuronal mechanism of such a decision-making, we developed a decision-making schedule task and recorded single unit activity from monkey orbitofrontal cortex (OFC), a cortical region that the neurons carry information about rewards and their values. Two monkeys were initially trained to perform a reward schedule task. In this task, the monkey had to complete the schedule composed of 1, 2 or 4 trials of visual discriminations to earn 1, 2 or 4 drops of liquid reward. After the monkey learned this task, the decision-making schedule task was introduced. The decision-making schedule task had a decision-making part and a reward schedule part. In the decision-making part, two choice targets were presented sequentially at the center of a computer monitor. Brightness and length of the choice targets were proportional to the amount of liquid reward (1, 2, or 4 drops) and the number of the visual-discrimination trials (1, 2, or 4 trials) needed, respectively. After both first and second choice targets had been presented, the same two choice targets simultaneously reappeared side by side of the fixation point. The monkey was required to choose one of the two targets by touching the corresponding bar in the chair. Then, the chosen reward schedule task was presented. We recorded 256 neurons in the monkey OFC during the decision-making schedule task (137 and 119 neurons from each monkey). The choice target values were estimated from the monkey's choice behavior using a standard exponential discounting model of reward value. Applying a generalized linear model to the data, showed 26.6% (68/256) of the neurons had a significant correlation between the neuronal firing and the first and the second choice target values, that is, they encoded information about the first choice target, 14.8% (38/256) of recorded neurons coded currently presented choice target values, that is, they encoded information about the stimulus currently being presented, 31/68 neurons had coefficients with the same signs, that is, they encoded information about the total possible value of the trial, and 37/68 neurons had different signs for the coefficients of 2 choice target terms, that is, they encoded information about which target was most valuable. This latter group of neurons can be interpreted as carrying information for how to choose the most valuable target. To determine whether the information might be used for making a choice, we injected muscimol into the recording sites bilaterally to silence the local neuronal population. Following injection, choice accuracy and speed were degraded in value dependent manner. Thus, OFC not only encodes information about reward value and the best choice, but seems to be necessary for normal choices to occur. One strong output of the orbitofrontal goes into the ventral striatum. Single unit recording and behavioral studies have implicated the ventral striatum in reward valuation and processing. We investigated whether muscimol-induced unilateral inactivation of ventral striatum would alter delay and/or reward size valuation, and then assessed whether the observed effects could be replicated through neuronal silencing with an inhibitory chemogenetic system. The monkey performed a stimulus-reward association task in which stimuli concurrently representing reward size and delay-to-reward were presented. In each trial, the monkey either accepted or rejected the offer presented; accepting the offer resulted in delivery of the reward predicted by the stimulus after the proposed wait time, refusing the offer allowed the monkey to begin a new trial with the potential for a new stimulus and different outcome. In the first phase of the study, muscimol was used to inactivate the right ventral striatum unilaterally. In muscimol treatment sessions, the monkey received a unilateral injection of muscimol into the ventral striatum. In control sessions, the same volume of vehicle was infused. In both muscimol treatment and control sessions, we observed similar accept rate patterns; the monkey was more likely to accept offers with higher reward values and shorter delays-to-reward. On muscimol treatment days, the monkey aborted significantly more trials. In the second phase of the study, an inhibitory chemogenetic receptor - hM4Di DREADD (Designer Receptor Exclusively Activated by Designer Drug), activated by exogenous ligand CNO - was targeted to the right ventral striatum. Systemic CNO injection (10 mg/kg) was used to induce temporary silencing. On CNO treatment days, the monkey aborted significantly more trials, similar to what happened with muscimol. In summary, although unilateral inactivation of ventral striatum appears insufficient to alter reward evaluation, it consistently produces an elevated rate of spontaneous early responding. Thus it seems as if even unilateral inactivation of this tissue impairs response inhibition, leading to the increase of inappropriate early responses. In the striatum, tonically active neurons (TANs) are cholinergic inter-neurons. These TANs are thought to play a role in reward-related and motivational processes. In primates these neurons are frequently described as having a pattern of responses with a period of inhibition followed by excitation that is related to whether a reward is delivered. We hypothesized that these neurons might encode the value of the predicted outcome. We recorded neuronal responses from 63 TANs in two monkeys while they performed a task in which 9 combinations of reward were offered by mixing 3 sizes (2, 4 or 6 drops of water) and 3 delays (1, 5 or 10s). A visual cue predicting the combination being offered was presented throughout the trial. The monkeys were required to indicate whether to accept the offered reward or skip to the next trial hoping for a larger offer. The monkeys accepted more valuable rewards more often than less valuable rewards, a pattern of behavior modeled by the same exponential discounting model used above. We found that for basically all of the TANs the cue-elicited responses were related to the two factors, reward size and delay-to-reward through the spike count and/or the pattern of their responses over time. The TANs could be divided in two sets of 27 and 36 cells respectively, characterized by a large (R2 = 0.82) and small (R2 = 0.11) correlation between the response pattern as measured using principal components and the spike count. Thus, there appear to be two groups of TANs, the first for which the modulation is represented by the number of spikes as is the common type of neural coding throughout the brain, and the second for which the modulation can only be represented as a temporal modulation of the response; the response strength does not change. This is a striking example of pure temporal coding where the spike count over the response period does not carry any information. There are few, if any, such clear examples of stimulus-related purely temporal coding in the nervous system.

Project Start
Project End
Budget Start
Budget End
Support Year
25
Fiscal Year
2016
Total Cost
Indirect Cost
Name
U.S. National Institute of Mental Health
Department
Type
DUNS #
City
State
Country
Zip Code
Eldridge, M A G; Richmond, B J (2017) Resisting the Urge to Act: DREADDS Modifying Habits: (Trends in Neurosciences 40, 61-62; 2017). Trends Neurosci 40:324
Eldridge, Mark A G; Richmond, Barry J (2017) Resisting the Urge to Act: DREADDs Modifying Habits. Trends Neurosci 40:61-62
Ji, Bin; Kaneko, Hiroyuki; Minamimoto, Takafumi et al. (2016) Multimodal Imaging for DREADD-Expressing Neurons in Living Brain and Their Application to Implantation of iPSC-Derived Neural Progenitors. J Neurosci 36:11544-11558
Eldridge, Mark A G; Lerchner, Walter; Saunders, Richard C et al. (2016) Chemogenetic disconnection of monkey orbitofrontal and rhinal cortex reversibly disrupts reward value. Nat Neurosci 19:37-9
Nagai, Yuji; Kikuchi, Erika; Lerchner, Walter et al. (2016) PET imaging-guided chemogenetic silencing reveals a critical role of primate rostromedial caudate in reward evaluation. Nat Commun 7:13605
Abitbol, Raphaëlle; Lebreton, Maël; Hollard, Guillaume et al. (2015) Neural mechanisms underlying contextual dependency of subjective values: converging evidence from monkeys and humans. J Neurosci 35:2308-20
Bouret, Sebastien; Richmond, Barry J (2015) Sensitivity of locus ceruleus neurons to reward value for goal-directed actions. J Neurosci 35:4005-14
Dayan, Eran; Averbeck, Bruno B; Richmond, Barry J et al. (2014) Stochastic reinforcement benefits skill acquisition. Learn Mem 21:140-2
Lerchner, W; Corgiat, B; Der Minassian, V et al. (2014) Injection parameters and virus dependent choice of promoters to improve neuron targeting in the nonhuman primate brain. Gene Ther 21:233-41
Inaba, Kiyonori; Mizuhiki, Takashi; Setogawa, Tsuyoshi et al. (2013) Neurons in monkey dorsal raphe nucleus code beginning and progress of step-by-step schedule, reward expectation, and amount of reward outcome in the reward schedule task. J Neurosci 33:3477-91

Showing the most recent 10 out of 18 publications