Abstract
The rewards that we get from our choices and actions can have a major influence on our future behavior. Understanding how reward biasing of behavior is implemented in the brain is important for many reasons, including the fact that diminution in reward biasing is a hallmark of clinical depression. We hypothesized that reward biasing is mediated by the anterior cingulate cortex (ACC), a cortical hub region associated with the integration of reward and executive control and with the etiology of depression. To test this hypothesis, we recorded neural activity during a biased judgment task in patients undergoing intracranial monitoring for either epilepsy or major depressive disorder. We found that beta (12â30âHz) oscillations in the ACC predicted both associated reward and the size of the choice bias, and also tracked reward receipt, thereby predicting bias on future trials. We found reduced magnitude of bias in depressed patients, in whom the beta-specific effects were correspondingly reduced. Our findings suggest that ACC beta oscillations may orchestrate the learning of reward information to guide adaptive choice, and, more broadly, suggest a potential biomarker for anhedonia and point to future development of interventions to enhance reward impact for therapeutic benefit.
Similar content being viewed by others
Introduction
One of the hallmarks of human behavior is the ability to associate rewards with the stimuli that produce them1,2,3. The anticipation of a reward can influence our decision-making process by overriding our initial sensory judgments and subsequently altering our subjective evaluation of a stimulus4,5,6,7. These alterations in judgment, though sometimes suboptimal, can also be beneficial by biasing our behavior in a manner that increases the likelihood of obtaining greater rewards8. A fundamental unresolved question in human neuroscience is, therefore, what neural processes drive the learning, or association of reward information, so as to bias subsequent decisions9,10.
Dysfunction of these reward association processes is a common characteristic of several neuropsychiatric disorders, most notably major depressive disorder11,12,13,14,15. Depression is a disabling disease characterized by anhedonia, disengagement, and reduced enjoyment of life16,17. Among its hallmarks is the diminished influence of reward anticipation on decision-making18,19. Investigating the underlying mechanisms of this phenomenon is therefore important for the development of innovative interventions aimed at providing therapeutic benefit by enhancing the impact of reward20,21,22.
Several human electrophysiological studies have identified a mediofrontal oscillatory component associated with positive feedback in both gambling task and reversal learning task, tasks that have key features in common with our bias task. The increase observed in these tasks is in the beta range and occurs 200 to 400âms after the feedback informing the participant about the monetary gains23,24,25,26,27. It has been proposed that this beta activity is generated in the prefrontal cortex; the most commonly inferred source site is the anterior cingulate cortex (ACC). It is further assumed that the ACC then transmits a fast motivational value signal, still in the beta band, from the frontal cortex to downstream reward-related regions26. Moreover, beta activity in the cingulate cortex is of particular importance in depression28,29,30. For example, a recent study showed that beta activity best tracks depressive states, seen as a decrease in beta band power during the first month of chronic stimulation, followed by an eventual rise31. This result suggests that sustained, antidepressant responses might involve increased beta band power after prolonged stimulation.
Existing evidence suggests that associative learning processes occur, in part, within specialized reward and control circuitry3,32. In particular, the ACC is a notable hub region that is associated with both reward and executive functions, and that has been linked to their integration33,34. Crucially, extensive research highlights the importance of the ACC in representing reward information and monitoring rewarding outcomes, as well as in facilitating learning and enabling strategic adjustments35,36,37,38,39. Moreover, altered activity in the ACC and regions connected to it has been associated with depression40,41,42,43,44,45,46.
Anhedonia, the loss of interest and pleasure from normally rewarding stimuli, is a cardinal symptom of depression and is often inadequately treated by traditional antidepressants21,47. While individuals diagnosed with depression may exhibit a range of symptoms, anhedonia stands out as one of two key symptoms required for a diagnosis of major depressive disorder48. Moreover, it is prevalent in other psychiatric and neurological disorders including substance use disorder, bipolar disorder, schizophrenia, and Parkinsonâs disease49,50,51,52. To highlight the common dimensions underlying mental health disorders, the National Institute of Mental Health provides a research framework called the Research Domain Criteria (RDoC)53. Within this framework, the probabilistic reward task (PRT) is a well-validated task to objectively measure anhedonia. In this task, unequal frequency of reward between two correct responses produces a response bias towards the more frequently rewarded stimulus5. While prior studies have shown that performance predicts depression severity, the underlying electrophysiological basis of anhedonia remains unclear6,53,54.
We recorded intracranial local field potentials (LFPs) from four reward-related regions in human participants performing the probabilistic reward task. In subjects with medically refractory epilepsy undergoing intracranial seizure monitoring (âEpilepsy Cohortâ, no clinical diagnosis of major depressive disorder), we find that enhanced beta (12â30âHz) oscillations after decision choice in the ACC predict stronger biasing and also track reward receipt. On the other hand, in subjects with severe treatment-resistant depression (âDepression Cohortâ, i.e., a cohort undergoing intracranial monitoring as part of a clinical trial studying depression; NCT03437928), both the behavioral bias and the neural response towards rewarding stimulus are reduced. These results suggest that ACC beta oscillations may reflect neural processes that orchestrate the binding of reward and sensory information to guide adaptive choice. Additionally, our findings imply that these oscillations could serve as a potential biomarker for anhedonia, paving the way for future research to develop targeted neuromodulatory interventions.
Results
Participants developed response bias towards the more frequently rewarded stimulus
The Epilepsy Cohort consisted of 15 participants with medically refractory epilepsy, but no diagnosis of a mood disorder, undergoing intracranial seizure monitoring. These subjects performed a variant of the probabilistic reward task (Fig. 1a)5. Each run of the PRT consisted of three blocks of 100 trials. On each trial, following a fixation period, a mouthless cartoon face appeared (0.5âs). Then a mouth appeared (variable length; long mouth: 54âmm, short mouth: 49âmm) for 0.1âs and then disappeared. Participants were asked to identify whether the previously presented mouth was long or short. Participants received no reward for an incorrect response and probabilistically received either a reward (dollar sign) or no reward (empty rectangle of same size and shape) for a correct response. The likelihood of receiving a reward was fixed at either 60% or 20% depending on whether the choice was designated as the rich stimulus or the lean stimulus.
a Timeline of the probabilistic reward task. b Response bias averaged across all Epilepsy Cohort patients. A positive response bias value indicates there is a preference for choosing the more frequently rewarded stimulus. Boxplots illustrate quartiles at 25% and 75%, with horizontal lines denoting medians, and whiskers extending to 1.5 times the interquartile ranges. nâ=â16 runs of task. One-sample t test compared with zero, block1: pâ=â0.036, block2: pâ=â0.0029, block3: pâ=â0.0061. â*â represents pâ<â0.05. c Accuracy for rich and lean stimuli averaged across all epilepsy patients. Boxplots illustrate quartiles at 25% and 75%, with horizontal lines denoting medians, and whiskers extending to 1.5 times the interquartile ranges. nâ=â16 runs of task. Paired-sample t test, block1: pâ=â0.026, block2: pâ=â0.0034, block3: pâ=â0.0086. â*â represents pâ<â0.05. d Proportion of choosing the same response when there is a reward or no reward following the response in the previous trial. nâ=â48 blocks of task. Paired-sample t test, pâ=â3.9*10-7. â*â represents pâ<â0.05. e Intracranial recording electrodes sample reward-relevant regions in Epilepsy Cohort patients. Source data are provided as a Source Data file.
The optimal strategy, in terms of correct performance and reward accumulation, is to ignore the rich/lean status and choose based solely on perceptual features (i.e., mouth size). The behavioral measurement of this task was response bias, or bias towards choosing the rich stimulus. Here, following the conventions of Pizzagalli et al., 2005, we use the term response bias to indicate the extent to which behavior is modulated by reinforcement history.
Epilepsy Cohort participants showed a response bias: they tended to choose the more frequently rewarded stimulus more often than the less frequently rewarded one. Response bias averaged across all participants was larger than zero for all three blocks (Fig. 1b, one-sample t test compared with zero, block1: pâ=â0.036, block2: pâ=â0.0029, block3: pâ=â0.0061). Consistently, the accuracy for the rich stimulus was higher than the accuracy for the lean stimulus (Fig. 1c, paired-sample t test, block1: pâ=â0.026, block2: pâ=â0.0034, block3: pâ=â0.0086). To assess the impact of feedback from the previous trial, we compared choice patterns following reward and no-reward outcomes (these analyses include all choices, not just correct ones, Fig. 1d). When there was a reward in the previous trial, the likelihood of choosing the same response (meaning long vs. short) was significantly larger (paired-sample t test, tâ=â5.9, pâ<â10â4), supporting the idea that the participants have modified their behavior based on the previous feedback.
Rich stimulus induces larger ACC beta power during delay period
We recorded from four regions in the Epilepsy Cohort: anterior cingulate cortex (ACC), medial orbitofrontal cortex (mOFC), lateral orbitofrontal cortex (lOFC), and amygdala (Fig. 1e, Supplementary Table. 1).
We first analyzed the delay period (Fig. 2a), which extends 500 ms after choice. The neural activity during this period occurs after the decision-making process. Cognitive effort typically occurs during the decision-making process, as individuals engage in mental processes such as evaluating options, weighing consequences, and selecting an action. Therefore, we do not think that the difference in neural activity between different options during the delay period, after the participant has already made the choice, should be strongly influenced by cognitive effort. During this period, sensory information and reward history can be integrated to generate value representations for stimuli, enabling the brain to anticipate the potential rewards and potentially promote learning55,56. We computed the average spectral power at six major frequencies (delta, theta, alpha, beta, gamma, and high-gamma) during the delay period (using the same definitions as in Xiao et al., 2023). Considering the observation of beta activity during reward processing and the importance of beta activity in depression, our core experimental hypothesis was in favor of the beta band. Results from the other five bands with Bonferroni correction were reported for reasons of completeness.
a Delay period starts from the button press and ends when feedback appears on the screen. b Differences in beta (12â30âHz) power between the rich and lean trials are plotted at each sEEG contact. The color represents the t value comparing rich with lean trials. Red indicates greater power during rich trials. c Beta power change during the delay period. Red indicates rich trials while blue indicates lean trials. Trials are time-locked to the button press. The horizontal bar indicates time points in the cluster that are statistically significant (pâ<â0.05), as determined by two-sided cluster-based permutation test. Data are presented as mean valuesâÂąâSEM. Source data are provided as a Source Data file.
We found that beta power in ACC was greater for rich trials than for lean trials during the delay period (Fig. 2b, linear mixed model, t(stimulus)â=â2.1, pâ=â0.039, coefâ=â2.5, 95% confidence interval (95% CI)â=â[0.12â4.8], analysis epoch: 0â500âms after choice). This difference was limited to the beta frequency - the other five frequency bands did not show any detectable difference (pâ>â0.05 in all cases). Moreover, the majority of channels exhibited greater beta power in response to the rich stimulus than the lean stimulus (Fig. 2b). Specifically, we found that event-aligned activity increases immediately after choice; this rise was significantly larger following choice of the rich stimulus (Fig. 2c, cluster-based permutation test, the significant cluster begins at 244âms and ends at 293âms after the choice, number of permutationsâ=â1000).
Interestingly, beta modulation in the lateral orbitofrontal cortex was also significant before multiple comparison corrections (albeit in the opposite direction, linear mixed model, t(stimulus)â=ââ2.1, coefâ=ââ3.3, 95% confidence interval (95% CI)â=â[â6.4ââââ0.17], pâ=â0.039). This observation is notable given the hypothesized role of OFC in reward learning processes, and given its close interconnections with ACC32,57,58,59,60. This result suggests that reward biasing may reflect the activity of a broader circuit consisting of ACC and lOFC, possibly working together.
On the other hand, these effects were not brain-wide. Specifically, the beta modulation in medial orbitofrontal cortex and amygdala was not significant (t(stimulus)â=ââ1.7, pâ=â0.09, coefâ=ââ5.1, 95% confidence interval (95% CI)â=â[â12.8â2.7], for mOFC beta, t(stimulus)â=ââ1.3, pâ=â0.20, coefâ=ââ2.1, 95% confidence interval (95% CI)â=â[â4.4â0.3], for amygdala beta, before multiple comparison corrections). Note that the failure to achieve significance is not itself dispositive; indeed, we report other evidence linking mOFC and amygdala in associations, suggesting that ACC may be part of a larger network focused on learning (see below).
Neural activity during delay period is correlated with response bias
Next, we explored whether ACC beta activity plays a role in translating internal representations of stimulus value into behavioral preferences. Behavior preference is measured by response bias, with a stronger bias indicating a larger preference towards rewards. Therefore, a correlation with bias is the key indicator of learning the association between the reward and the stimulus.
During the delay period, neural activity distinguished rich and lean stimuli in blocks when bias was higher than the median, but not in blocks when it was lower than the median (Fig. 3a, b). In the high response bias condition, one significant cluster (cluster-based permutation test, from 234âms to 322âms, number of permutationsâ=â1000) was found, while no significant cluster was found in the low response bias condition (Fig. 3b). These findings suggest that ACC activity covaries with the degree of bias. This pattern is illustrated in heatmaps showing the difference between the rich and lean stimuli at each contact (Fig. 3a): in the majority of channels, a notable increase in power towards the rich stimulus was observed in the high response bias blocks. Conversely, the channels in the low response bias blocks exhibited fewer discernible differences between the rich and lean stimuli. Furthermore, we found that average beta power showed a significant difference between response to the rich and lean stimuli solely within the high response bias blocks (Fig. 3c, t(stimulus)â=â3.2, pâ=â0.0013, coefâ=â5.4, 95% confidence interval (95% CI)â=â[2.1â8.7]), while no such distinction was observed within the low response bias blocks (t(stimulus)â=ââ0.44, pâ=â0.66,coefâ=ââ0.76, 95% confidence interval (95% CI)â=â[â4.1â2.6]).
a Differences in beta power between the rich and lean trials are plotted at each sEEG contact for high response bias blocks or low response bias blocks. b Beta power change during the delay period for high response bias blocks or low response bias blocks. Trials are time-locked to the button press. Horizontal bar depicts time points for which neural activity between rich trials and lean trials is significantly different (two-sided cluster-based permutation test, pâ<â0.05). Data are presented as mean valuesâÂąâSEM. c Difference in beta power in high or low response bias blocks. Boxplots illustrate quartiles at 25% and 75%, with horizontal lines denoting medians, and whiskers extending to 1.5 times the interquartile ranges. nâ=â56 channels for low response bias blocks and nâ=â52 channels for high response bias blocks. Linear mixed model, pâ=â0.66 for the low response bias blocks and pâ=â0.0013 for the high response bias blocks. â*â represents pâ<â0.05. d Relationship between response bias and the difference in reward response towards rich or lean stimulus in all frequency bands. The statistical test for the Pearson correlation coefficient is two-sided, and no adjustments were made for multiple comparisons. e Correlation between neural activity and response bias in all frequency bands. Center for the error bars represent the correlation coefficient. Error bars represent a 95% confidence interval. pâ=â0.013 for beta band. â*â represents pâ<â0.05. Source data are provided as a Source Data file.
We then explored whether this neural activity during the delay period correlates with the behavioral variable, response bias. The difference in beta power response towards the rich or lean stimulus is positively correlated with response bias, but there is no significant correlation in other frequency bands (Fig. 3d, e, râ=â0.38 for ACC beta, pâ=â0.013). The preference for choosing a more frequently rewarded stimulus increases as the difference in beta responses increases. This indicates that the ability to differentiate stimulus values, as reflected by ACC beta activity during the delay period, is associated with participantsâ behavior.
These results clearly implicate ACC beta activity in processing related to biasing. We found some evidence that OFC is likewise involved in reward learning. Response bias was positively correlated with the difference in beta power response towards rich or lean stimulus in the orbitofrontal cortex (pâ=â0.065 for mOFC beta, pâ=â0.037 for lOFC beta before multiple comparison corrections).
Reward feedback elicits beta oscillations in the anterior cingulate cortex
Recognizing the crucial role of ACC beta power in reward anticipation during the delay period, next we tested whether this neural feature is also involved in the response to reward feedback. The observation of this feature during both the delay period and feedback period may suggest a common neural mechanism underlying the evaluation of stimuli and outcomes. The feedback period starts when either reward feedback or neutral feedback appears on the screen and ends when the image disappears (duration: 1000âms) (Fig. 4a). We first computed the average spectral power at the beta frequency during the feedback period. To compare the activity between reward trials and neutral trials, we used a two-sample t test to fit the spectral power for each channel in the ACC. Remarkably, most channels across the ACC exhibited greater beta power in response to reward feedback compared to neutral feedback (Fig. 4b). When trials were time-locked to the feedback onset, we observed a substantial increase in ACC beta power in response to reward (Fig. 4c, cluster-based permutation test, the significant cluster begins at 194âms and ends at 907âms after the feedback onset). This increase in the beta range reaches its peak approximately 500âms after feedback onset.
a The feedback period starts when the dollar bill or empty rectangle appears on the screen and ends when the feedback image disappears. b Differences in beta power between the reward and no-reward trials are plotted at each sEEG contact. The color represents the t value comparing reward feedback with neutral feedback. Red indicates greater power during the reward feedback. Only trials with correct response are included in the analysis. c Feedback-aligned beta power change averaged over trials and participants. Red indicates reward trials while blue indicates correct trials with no reward. Trials are time-locked to feedback onset. Horizontal bar depicts time points for which neural activity between reward trials and no-reward trials is significantly different (two-sided cluster-based permutation test, pâ<â0.05). Data are presented as mean valuesâÂąâSEM. d Average beta power change across the feedback period. Boxplots illustrate quartiles at 25% and 75%, with horizontal lines denoting medians, and whiskers extending to 1.5 times the interquartile ranges. nâ=â65 channels. Linear mixed model, pâ=â1.8*10â36. â*â represents pâ<â0.05. Source data are provided as a Source Data file.
To further confirm the effect of feedback type on the beta power change, we used a linear mixed effect model. The average beta power change during reward feedback was significantly larger than the change observed during neutral feedback (Fig. 4d, linear mixed effect model, t(feedback)â=â12.6, pâ<â10â4, coefâ=â15.7, 95% confidence interval (95% CI)â=â[13.3â18.2]). Overall, these results demonstrated that there was a beta power increase in response to reward in the ACC. This suggests that the ACC is engaged in the evaluation of both stimuli and outcomes, potentially representing a common neural mechanism underlying the assessment of reward values.
We found evidence that ACC is not alone in this process; indeed, two other regions from which we recorded, mOFC and amygdala, also showed a systematic relationship between post-reward beta activity and reward (mOFC: t(feedback)â=ââ2.0, pâ=â0.04, coefâ=ââ3.4, 95% confidence interval (95% CI)â=â[â6.8ââââ0.09], amygdala: t(feedback)â=ââ5.0, pâ<â10â4,coefâ=ââ9.8, 95% confidence interval (95% CI)â=â[â13.6ââââ6.0], before multiple comparison corrections). It is interesting that amygdala showed a significant reversed relationship even after multiple comparison corrections; only ACC showed the same, positive, relationship during both the post-choice and delay periods.
To further confirm the result, we performed additional analysis using 12.5â30âHz as the frequency range for beta activity61. As in the original result, we found that beta activity showed a discernible difference between the rich and lean stimuli within the high response bias blocks (Supplementary Fig. 1a), t(stimulus)â=â3.1, pâ=â0.0021, coefâ=â5.1, 95% confidence interval (95% CI)â=â[1.8â8.3], while no such distinction within the low response bias blocks (t(stimulus)â=ââ0.13, pâ=â0.90, coefâ=ââ0.22, 95% confidence interval (95% CI)â=â[â3.6â3.2]). During the feedback period, beta activity during reward feedback was significantly larger than neutral feedback (Supplementary Fig. 1b, t(feedback)â=â12.9, pâ<â10â4, coefâ=â17.6, 95% confidence interval (95% CI)â=â[14.0â21.1]). We used time-frequency maps to investigate how various components of beta activity contribute to the overall effect (Supplementary Fig. 1c, d). Our findings indicate a large effect in the lower range of beta during both the delay and feedback periods, suggesting that our results are predominantly influenced by lower beta.
Reward biasing and its neural correlates in ACC are blunted in depression cohort
In addition to the 15 patient Epilepsy Cohort, we also performed intracranial recordings in a group of four individuals with severe treatment-resistant depression (âDepression Cohortâ) participating in an NIH-funded clinical trial (NCT03437928, Fig. 5a)62,63,64,65. We analyzed behavioral performance and electrophysiological patterns in these depression patients. We did not observe any significant difference between the accuracy for the rich stimulus and the accuracy for the lean stimulus (Supplementary Fig. 2a, paired-sample t test, block1: pâ=â0.90, block2: pâ=â0.94, block3: pâ=â0.60). Response bias in depression patients was not significantly different from zero for all three blocks (Supplementary Fig. 2b, one-sample t test compared with zero, block1: pâ=â0.93, block2: pâ=â0.83 block3: pâ=â0.38). Compared to the epilepsy group, we found that response bias was blunted in these patients, especially in the last block of the task (Fig. 5b, two-sample t test compared with epilepsy patients, tâ=â2.3, pâ=â0.030). This indicates that the behavioral preference towards more frequently rewarded stimuli was reduced in this severely depressed group of individuals. Then we investigated the neural response in ACC towards reward-related stimulus in this cohort. Unlike in the Epilepsy Cohort, there was no significant difference towards rich or lean stimulus in beta activity during the delay period (Fig. 5c, linear mixed model, t(stimulus)â=ââ0.29, pâ=â0.77, coefâ=ââ3.9, 95% confidence interval (95% CI)â=â[â3.0â2.2]). This result suggests a lack of reward-oriented anticipation in patients with severe depression. During the feedback period, beta power exhibited a larger change during reward feedback compared to that during neutral feedback (linear mixed model, t(feedback)â=â6.7, pâ<â10â4, coefâ=â7.5, 95% confidence interval (95% CI)â=â[5.3â9.7]). However, we observed that this beta power change was significantly smaller in the Depression Cohort than it was in the Epilepsy Cohort (Fig. 5d, linear mixed model, t(feedback*group)â=â4.7, pâ<â10â4, coefâ=ââ8.2, 95% confidence interval (95% CI)â=â[â11.6ââââ4.8]). When analyzing the temporal response, we observed that the difference towards rich or lean stimulus was diminished across the delay period in depression patients (Fig. 5e). When the trials were time-locked to the onset of feedback, we found that the reward response in depression patients was both reduced and delayed compared to the Epilepsy Cohort (Fig. 5f). Apart from the original analysis using beta activity, we also reported the result in other frequency bands for both the Epilepsy Cohort and the Depression Cohort (Supplementary Table. 2). These findings imply alterations in reward processing within the ACC among individuals with depression.
a Intracranial recording electrodes sample reward-relevant regions in depression patients. b Response bias in Depression Cohort and Epilepsy Cohort. Boxplots illustrate quartiles at 25% and 75%, with horizontal lines denoting medians, and whiskers extending to 1.5 times the interquartile ranges. nâ=â16 runs for Epilepsy Cohort and nâ=â12 runs for Depression Cohort. Two-sample t test, block1: pâ=â0.24, block2: pâ=â0.24, block3: pâ=â0.030. â*â represents pâ<â0.05. c Average beta power change across the delay period. Boxplots illustrate quartiles at 25% and 75%, with horizontal lines denoting medians, and whiskers extending to 1.5 times the interquartile ranges. nâ=â65 channels for Epilepsy Cohort and nâ=â36 channels for Depression Cohort. Linear mixed model, pâ=â0.039 for Epilepsy Cohort, pâ=â0.77 for Depression Cohort. â*â represents pâ<â0.05. d Average beta power change across the feedback period. Boxplots illustrate quartiles at 25% and 75%, with horizontal lines denoting medians, and whiskers extending to 1.5 times the interquartile ranges. nâ=â65 channels for Epilepsy Cohort and nâ=â36 channels for Depression Cohort. Linear mixed model, pâ=â1.8*10â36 for Epilepsy Cohort, pâ=â1.6*10â11 for Depression Cohort. â*â represents pâ<â0.05. e Difference towards rich or lean stimulus in the beta activity across the delay period in Depression Cohort and Epilepsy Cohort. Data are presented as mean valuesâÂąâSEM. f Difference towards reward or neutral feedback in the beta activity across the feedback period. Data are presented as mean valuesâÂąâSEM. Source data are provided as a Source Data file.
Discussion
We obtained multi-site intracranial recordings from patients undergoing inpatient monitoring for either epilepsy or treatment-resistant depression. Patients performed a perceptual discrimination task in which rewards were stochastic and biased. We found that enhanced beta oscillations in the ACC (and, less consistently, in regions connected to it), correlate with a stronger biasing effect and track the receipt of rewards. These results suggest that beta oscillations in the ACC may be a neural signature of processes associated with reward biasing, and especially the association of reward value with choice. Both the behavioral bias and the neural response to rewarding stimuli are diminished in patients with depression, highlighting the changes in reward processing within the ACC in depression. Previous studies have shown inconsistent results in behavioral findings during reinforcement learning in depression, possibly arising from the heterogeneity and stage of depression5,6,21,54,66,67,68,69. Given that increased anhedonia levels are associated with greater illness severity and longer episodes, the observed response bias difference in our study may be attributed to the advanced stage and high severity of treatment-resistant depression70.
Anhedonia, with its profound impact on an individualâs quality of life, can create challenges in various aspects including relationships, work, and daily functioning. Traditional antidepressants often fail to adequately address this symptom21,47. Thus, gaining a deeper understanding of anhedonia is important for improving diagnosis and treatment of depression. The perceptual bias task is the most commonly used task for the assessment of anhedonia, a major element of clinical depression53. There are two reasons for this: first, the task has direct face validity to the association of reward and action; second, there is ample empirical evidence linking changes in behavior in this task with anhedonia. It also has been used to assess reward learning and feedback responsiveness in both rodents and humans71,72,73,74. Our results, which include both correlations with the behavior in non-clinically-depressed individuals and reduction in reward response in individuals with severe depression, suggest that beta activity within the ACC may be a biomarker for anhedonia. Such a biomarker has many potential benefits, including the ability to improve diagnosis and symptom monitoring. Moreover, they present an appealing target for neuromodulatory trials, which could focus on altering ACC beta and thereby reducing anhedonia. In our study, the average beta power values were z-scored across trials, regardless of stimulus types within each contact. Therefore, the smaller values in depression patients indicate a lesser difference between conditions, not the absence of beta activity overall. Monitoring this neural feature in a more naturalistic environment is essential for comparison with healthy controls and crucial for the development of potential treatments. The limited enrollment of only four depression patients in our study is due to the unique clinical trial using an inpatient intracranial platform for therapy development. Future studies with larger sample sizes will be necessary to extend the generalizability of our findings to the broader population of individuals with depression. In comparison to traditional questionnaires, continuous and passive monitoring of this neural feature in patients requires less effort from them, offers greater objectivity, and facilitates timely intervention. Future work will need to determine the time course of changes in this potential biomarker relative to those of depressive symptoms.
While certain symptoms like tremor or stiffness in Parkinsonâs disease exhibit rapid moment-to-moment fluctuations, depressive symptoms are generally characterized by a more gradual and prolonged evolution over the course of days to months75,76,77. Therefore, this biomarker holds the potential to offer clinicians a valuable temporally dynamic signal about the individualâs ongoing state and the transitions they experience. Such a signal could alert clinicians to be watchful and influence decisions regarding potential therapeutic maneuvers such as medication adjustments, behavioral interventions, or modifications in stimulation delivery. Further investigation is needed to assess its effectiveness and practical application. Current technology enables the monitoring of beta activity at the stimulation site in freely moving Parkinsonian patients, allowing for the precise control of stimulation delivery78,79,80. Similar to this approach, it is possible to track the instantaneous power of the beta band in depression patients by deep brain stimulation systems with sensing capabilities. While the intracranial signal provides more precise anatomical information via direct contact with brain tissue, future studies should aim to validate and replicate our observation of ACC beta activity using non-invasive approaches. Wireless EEG headsets may be necessary to achieve more frequent and convenient measurements, while the implementation of advanced source localization techniques can enhance anatomical precision. Advancements in these areas can facilitate broader application across patient populations.
These results also have implications for our understanding of the role of the ACC. The ACC is thought to play a crucial role in reward processing. In non-human primates, individual ACC neurons process both experienced and fictive rewards to dynamically guide changes in behavior. Monkeys with ACC lesions are impaired in using rewarded trials to sustain the selection of the correct object, emphasizing the importance of the ACC in reward-based decision-making81. Our study reveals that beta oscillations in the ACC, which represents reward outcome, are also elicited by rich stimuli and are correlated with behavioral preference. Previous electrical stimulation studies suggest that dACC plays a crucial role in motivation and drive82,83. In our study, we observed an increase in ACC activity when participants anticipated a reward. This anticipation of reward could potentially translate into an increased willingness to persevere and exert effort to obtain the reward. These stimulation studies also imply that adjacent brain regions may fulfill unique roles. Electrical stimulation of the anterior midcingulate cortex induces a âwill to persevere,â whereas stimulating subgenual or retrosplenial cingulate regions fails to evoke perceptual or behavioral responses. Further investigations with increased recording sites in the ACC are necessary to clarify the roles of its various subregions in reward processing. These findings provide further support for the importance of the ACC in reward learning. Indeed, these results are consistent with previous theories linking the ACC with cognition related to reward in general and to reward-mediated learning specifically34,81,84,85. In experiments involving both monetary gains and losses, research has demonstrated that an increase in theta power is associated with losses, while an increase in beta power is associated with gains23,25,27. Aligned with prior findings, our study showed a rise in beta activity within the ACC following positive feedback. Future studies could investigate ACC activity during a similar task that incorporates negative feedback for a more comprehensive understanding.
The alignment of reward information with other forms of sensorimotor processing involves the dynamic and flexible linking of inherently disparate pieces of information26,86. Several theories highlight the potential involvement of high-frequency oscillations in orchestrating learning and binding processes87,88,89,90,91,92. Our results indicate that successful reward biasing on sensorimotor decision-making is associated with specific enhancements in beta oscillatory activity within the ACC and areas in its local circuit. This finding suggests that ACC beta activity could play a key role in linking rewards to various cognitive areas. Future studies might explore how altering this activity could impact cognitive and emotional well-being.
Methods
Participants
Fifteen participants (eight males and seven females, mean age 39 years, range 19â60 years) undergoing invasive monitoring for the treatment of refractory epilepsy at Baylor St. Lukeâs Medical Center (Houston, Texas, USA) participated in our study. These participants did not carry a diagnosis of major depressive disorder. Implantation sites were determined solely by the clinical team for localization of the seizure onset zone. The Institution Review Board at Baylor College of Medicine approved this study (IRB protocol number H-18112), and all participants provided verbal and written consent to participate.
Four patients with treatment-resistant depression (two males and two females, mean age 42 years, range 37â58 years) who were enrolled in an early feasibility trial (NCT03437928) also participated in our study. This trial of individualized deep brain stimulation (DBS) guided by intracranial recordings is funded by the NIH BRAIN Initiative (UH3 NS103549) and approved by the Institution Review Board at Baylor College of Medicine (IRB number H-43036). These individuals did not carry significant psychiatric comorbidities based on the trialâs exclusion of schizophrenia, bipolar disorder, personality disorders, and neuro-developmental disorders, as these conditions may impact the study results. Additional details regarding the exclusion criteria can be found: https://clinicaltrials.gov/study/NCT03437928#participation-criteria. Each patient was implanted with permanent deep brain stimulation leads for stimulation delivery as well as with temporary sEEG electrodes for neural recordings. In our study, two patients used antidepressant medication, while the remaining two patients did not receive any medication. The trial protocol requires patients to maintain a stable dose of medication for at least one month before surgery, and no alterations are made to their medication during the in-patient monitoring period.
In this study, we focus on the sEEG recordings from reward-relevant regions. Following the surgical implantation of electrodes, patients underwent around one week of inpatient monitoring. Throughout this time, we conducted probabilistic reward task while simultaneously recording dense neural activity.
Probabilistic reward task
Probabilistic reward task (PRT) is a task for the investigation of reward processing. Each run of the PRT consists of 300 trials, divided into 3 blocks of 100 trials. All three blocks were performed on the same day, and a short break was allowed between the blocks. Each trial started with the presentation of a fixation cross in the center of the screen for a random duration ranging from 700 to 900âms. A mouthless cartoon face was then presented for 500âms followed by the presentation of this face with either a short mouth or a long mouth for 100âms. Participants were then asked to identify which type of mouth was presented. Either reward or neutral feedback was presented for 1000âms after a random delay interval ranging from 500 to 1000âms. For each run, we set one type of stimulus (either the short or long mouth) as the rich stimulus. The probability of reward for rich stimulus is three times more frequent than for lean stimulus (the other kind of mouth). If a participant didnât correctly identify the stimulus in a trial where reward feedback was scheduled, the feedback was postponed until the next accurate identification of the same stimulus type. Therefore, the consistency of reward feedback frequency was maintained across all participants for each stimulus type. This approach is the same as that used in the earlier study with the same task5. The likelihood of receiving a reward was set at either 60% or 20%, depending on whether the choice was categorized as the rich stimulus or the lean stimulus. Reaction time was defined as the time between stimulus presentation and participant response. As a previous study using PRT5, trials with reaction time that were too short (<150âms), too long (>2500âms), or were considered an outlier response (exceeding meanâÂąâ3âSD), were excluded from further analysis. The task was implemented in MATLAB (The Mathworks, Inc., Natick, MA) using Psychtoolbox-3. For behavioral analyses on the PRT task, the main variable of interest is response bias, which measures the systematic preference for the response associated with more frequently rewarded (rich) stimulus. Response bias can be calculated based on the number of correct or incorrect trials to the rich and lean stimuli.
Previous studies suggest that response bias is inversely related to current anhedonic symptoms in unselected adults. It is also blunted in depression patients and is improved by pharmacological treatments53,54.
Electrode localization
Patients underwent brain magnetic resonance imaging (MRI) before surgery and computed tomography (CT) after implantation. To identify the precise anatomical position of implanted electrodes, we co-registered the pre-operative T1-weighted MRI scans with the post-operative CT scans. Automatic cortical reconstruction was performed on the preoperative T1-weighted MRI using Freesurfer tools93. Functional Magnetic Resonance Imaging for the Brain Software Libraryâs Linear Image Registration Tool (FLIRT) was employed to align the postoperative CT data with the preoperative T1-weighted MRI94. Electrode positions were manually marked using the co-registered CT data in BioImage Suite v3.5b1 and plotted into the native MRI space95. An expert reviewer then examined the images to identify brain regions and determine whether the contact was in gray or white matter. Contacts determined to be in white matter were excluded from further analysis. Detailed procedures have been described elsewhere62,65.
Data acquisition and preprocessing
Neural signals were recorded during the probabilistic reward task. Signals were recorded with sEEG electrodes at 2000âHz using a Cerebus data acquisition system (BlackRock Microsystems, UT, USA) with a bandpass of 0.3â500âHz (4th order Butterworth filter). We visually inspected raw signals for the presence of recording artifacts and interictal epileptic spikes. Channels with excessive noise were excluded to prevent noise from spreading to other channels through re-reference. Signals were notch filtered (60âHz and its harmonics) to reduce line-noise artifacts and re-referenced through bipolar reference to reduce the effects of volume conduction96. We then down-sampled referenced signals to 1000âHz.
Spectral power analysis
We performed Hilbert transform to estimate spectral power in six different frequency bands: 1â4âHz (delta), 4â8âHz (theta), 8â12âHz (alpha), 12â30âHz (beta), 35â50âHz (gamma) and 70â150âHz (high-gamma). Spectral power values for each trial were calculated by averaging the squared magnitude of the Hilbert transform decomposition. For the delay period, spectral power was averaged across a time window beginning at behavioral response and ending at 500âms after choice. For the feedback period, the mean spectral power was calculated across a time window beginning at feedback onset and ending at feedback offset, which was set at 1000âms after feedback onset. The spectral power values were then normalized as a percent change relative to the baseline, which was the fixation cross period at the beginning of each trial. To ensure data quality, any value that deviated by more than three standard deviations from the mean value was considered an outlier and excluded from further analysis. Lastly, the values were z-scored within each contact for visualization.
Data visualization in the template brain
Contacts were projected onto the MNI space and visualized using the open-source software RAVE (R Analysis and Visualization of iEEG)97. To analyze the data during the delay period, a two-sample t test was employed to generate a t value for each contact. These t values, which compared the average spectral power after the rich and lean stimuli, were then plotted onto the template brain. Similarly, for the analysis of the data during the feedback period, a two-sample t test was used to compare reward feedback with neutral feedback, and the resulting t values were plotted at each contact.
Linear mixed effect model
Statistical analyses on data from all channels mainly focused on the effects of different conditions on the spectral power change. We used a linear mixed effect model to quantify the effect of stimulus type or feedback type on the power change. We modeled stimulus type or feedback type as a fixed effect, and channels and participants as nested random effects. The formula for neural activity during the delay period is Power ~ stimulusâ+â(1|Subject/chan) while The formula for neural activity during the feedback period is Power ~ feedbackâ+â(1|Subject/chan). This model was used for the control group and the depression group separately. To further test whether the power change was modulated by the group type, we included both the group type and the feedback type as fixed effects in the linear mixed effect model and calculated the corresponding p value. The formula used in this analysis is Power ~ feedback+grouptype+feedback*grouptypeâ+â(1|Subject/chan). The p value associated with each parameter is derived from a two-sided test.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
Processed data is provided at the following address: https://osf.io/t3usq/. The raw intracranial EEG data are available upon request for reasons of patient confidentially. Source data are provided with this paper.
Code availability
Code used in this study is available under: https://osf.io/t3usq/.
Change history
25 June 2025
A Correction to this paper has been published: https://doi.org/10.1038/s41467-025-61277-0
References
Averbeck, B. B. & Costa, V. D. Motivational neural circuits underlying reinforcement learning. Nat. Neurosci. 20, 505â512 (2017).
Howard-Jones, P. A. & Jay, T. Reward, learning and games. Curr. Opin. Behav. Sci. 10, 65â72 (2016).
Rangel, A. & Hare, T. Neural computations associated with goal-directed choice. Curr. Opin. Neurobiol. 20, 262â270 (2010).
Kangas, B. D., Wooldridge, L. M., Luc, O. T., Bergman, J. & Pizzagalli, D. A. Empirical validation of a touchscreen probabilistic reward task in rats. Transl. Psychiatry 10, 1â11 (2020).
Pizzagalli, D. A., Jahn, A. L. & OâShea, J. P. Toward an objective characterization of an anhedonic phenotype: a signal-detection approach. Biol. Psychiatry 57, 319â327 (2005).
Pizzagalli, D. A., Iosifescu, D., Hallett, L. A., Ratner, K. G. & Fava, M. Reduced hedonic capacity in major depressive disorder: evidence from a probabilistic reward task. J. Psychiatr. Res. 43, 76â87 (2008).
Rorie, A. E., Gao, J., McClelland, J. L. & Newsome, W. T. Integration of sensory and reward information during perceptual decision-making in Lateral Intraparietal Cortex (LIP) of the Macaque Monkey. PLOS ONE 5, e9308 (2010).
Henschke, J. U. et al. Reward association enhances stimulus-specific representations in primary visual cortex. Curr. Biol. 30, 1866â1880.e5 (2020).
OâDoherty, J. P., Cockburn, J. & Pauli, W. M. Learning, reward, and decision making. Annu. Rev. Psychol. 68, 73â100 (2017).
Schultz, W. Neuronal reward and decision signals: from theories to data. Physiol. Rev. 95, 853â951 (2015).
Admon, R. & Pizzagalli, D. A. Dysfunctional reward processing in depression. Curr. Opin. Psychol. 4, 114â118 (2015).
Der-Avakian, A., Barnes, S. A., Markou, A. & Pizzagalli, D. A. Translational Assessment of Reward and Motivational Deficits in Psychiatric Disorders. In Translational Neuropsychopharmacology (eds. Robbins, T. W. & Sahakian, B. J.) 231â262 (Springer International Publishing, Cham, 2016). https://doi.org/10.1007/7854_2015_5004.
Pizzagalli, D. A. Depression, stress, and anhedonia: toward a synthesis and integrated model. Annu. Rev. Clin. Psychol. 10, 393â423 (2014).
Safra, L., Chevallier, C. & Palminteri, S. Depressive symptoms are associated with blunted reward learning in social contexts. PLOS Comput. Biol. 15, e1007224 (2019).
Whitton, A. E., Treadway, M. T. & Pizzagalli, D. A. Reward processing dysfunction in major depression, bipolar disorder and schizophrenia. Curr. Opin. Psychiatry 28, 7â12 (2015).
Park, L. T. & Zarate, C. A. Depression in the primary care setting. N. Engl. J. Med. 380, 559â568 (2019).
Williams, L. M. Precision psychiatry: a neural circuit taxonomy for depression and anxiety. Lancet Psychiatry 3, 472â480 (2016).
Alexander, L. et al. Fractionating blunted reward processing characteristic of anhedonia by over-activating primate subgenual anterior cingulate cortex. Neuron 101, 307â320.e6 (2019).
Keren, H. et al. Reward processing in depression: a conceptual and meta-analytic review across fMRI and EEG studies. Am. J. Psychiatry 175, 1111â1120 (2018).
Moutoussis, M. et al. Decision-making ability, psychopathology, and brain connectivity. Neuron 109, 2025â2040.e7 (2021).
Vrieze, E. et al. Reduced reward learning predicts outcome in major depressive disorder. Biol. Psychiatry 73, 639â645 (2013).
Widge, A. S. & Miller, E. K. Targeting cognition and networks through neural oscillations: next-generation clinical brain stimulation. JAMA Psychiatry 76, 671â672 (2019).
Cohen, M. X., Elger, C. E. & Ranganath, C. Reward expectation modulates feedback-related negativity and EEG spectra. Neuroimage 35, 968â978 (2007).
HajiHosseini, A., RodrĂguez-Fornells, A. & Marco-PallarĂŠs, J. The role of beta-gamma oscillations in unexpected rewards processing. NeuroImage 60, 1678â1685 (2012).
HajiHosseini, A. & Holroyd, C. B. Sensitivity of frontal beta oscillations to reward valence but not probability. Neurosci. Lett. 602, 99â103 (2015).
Marco-PallarĂŠs, J., MĂźnte, T. F. & RodrĂguez-Fornells, A. The role of high-frequency oscillatory activity in reward processing and learning. Neurosci. Biobehav. Rev. 49, 1â7 (2015).
Marco-Pallares, J. et al. Human oscillatory activity associated to reward processing in a gambling task. Neuropsychologia 46, 241â248 (2008).
Clark, D. L., Brown, E. C., Ramasubbu, R. & Kiss, Z. H. T. Intrinsic local beta oscillations in the subgenual cingulate relate to depressive symptoms in treatment-resistant depression. Biol. Psychiatry 80, e93âe94 (2016).
Huebl, J. et al. Processing of emotional stimuli is reflected by modulations of beta band activity in the subgenual anterior cingulate cortex in patients with treatment resistant depression. Soc. Cogn. Affect Neurosci. 11, 1290â1298 (2016).
Merkl, A. et al. Modulation of beta-band activity in the subgenual anterior cingulate cortex during emotional empathy in treatment-resistant depression. Cereb. Cortex 26, 2626â2638 (2016).
Alagapan, S. et al. Cingulate dynamics track depression recovery with deep brain stimulation. Nature 622, 130â138 (2023).
Rudebeck, P. H. & Murray, E. A. The orbitofrontal oracle: cortical mechanisms for the prediction and evaluation of specific behavioral outcomes. Neuron 84, 1143â1156 (2014).
Heilbronner, S. R. & Hayden, B. Y. Dorsal anterior cingulate cortex: a bottom-up view. Annu. Rev. Neurosci. 39, 149â170 (2016).
Monosov, I. E., Haber, S. N., Leuthardt, E. C. & Jezzini, A. Anterior cingulate cortex and the control of dynamic behavior in primates. Curr. Biol. 30, R1442âR1454 (2020).
Bush, G. et al. Dorsal anterior cingulate cortex: a role in reward-based decision making. Proc. Natl Acad. Sci. 99, 523â528 (2002).
Etkin, A., BĂźchel, C. & Gross, J. J. The neural bases of emotion regulation. Nat. Rev. Neurosci. 16, 693â700 (2015).
Hayden, B. Y., Pearson, J. M. & Platt, M. L. Fictive reward signals in the anterior cingulate cortex. Science 324, 948â950 (2009).
Otis, J. M. et al. Prefrontal cortex output circuits guide reward seeking through divergent cue encoding. Nature 543, 103â107 (2017).
Terem, A. et al. Claustral neurons projecting to frontal cortex mediate contextual association of reward. Curr. Biol. 30, 3522â3532.e6 (2020).
Cheng, W. et al. Increased functional connectivity of the posterior cingulate cortex with the lateral orbitofrontal cortex in depression. Transl. Psychiatry 8, 1â10 (2018).
McGovern, R. A. & Sheth, S. A. Role of the dorsal anterior cingulate cortex in obsessive-compulsive disorder: converging evidence from cognitive neuroscience and psychiatric neurosurgery. J. Neurosurg. 126, 132â147 (2017).
Rolls, E. T. et al. Functional connectivity of the anterior cingulate cortex in depression and in health. Cereb. Cortex 29, 3617â3630 (2019).
Sheth, S. A. et al. Human dorsal anterior cingulate cortex neurons mediate ongoing behavioural adaptation. Nature 488, 218â221 (2012).
Sheth, S. A. et al. Limbic system surgery for treatment-refractory obsessive-compulsive disorder: a prospective long-term follow-up of 64 patients: clinical article. J. Neurosurg. 118, 491â497 (2013).
Steele, J. D., Christmas, D., Eljamel, M. S. & Matthews, K. Anterior cingulotomy for major depression: clinical outcome and relationship to lesion characteristics. Biol. Psychiatry 63, 670â677 (2008).
Whitton, A. E. et al. Blunted neural responses to reward in remitted major depression: a high-density event-related potential study. Biol. Psychiatry. Cogn. Neurosci. Neuroimagin. 1, 87â95 (2016).
Wang, S., Leri, F. & Rizvi, S. J. Anhedonia as a central factor in depression: neural mechanisms revealed from preclinical to clinical evidence. Prog. Neuropsychopharmacol. Biol. Psychiatry 110, 110289 (2021).
American Psychiatric Publishing, Inc. Diagnostic and Statistical Manual of Mental Disorders. DSM Library https://dsm.psychiatryonline.org/doi/book/10.1176/appi.books.9780890425596 (2013).
Dowd, E. C., Frank, M. J., Collins, A., Gold, J. M. & Barch, D. M. Probabilistic reinforcement learning in patients with schizophrenia: relationships to anhedonia and avolition. Biol. Psychiatry Cogn. Neurosci. Neuroimagin. 1, 460â473 (2016).
Husain, M. & Roiser, J. P. Neuroscience of apathy and anhedonia: a transdiagnostic approach. Nat. Rev. Neurosci. 19, 470â484 (2018).
Stull, S. W., Bertz, J. W., Epstein, D. H., Bray, B. C. & Lanza, S. T. Anhedonia and substance use disorders by type, severity, and with mental health disorders. J. Addict. Med. 16, e150âe156 (2022).
Ward, J. et al. Novel genome-wide associations for anhedonia, genetic correlation with psychiatric disorders, and polygenic association with brain structure. Transl. Psychiatry 9, 1â9 (2019).
Insel, T. et al. Research Domain Criteria (RDoC): toward a new classification framework for research on mental disorders. Am. J. Psychiatry 167, 748â751 (2010).
Morris, B. H., Bylsma, L. M., Yaroslavsky, I., Kovacs, M. & Rottenberg, J. Reward learning in pediatric depression and anxiety: preliminary findings in a high-risk sample. Depress. Anxiety 32, 373â381 (2015).
OâDoherty, J. P. Reward representations and reward-related learning in the human brain: insights from neuroimaging. Curr. Opin. Neurobiol. 14, 769â776 (2004).
Rangel, A., Camerer, C. & Montague, P. R. A framework for studying the neurobiology of value-based decision making. Nat. Rev. Neurosci. 9, 545â556 (2008).
Fatahi, Z., Haghparast, A., Khani, A. & Kermani, M. Functional connectivity between anterior cingulate cortex and orbitofrontal cortex during value-based decision making. Neurobiol. Learn. Mem. 147, 74â78 (2018).
Kahnt, T., Heinzle, J., Park, S. Q. & Haynes, J.-D. The neural code of reward anticipation in human orbitofrontal cortex. Proc. Natl Acad. Sci. 107, 6010â6015 (2010).
Malvaez, M., Shieh, C., Murphy, M. D., Greenfield, V. Y. & Wassum, K. M. Distinct corticalâamygdala projections drive reward value encoding and retrieval. Nat. Neurosci. 22, 762â769 (2019).
Rudebeck, P. H. & Rich, E. L. Orbitofrontal cortex. Curr. Biol. 28, R1083âR1088 (2018).
Newson, J. J. & Thiagarajan, T. C. EEG frequency bands in psychiatric disorders: a review of resting state studies. Front. Hum. Neurosci. 12, 521 (2019).
Xiao, J. et al. Decoding depression severity from intracranial neural activity. Biol. Psychiatry 94, 445â453 (2023).
Adkinson, J. A. et al. Imaging versus electrographic connectivity in human mood-related fronto-temporal networks. Brain Stimul. 15, 554â565 (2022).
Allawala, A. et al. A novel framework for network-targeted neuropsychiatric deep brain stimulation. Neurosurgery 89, E116 (2021).
Sheth, S. A. et al. Deep brain stimulation for depression informed by intracranial recordings. Biol. Psychiatry 92, 246â251 (2022).
Chase, H. W. et al. Approach and avoidance learning in patients with major depression and healthy controls: relation to anhedonia. Psychol. Med. 40, 433â440 (2010).
Gradin, V. B. et al. Expected value and prediction error abnormalities in depression and schizophrenia. Brain 134, 1751â1764 (2011).
Rothkirch, M., Tonn, J., KĂśhler, S. & Sterzer, P. Neural mechanisms of reinforcement learning in unmedicated patients with major depressive disorder. Brain 140, 1147â1157 (2017).
Whitmer, A. J., Frank, M. J. & Gotlib, I. H. Sensitivity to reward and punishment in major depressive disorder: effects of rumination and of single versus multiple experiences. Cognit. Emot. 26, 1475â1485 (2012).
Gabbay, V. et al. Anhedonia, but not irritability, is associated with illness severity outcomes in adolescent major depression. J. Child Adolesc. Psychopharmacol. 25, 194â200 (2015).
Der-Avakian, A., DâSouza, M. S., Pizzagalli, D. A. & Markou, A. Assessment of reward responsiveness in the response bias probabilistic reward task in rats: implications for cross-species translational research. Transl. Psychiatry 3, e297 (2013).
IturraâMena, A. M., Kangas, B. D., Luc, O. T., Potter, D. & Pizzagalli, D. A. Electrophysiological signatures of reward learning in the rodent touchscreen-based Probabilistic Reward Task. Neuropsychopharmacology 48, 700â709 (2023).
Sailer, U., Wurm, F. & Pfabigan, D. M. Social and non-social feedback stimuli lead to comparable levels of reward learning and reward responsiveness in an online probabilistic reward task. Behav. Res. https://doi.org/10.3758/s13428-023-02255-6 (2023).
Wilkinson, M. P., Slaney, C. L., Mellor, J. R. & Robinson, E. S. J. Investigation of reward learning and feedback sensitivity in non-clinical participants with a history of early life stress. PLOS ONE 16, e0260444 (2021).
Bouthour, W. et al. Biomarkers for closed-loop deep brain stimulation in Parkinson disease and beyond. Nat. Rev. Neurol. 15, 343â352 (2019).
Lozano, A. M. et al. Deep brain stimulation: current challenges and future directions. Nat. Rev. Neurol. 15, 148â160 (2019).
Vergunst, F. K. et al. Longitudinal course of symptom severity and fluctuation in patients with treatment-resistant unipolar and bipolar depression. Psychiatry Res. 207, 143â149 (2013).
Cagnan, H., Denison, T., McIntyre, C. & Brown, P. Emerging technologies for improved deep brain stimulation. Nat. Biotechnol. 37, 1024â1033 (2019).
Little, S. et al. Adaptive deep brain stimulation in advanced Parkinson disease. Ann. Neurol. 74, 449â457 (2013).
Rosa, M. et al. Adaptive deep brain stimulation in a freely moving Parkinsonian patient. Mov. Disord. 30, 1003â1005 (2015).
Chudasama, Y. et al. The role of the anterior cingulate cortex in choices based on reward value and reward contingency. Cereb. Cortex 23, 2884â2898 (2013).
Fox, K. C. R. et al. Intrinsic network architecture predicts the effects elicited by intracranial electrical stimulation of the human brain. Nat. Hum. Behav. 4, 1039â1052 (2020).
Parvizi, J., Rangarajan, V., Shirer, W. R., Desai, N. & Greicius, M. D. The will to persevere induced by electrical stimulation of the human cingulate Gyrus. Neuron 80, 1359â1367 (2013).
Holroyd, C. B. & Umemoto, A. The research domain criteria framework: the case for anterior cingulate cortex. Neurosci. Biobehav. Rev. 71, 418â443 (2016).
Shenhav, A., Botvinick, M. M. & Cohen, J. D. The expected value of control: an integrative theory of anterior cingulate cortex function. Neuron 79, 217â240 (2013).
PolanĂa, R., Krajbich, I., Grueschow, M. & Ruff, C. C. Neural oscillations and synchronization differentially support evidence accumulation in perceptual and value-based decision making. Neuron 82, 709â720 (2014).
Bogaerts, L., Richter, C. G., Landau, A. N. & Frost, R. Beta-band activity is a signature of statistical learning. J. Neurosci. 40, 7523â7530 (2020).
Espenhahn, S. et al. Cortical beta oscillations are associated with motor performance following visuomotor learning. NeuroImage 195, 340â353 (2019).
Fries, P. Rhythms for cognition: communication through coherence. Neuron 88, 220â235 (2015).
Ghiani, A., Maniglia, M., Battaglini, L., Melcher, D. & Ronconi, L. Binding mechanisms in visual perception and their link with neural oscillations: a review of evidence from tACS. Front. Psychol. 12, 643677 (2021).
Lundqvist, M. et al. Gamma and beta bursts underlie working memory. Neuron 90, 152â164 (2016).
Lundqvist, M., Herman, P., Warden, M. R., Brincat, S. L. & Miller, E. K. Gamma and beta bursts during working memory readout suggest roles in its volitional control. Nat. Commun. 9, 394 (2018).
Fischl, B. FreeSurfer. NeuroImage 62, 774â781 (2012).
Jenkinson, M., Bannister, P., Brady, M. & Smith, S. Improved optimization for the robust and accurate linear registration and motion correction of brain images. Neuroimage 17, 825â841 (2002).
Joshi, A. et al. Unified framework for development, deployment and robust testing of neuroimaging algorithms. Neuroinformatics 9, 69â84 (2011).
Bastos, A. M. & Schoffelen, J.-M. A tutorial review of functional connectivity analysis methods and their interpretational pitfalls. Front. Syst. Neurosci. 9, 175 (2015).
Magnotti, J. F., Wang, Z. & Beauchamp, M. S. RAVE: comprehensive open-source software for reproducible analysis and visualization of intracranial EEG data. NeuroImage 223, 117341 (2020).
Acknowledgements
We thank our study participants for their commitment and trust. This work was supported by the National Institutes of Health (Grant No. UH3 NS103549 [to S.A.S., K.R.B., W.K.G., and N.P.], Grant No. K01 MH116364 [to K.R.B.], Grant No. R21 NS104953 [to K.R.B.], Grant No. UH3 NS100549 [to W.K.G. and S.A.S.], and Grant No. R01 MH114854 [to W.K.G.]) and McNair Foundation (to S.A.S.).
Author information
Authors and Affiliations
Contributions
J.X. and S.A.S. conceptualized and designed the study. J.X. performed data collection, analyzed the data, and drafted the manuscript with support from J.A.A., J.M., A.B.A., R.K.M., V.P., R.N., N.R.P., E.B., A.J.W., D.O., R.G., A.A., and B.S. S.A.S., N.P., W.K.G., S.J.M., and K.R.B. oversaw the organization of the clinical trial, subject recruitment, and regulatory activities. B.H., X.P., and S.A.S. supervised and guided the data analysis. J.X., B.H., and S.A.S. wrote and edited the manuscript, and all authors contributed to review and revision of the manuscript.
Corresponding author
Ethics declarations
Competing interests
S.A.S. has consulting agreements with Boston Scientific, NeuroPace, Abbott, and Zimmer Biomet. W.K.G. receives royalties from Nview, LLC and OCDscales, LLC. S.J.M. has served as a consultant to the following companies: Almatica Pharma, Biohaven, BioXcel Therapeutics, Boehringer-Ingelheim, Brii Biosciences, Clexio Biosciences, COMPASS Pathways, Delix Therapeutics, Douglas Pharmaceuticals, Eleusis, Engrail Therapeutics, Freedom Biosciences, Janssen, Liva Nova, Levo Therapeutics, Merck, Neumora, Neurocrine, Perception Neurosciences, Praxis Precision Medicines, Relmada Therapeutics, Sage Therapeutics, Seelos Therapeutics, Signant Health, Sunovion, Xenon Pharmaceuticals, and XW Pharma. S.J.M. has received research support from Boehringer-Ingelheim, Engrail Therapeutics, Merck, Neurocrine, and Sage Therapeutics. N.P. is a consultant for Abbott Laboratories and Sensoria Therapeutics. The remaining authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Josef Parvizi, John Rolston, Nelleke van Wouwe and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the articleâs Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the articleâs Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Xiao, J., Adkinson, J.A., Myers, J. et al. Beta activity in human anterior cingulate cortex mediates reward biases. Nat Commun 15, 5528 (2024). https://doi.org/10.1038/s41467-024-49600-7
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41467-024-49600-7
This article is cited by
-
ENX-104: a selective and potent D2/D3 receptor antagonist enhances dopamine neurotransmission and reward responsiveness in translational rodent models
Neuropsychopharmacology (2025)
-
The diagnostic validity of central symptoms for major depressive disorder in adults with attention-deficit/hyperactivity disorder: a network analysis
European Archives of Psychiatry and Clinical Neuroscience (2025)







