Identification of conserved frontal neurophysiological markers of cognitive flexibility in humans and rats

Der-Avakian, Andre; Barnes, Samuel A.; Lees, Ty; Schroder, Hans S.; Kangas, Brian D.; Linton, Samantha R.; Nickels, Stefanie; Robble, Mykel A.; Breiger, Micah; Iturra-Mena, Ann M.; Lobien, Rachel; Perlo, Sarah; Cárdenas, Emilia F.; Nowicki, Genevieve P.; Wu, Zeyun; Pan, Hongyi; Dillon, Daniel G.; Kesby, James P.; Bergman, Jack; Carlezon, William A.; Risbrough, Victoria B.; Mukamel, Eran; Leutgeb, Stefan; Pizzagalli, Diego A.

doi:10.1038/s42003-025-08729-x

Download PDF

Article
Open access
Published: 23 August 2025

Identification of conserved frontal neurophysiological markers of cognitive flexibility in humans and rats

Communications Biology volume 8, Article number: 1268 (2025) Cite this article

2369 Accesses
3 Altmetric
Metrics details

Subjects

Abstract

Cognitive flexibility broadly describes behavioral alterations made in response to environmental changes and is fundamental for survival. While human and non-human animal assessments of cognitive flexibility are available, a systematic cross-species comparison of behavioral, neurophysiological, and computational markers of cognitive flexibility has not been reported. Using versions of a probabilistic reversal learning task aligned between humans and rats, electroencephalogram recordings reveal a frontal reward positivity (RewP) associated with unexpected reward outcomes. Reinforcement Q-learning models of both species’ task behavior reveal that prediction error (PE) magnitude was significantly related to RewP amplitude. The stimulant drug modafinil alters PEs in rats without affecting the RewP in either species. These findings reveal analogous neurophysiological markers associated with PEs in humans and rats using equivalent tasks and identical computational analyses. This translational approach may improve the predictive validity of tests for novel pharmacotherapies and accelerate neuropsychiatric treatment by assessing neural mechanisms conserved across species.

Concordant neurophysiological signatures of cognitive control in humans and rats

Article 21 March 2021

Electrophysiological biomarkers of behavioral dimensions from cross-species paradigms

Article Open access 17 September 2021

Flexible adaptation of task-positive brain networks predicts efficiency of evidence accumulation

Article Open access 02 July 2024

Introduction

Cognitive flexibility broadly refers to the ability to alter behavior in response to a changing environment^1,2,3, and such adjustments are a vital component of navigating everyday life^4,5. It describes the balance between repeating behaviors that yield beneficial outcomes toward a particular goal and modifying behaviors in response to environmental changes^3,6. This modification of behavioral strategy is key to cognitive flexibility and enables previously learned rules to be updated to maximize rewarded outcomes. Cognitive flexibility has been studied across several key domains, including set-shifting, working memory, and reversal learning⁶. Moreover, human research has found that cognitive flexibility is associated with resilience to stress and negative life events^4,7 and overall higher quality of life⁸. Conversely, deficits in cognitive flexibility have been documented in a multitude of psychiatric disorders, including mood disorders^9,10,11, schizophrenia^12,13, obsessive compulsive disorder^14,15, and substance use disorder¹⁶. If the neurobiological processes underlying cognitive flexibility were better understood, they could be targeted for treatment in these patient populations.

Reversal learning tasks, particularly probabilistic versions, are ideal for assessing cognitive flexibility^17,18,19. Although several variations of the probabilistic reversal learning (PRL) task exist, they are all based on a fundamental reinforcement learning paradigm whereby provided feedback is not always informative and the stimulus associated with the higher probability of reward is subject to change. For example, in a common PRL procedure, subjects are presented with two distinctive stimuli, one of which is associated with a high (e.g., 80%) probability of reinforcement, while the other is associated with a low (e.g., 20%) probability of reinforcement. By sampling both stimuli, subjects learn to maximize responding to the stimulus associated with a greater probability of reinforcement (i.e., target stimulus). After a pre-determined criterion (e.g., consecutive responses for the target stimulus), the reinforcement contingencies are reversed, and subjects need to identify the new target stimulus (identified as a reversal). Thus, PRL tasks require subjects to ignore occasional negative outcomes while responding to the target stimulus and avoid perseverating on a previously successful action when the reward contingencies are reversed. Individuals with major depressive disorder are sensitive to negative events, and for example, may respond to the occasional misleading negative feedback following a target response by prematurely responding for the other (i.e., non-target) stimulus²⁰, whereas patients with obsessive compulsive disorder may perseverate on a stimulus that is no longer the target stimulus, despite the new relatively low reinforcement probability²¹. In either case, patients may complete fewer reversals, indicative of impairment in adaptive behavior.

Expression of flexible behavior during a PRL task requires successful and rapid reinforcement learning²², in particular, the ability to parse the likelihood of reward and identify the target stimulus. As a result, a violation of the expected outcome, such as the omission of reward following a target response (i.e., due to probabilistic feedback), should elicit a reward prediction error (PE). Frequent, but not occasional, PEs should facilitate a change in response strategy. PEs have been associated with changes in event-related potentials (ERPs) over frontocentral electrode sites, namely the Reward positivity (RewP)^23,24. Originally named the feedback-related negativity^25,26, and identified as a negative deflection following unexpected negative feedback (i.e., a negative PE) relative to expected positive feedback^27,28, more recent evidence clarified that the RewP is driven by the response to reward feedback rather than non-rewarded outcomes^23,29,30. Evidence suggests that the FRN/RewP originates in the anterior cingulate cortex (ACC)^31,32,33. Alterations in the activity of midbrain dopamine neurons are known to code PEs^34,35,36, whereby negative PEs elicit decreased firing of these neurons and phasic activation encodes positive PEs. This dopaminergic signal projects to the striatum and ACC and may modulate the FRN/RewP, which can subsequently predict behavioral adjustments^26,37,38.

Non-human animal versions of the PRL task are critical for advancing our understanding of cognitive flexibility. In particular, the operant nature of the task enables translation across species; moreover, task behavior is well-characterized by reinforcement learning models, enabling application of the same (or very similar) models to data across species³⁹. To study this phenomenon in non-human animals, rodent versions of the PRL task have been developed^40,41. While human and non-human versions of the task are conceptually similar and may yield comparable behavioral results, to our knowledge, a systematic cross-species comparison of behavioral, computational, and neurophysiological markers of cognitive flexibility has not been reported. Such a comparison would be useful for drug development because putative treatments that demonstrate therapeutic behavioral effects and target engagement in non-human animals could be used in parallel human testing to accelerate drug discovery.

Thus, we modified human and rodent versions of a PRL task to align several parameters, enabling the comparison of behavioral performance across species. We measured the electroencephalogram (EEG) during the task to determine whether unexpected rewards were associated with a frontal RewP. Additionally, we applied a reinforcement Q-learning model to model behavior and linked it to EEG data from both species to compare the relationship between action values and frontal neurophysiological signals. Lastly, we administered comparable doses of modafinil, an indirect dopamine (DA) agonist that also has norepinephrinergic effects, to both species to determine whether modulation of dopaminergic signaling similarly altered the behavioral and neurophysiological indices of cognitive flexibility in humans and rats. Prior studies have shown that DA agonists increase reward-related ERPs during reward learning⁴² and modafinil increases DA signaling in the striatum (via the inhibition of DA transporters), the neural substrate for error prediction⁴³. Critically, we identified behavioral, computational, and neurophysiological markers of cognitive flexibility that were similar across species, providing a robust platform that could hasten treatment development.

Results

We instructed humans (n = 54) and trained rats (n = 11) to perform functionally identical versions of a PRL task (Supplementary Fig. 1) that consisted of 300 trials and reinforced responses probabilistically using an 80%/20% schedule (see “Methods” section). For both species, the reward contingencies reversed after eight consecutive correct responses, and EEG recordings were obtained during task performance (see “Methods” section).

Behavioral indices of cognitive flexibility and reinforcement metrics are consistent across humans and rats

During a single test session, humans completed 4.4 ± 0.23 (mean ± SEM) reversals per 100 trials (Fig. 1A), whereas rats completed 2.6 ± 0.40 (Fig. 1B). Repeating a target response after reward delivery (i.e., target win-stay) and abandoning the target response after non-reward delivery (i.e., target lose-shift) reflect responsiveness to positive and negative feedback, respectively. In both humans (Fig. 1A) and rats (Fig. 1B), target win-stay probability was significantly greater than target lose-shift probability [humans: t(53) = 13.93, p < 0.001; rats: t(10) = 6.20, p < 0.001]. Increased target win-stay and reduced target lose-shift responding facilitated more reversals, as reflected by significant positive correlations between target win-stay probability and reversals in humans (Pearson r(52) = 0.44, p < 0.001; Fig. 1C) and rats (r(9) = 0.79, p = 0.004; Fig. 1D), and significant negative correlations between target lose-shift probability and reversals in humans (r(52) = −0.58, p < 0.001; Fig. 1C) and rats (r(9) = −0.73, p = 0.012; Fig. 1D). Thus, in both species, greater sustained responding for reinforced target responses, despite occasional misleading feedback, was associated with better task performance.

**Fig. 1: PRL behavior was comparable between humans (left) and rats (right) performing similar versions of the task.**

Reinforcement learning models identify consistent patterns of behavioral responding across species

To gain a deeper insight into the behavioral mechanisms underlying PRL performance, we fit several Q-learning models to behavior (see Supplementary Methods). Consistent with our previous work^41,44, we found that the model consisting of three free parameters (learning rate, inverse temperature, and forget parameters) best fit PRL performance (Supplementary Table 1). Moreover, parameter recovery and posterior predictive checks (Supplementary Fig. 2) demonstrated recoverable parameter estimates and alignment between simulated and observed PRL performance.

As subjects perform the PRL task, the value of each chosen action (i.e., Q-value) is updated based on reinforcement. Thus, during the task, the value of the target stimulus fluctuates to reflect the changing probability of reward delivery (see Supplementary Fig. 3A, B for representative Q-values across a test session for both species). As expected, humans assigned greater Q-values to target stimuli (0.710 ± 0.007) than non-target stimuli (0.329 ± 0.01) [t(106) = 25.51; p < 0.001] (Supplementary Fig. 3C), as did the rats (0.550 ± 0.036 vs. 0.275 ± 0.034) [t(20) = 6.08, p < 0.001] (Supplementary Fig. 3C), indicating that both species learned to appropriately value stimuli based on experience throughout the task.

The beta parameter reflects the degree to which subjects explore both actions (lower beta value) vs. exploit the highest value action (higher beta value)⁴⁵. Higher beta values were significantly associated with more reversals in humans (r(52) = 0.65, p < 0.001; Fig. 1E) and rats (r(9) = 0.69, p = 0.019; Fig. 1F). Thus, consistent with the target win-stay and lose-shift correlations described above, exploiting high value actions and limiting exploration to periods when those actions become less favorable (i.e., after reversals) was an adaptive strategy for both species. Conversely, there was no association between alpha or forget parameters and reversals in either species (Supplementary Fig. 4).

Electrophysiological markers associated with reward expectancy are consistent across humans and rats

As humans and rats performed the PRL task, continuous EEG was recorded and averaged across rewarded and non-rewarded trials. A RewP emerged in frontal recording sites (e.g., frontocentral electrode (FCz) in humans (Fig. 2A); ACC in rats (Fig. 2B)) in response to positive vs. negative feedback. Highlighting spatial specificity, these effects were not present at parietal electrodes in either species (p’s > 0.168). The RewP peaked in the human FCz electrode approximately 200 ms following feedback and approximately 100 ms after reward feedback in the rat ACC, and reflected a more positive frontal signal for a rewarded vs. non-rewarded response (consistent with a positive PE). When broken down further into the four possible trial types, the average RewP amplitude was greater for rewarded compared to unrewarded trials after both target and non-target responses in both humans (Fig. 2C) and rats (Fig. 2D). In humans, a 2-way ANOVA revealed a significant Response × Feedback interaction [F(1,33) = 6.10, p = 0.019], and Bonferroni post-hoc tests revealed significantly greater voltage during rewarded vs. unrewarded target trials (p = 0.002). Moreover, a 1-way ANOVA revealed a significant linear effect of trial type [F(3,99) = 30.23, p < 0.001], and Bonferroni post-hoc tests revealed progressively increasing voltage that scaled with expected PE magnitude: Target/No Reward (expected to elicit the largest negative PE), Non-target/No Reward, Target/Reward, Non-target/Reward (expected to elicit the largest positive PE). In rats, a 2-way ANOVA did not reveal a significant Response × Feedback interaction, but did reveal a significant main effect of Feedback [F(1,9) = 37.73; p < 0.001], and Bonferroni post-hoc tests revealed significantly more positive voltage during rewarded vs. unrewarded target trials (p < 0.001). Critically, like humans, a 1-way ANOVA in rodents revealed a significant linear effect of trial type [F(3,27) = 18.52, p < 0.001], and Bonferroni post-hoc tests revealed progressively increasing voltage in the same order as humans.

**Fig. 2: Frontal feedback-locked event-related potentials (ERPs) and prediction error (PE) values were comparable between humans (left) and rats (right) during performance of the PRL task.**

Prediction errors associated with reward expectancy are consistent across humans and rats

Based on the value assigned to the chosen option on any trial, a PE can be computed depending on the outcome of that trial. For example, rewarded target responses (i.e., informative feedback) should elicit small positive PEs, whereas non-rewarded target responses (i.e., misleading feedback) should elicit large negative PEs^46,47. Indeed, the PEs for rewarded and unrewarded outcomes were positive and negative, respectively, and PE magnitude was greater when the outcome of the selected action was unexpected (Fig. 2E, F). In humans, a 2-way ANOVA revealed a significant main effect of Feedback (F(1, 53) = 22,570.94, p < 0.001), but no Response × Feedback interaction. Post-hoc tests revealed significantly greater PEs during rewarded vs. unrewarded target trials (p < 0.001). Consistent with the linear effect of the four trial types on RewP voltage, 1-way ANOVA revealed a significant linear effect of trial type [F(3,159) = 3571.61, p < 0.001], and Bonferroni post-hoc tests revealed progressively increasing PE values in the same order as described for RewP voltage. In rats, a 2-way ANOVA revealed a significant main effect of Feedback (F(1, 10) = 5698.12, p < 0.001), but no Response × Feedback interaction. Post-hoc tests revealed significantly greater PE during rewarded vs. unrewarded target trials (p < 0.001). Similar to humans, the 1-way ANOVA across all four trial types in rodents revealed a significant effect [F(3,30) = 882.49, p < 0.001], due to progressively increasing PEs. Interestingly, in both species, PEs were more negative for unrewarded target vs. non-target responses and more positive for rewarded non-target vs. target responses (all p’s < 0.001), suggesting that reward expectancy was greater following target vs. non-target responses in both groups.

Relationships between behavioral and electrophysiological measures of PEs

Next, we investigated associations between model parameters and electrophysiological markers. Using linear regression, we used trial-level data to determine the association between the ERP/LFP voltage at each timepoint within each trial and the PE associated with that trial. Notably, in both species (Fig. 3A, B), the regression coefficient peaked in the time window corresponding to the RewP. This indicates that during the RewP period, greater ERP/LFP amplitudes are associated with more positive PEs, whereas lower ERP/LFP amplitudes are associated with less positive PEs. That is, the more positive ERP/LFP for rewarded vs. unrewarded non-target responses appears to reflect the positive PE that occurs when rewards are unexpected. Analogous models using outcome and expected value (i.e., the subcomponents of PE) are presented in Supplementary Fig. 5.

**Fig. 3: The strong relationship between ERP voltage following reward feedback and PE values was comparable between humans (left) and rats (right).**

To provide additional validation for this regression approach, we used the intercept and regression coefficient generated at each timepoint to predict the ERP/LFP amplitude associated with various PEs. The predicted ERP/LFP traces (Fig. 3C, D) replicated the traces observed in humans and rats (Fig. 2A, B), and, importantly, differences in the predicted traces emerged at the time window corresponding to the RewP. Moreover, the predicted ERP/LFP was greater when the hypothetical PE was more positive. To ensure that the predicted ERP/LFP traces aligned with the activity we recorded from our human or rodent subjects, we plotted the ERP/LFP for trials associated with a PE greater than 0.5 or less than −0.5 (Fig. 3E, F). As expected, and consistent with our predicted ERP/LFP values, more positive PEs were associated with a more positive ERP/LFP, and this difference was most evident in the RewP time window.

Modafinil effects on frontal electrophysiological signals and PE values

To understand whether electrophysiological markers of cognitive flexibility are modulated by altering dopamine transmission, we administered the dopamine transport blocker modafinil to a separate cohort of humans (N = 29) and the same cohort of rats described above (N = 11) prior to PRL testing and EEG recording. Under the placebo/vehicle condition, we successfully replicated the frontal RewP observed between unrewarded and rewarded responses in both humans (Fig. 4A) and rats (Fig. 4B). In humans, a 1-way ANOVA of the ERPs following target responses revealed a significant main effect of Feedback [F(1,26) = 10.16, p = 0.004] (Fig. 4C). The same significant Feedback effect [F(1,9) = 15.08, p = 0.004] was identified in the 1-way ANOVA of the rodent LFPs (Fig. 4D). In both species, ERPs/LFPs were more negative following unrewarded target responses relative to rewarded target responses, with, unexpectedly, no effect of modafinil (see Supplementary Fig. 6 for ERPs from all modafinil doses for both species).

**Fig. 4: Modafinil did not alter feedback-locked ERPs, but did disrupt PE values in rats at high doses.**

We then fitted the same Q-learning algorithm to PRL performance. One-way ANOVA models examining PEs following target responses revealed a significant main effect of Feedback for both humans [F(1,28) = 7630.16, p < 0.001] (Fig. 4E) and rats [F(1,10) = 2737.01, p < 0.001] (Fig. 4F). Interestingly, in rats, a 1-way ANOVA across all modafinil doses revealed significant linear effects for the rewarded condition [F(5,50) = 2.62, p = 0.035] with higher doses resulting in more positive PEs; this was not the case for the unrewarded condition [F(5,50) = 1.90, p = 0.111]. Although the linear effects analysis of human PEs was not significant, the pattern of results was similar to rodents (i.e., increasing PEs with increasing modafinil dose). Although a wide dose response range was used in rodents, matching doses across species is challenging⁴⁸ and it is possible that the higher doses used in rats (e.g., 32 and 64 mg/kg) were relatively greater than the highest human dose. Thus, the pattern of increasing PEs across doses may have been more robust if the human participants were exposed to higher modafinil doses.

Discussion

Using analogous PRL tasks across species, along with equivalent EEG data processing and identical computational analyses of behavioral and EEG data, we found remarkably concordant behavioral, computational, and electrophysiological findings across species, including a neurophysiological response consistent with a RewP in both humans and rats. In humans, the RewP was observed over frontocentral brain regions, while the rodent RewP was recorded directly from the ACC. Notably, in humans, this component peaked approximately 200 ms following feedback, which is earlier than the more typically observed ~300 ms peak⁴⁹. We postulate that this latency shift is likely due to using auditory feedback rather than the more typical visual feedback, but recognize that other task design effects may also be involved in this latency shift. Behavioral performance between species was comparable, as indicated by greater sensitivity to positive feedback (i.e., target win-stay) relative to negative feedback (i.e., target lose-shift), both of which significantly correlated with the overall number of reversals in both species. Although the number of reversals and win-stay responses were generally higher and lose-shift ratios were generally lower in humans compared to rats, these findings are consistent with previously published PRL data in both species^40,50. Importantly, Q-learning computational analyses revealed that both humans and rats assigned greater value to target vs. non-target stimuli, and these values alternated between response apertures as the criterion for reversals was achieved and reward contingencies were switched. Consistent with our prior finding⁵¹, we found a positive correlation between the beta parameter and the number of completed reversals, likely because a higher beta parameter promotes a greater tendency to exploit the action with the higher value. Taken together, these findings confirm that humans and rats performing the PRL task exhibit concordance in both behavioral performance and evoked neurophysiological responses.

Reward PEs are fundamental for learning⁵², and signal when an actual outcome deviates from what was expected. As the value assigned to the target was greater than the non-target, one would likely expect a reward after a target response. Importantly, rewarded outcomes elicited positive PEs, whereas non-rewards elicited negative PEs. However, the PE signal following a rewarded non-target response was more positive than rewarded target responses, and a non-rewarded target response elicited a more negative PE than non-rewarded non-target responses. Interestingly, the direction and magnitude of the frontal voltage across the RewP window following feedback were strikingly similar to the direction and magnitude of PEs across all four trial types in both species (Fig. 2C–F), and PEs significantly correlated with voltage during the RewP window in both species as well. These results suggest that the frontal voltage changes triggered by reward feedback represent a neural marker of PEs that is consistent across humans and rats. Less positive deflections after an unexpected reward omission are associated with greater negative PEs, whereas smaller deflections are associated with weaker negative PEs. Blunted negative PEs may contribute to perseverative responding and less flexibility if reward omissions fail to signal that an error has occurred and that alternate options should be explored. Future research focused on understanding the trial-level relationship between neural activity and choice selection may provide further insight into the role of PEs in cognitive flexibility.

Because midbrain dopaminergic signals to the ACC may contribute to and/or modulate the RewP^26,53, we hypothesized that indirectly enhancing dopamine levels (among other neuromodulators) via modafinil would enhance both the RewP and PEs. Interestingly, in rodents, we found that higher doses of modafinil increased positive PEs for rewarded target responses (i.e., PEs became more positive as modafinil dose increased) but suppressed negative PEs for unrewarded target responses (i.e., PEs became less negative as modafinil dose increased). The effect on PEs may disrupt the ability to properly identify and update the value of the target stimulus, ultimately impacting choice behavior. Notably, we only saw this effect in rodents administered the higher modafinil doses, and animals also completed fewer reversals and exhibited alterations in win-stay/lose-shift strategies (see Supplementary Table 2). The absence of this effect in humans suggests that the maximal human dose administered (200 mg) was not sufficient to disrupt behavior and neurophysiology in a manner consistent with that observed in rats.

It is possible that by increasing dopamine levels, high doses of modafinil might have interfered with the ability of midbrain dopamine neurons to appropriately suppress activity during an unexpected event, thereby suppressing the PE, which is associated with an attenuated RewP. This is consistent with prior evidence indicating that increased dorsomedial striatum activity reduces the impact of loss, thereby impairing performance in a reversal learning task⁵⁴. Critically, modafinil also modulates other monoamine systems⁵⁵, such as noradrenaline and serotonin, which also play a role in regulating reward PEs and value updating^56,57. Indeed, noradrenergic neurotransmission plays a central role in error monitoring⁵⁸. Thus, it is unclear whether PRL disruptions evident in rats after the highest dose of modafinil are due to alterations in dopamine neurotransmission or another neurotransmitter system. Alternatively, another possible explanation for why modafinil did not modulate PEs and ERPs/LFPs may have to do with the subjects. All human participants were healthy and without any psychiatric conditions, and all rats were naïve and otherwise healthy. Thus, dopamine signaling in healthy subjects may already be optimal, and any additional increase may impede performance. It is possible that modafinil may improve PRL performance in subjects that are otherwise healthy but exhibit reduced task performance or in patient samples characterized by reduced cognitive flexibility. Future studies that are sufficiently powered could group participants based on task performance (i.e., optimal vs poor) before modafinil administration or consider neuropsychiatric samples.

Importantly, the results of the cross-species studies presented here highlight the potential to replicate the rodent study using a parallel patient sample. If successful, this and other similar cross-species approaches may be used to test novel putative pharmacotherapies using similar behavioral and neurophysiological measures with high predictive validity⁵⁹. Such an approach would identify which neural mechanisms linked to behavioral endpoints are conserved across species and thus would be appropriate to assess using an animal model. Importantly, as a major step towards this goal, our findings demonstrate that concordant behavioral, computational, and neurophysiological measures are observed in humans and rodents performing a cross-species task. A similar approach could be used to identify neural mechanisms of and treatments for disorders featuring deficits in cognitive flexibility and other behaviors that can be accurately measured across species and linked to conserved neural processes.

Methods

A detailed description of the multidisciplinary research program of which the present study was part is described at: clinicaltrials.gov/study/NTC02855229.

Humans subjects

Sixty-three volunteers, aged 18–45 years, were recruited for the first PRL cohort (which did not include modafinil testing). A total of 54 were retained (19 male, 35 female) for final behavioral data analyses, of which 34 (13 male, 21 female) were retained for EEG analyses after further exclusion due to poor EEG data quality. Thirty separate right-handed volunteers were recruited for the second PRL cohort (which included modafinil), and a total of 29 subjects (14 male, 15 female) were retained for final data analyses. Subjects were free of any psychiatric history, as determined by a clinician-administered Structured Clinical Interview for DSM-5 (SCID-5)⁶⁰. Subjects were compensated $452 for participation. All ethical regulations relevant to human research participants were followed, and all procedures were approved by the Mass General Brigham Institutional Review Board. Subjects provided written informed consent in the presence of a medical doctor prior to participation.

Human PRL task procedure

The first cohort participated in a single testing session. Subjects were randomly assigned to either the PRL task or a Flanker task (the results of which were reported separately)⁶¹. Analyses examining the effects of task order revealed no significant differences. The second PRL cohort completed three sessions, separated by at least one week, using a double-blind, within-subjects, placebo-controlled design; across sessions, subjects were administered 0 mg (placebo), 100 mg, or 200 mg modafinil (2 h pretreatment).

Subjects completed a modified version of the PRL task (Supplementary Fig. 1A) while seated ~70 cm from a computer monitor inside an acoustically and electrically shielded booth. All stimuli were presented on a 22.5-in VIEWPixx monitor (VPixx Technologies, Saint-Bruno, Canada) using PsychoPy software. In this PRL task, participants were tasked to choose between two stimuli, one of which had been randomly designated as the target stimulus at the beginning of the session. Participants received probabilistic feedback in that, if the target stimulus was chosen, a reward would follow 80% of the time. Similarly, if non-target stimuli, negative feedback was given 80% of the time. Thus, spurious feedback would occur 20% of the time. The target or non-target assignment was reversed if the participant selected the target stimulus on 8 consecutive trials irrespective of feedback.

To ensure parallel instructions between species, participants were not made aware of reversing contingencies. Rather, they were instructed that they would need to “choose between two circles in order to win as much money as you can” and that they would hear one tone indicating a win/correct selection or a different tone indicating an incorrect selection. Participants completed 10 practice trials to familiarize themselves with the trial structure and the two tones. One session consisted of 300 trials with one break after 150 trials.

Every trial started with the presentation of a fixation cross of random duration between 500 and 1000 ms. Stimuli consisted of a red and blue circle, randomly placed on the left or right side of the screen, presented for maximally 2000 ms or until a response was given. Participants selected the left circle by pressing “c” or the right circle by pressing “m” on a keyboard. As soon as the response was given, a black border appeared around the selected circle for 400 ms. Following a random delay of between 400 and 600 ms, auditory feedback (either a 700 Hz or 1000 Hz pure sine wave) was played for 200 ms. Assignment of the reward and omission outcome to the high or low tone was counterbalanced between participants. If the subject received positive feedback, the feedback tone was followed by the sound of a coin dropping for 1200 ms. This sound was added to mimic the consumption of the food reward in rodents. If participants failed to answer within 2000 ms, a 300 Hz tone was played together with a visual stimulus reading “No response!”.

Human EEG acquisition

In both PRL cohorts, EEG data were recorded using an actiCHamp amplifier and a 96 Ag/AgCl active electrode actiCAP system (Brain Products GmbH, Gilching, Germany) that used an equidistant spherical montage and was referenced online to a vertex channel (approximating Cz), with a ground electrode approximately at AFz. Data were digitized at 500 Hz using BrainVision Recorder, and impedances were kept below 35 kΩ.

Rodent subjects

Eleven adult male (n = 5) and female (n = 6) Wistar rats were used for both baseline and modafinil experiments (Charles River Laboratories, Wilmington, MA, USA). Animals were pair-housed and food-restricted to 85% of their free-feeding body weight throughout behavioral training. After EEG electrodes were surgically implanted, animals were single-housed for the duration of the experiment and all PRL tests. All rats were housed in a vivarium room with a 12-h reverse light-dark cycle, with lights off between 7:00 AM and 7:00 PM. Rats were monitored daily for signs that would prompt a humane endpoint (e.g., excessive weight loss, inappetence, moribund state, or infection), requiring removal from the study and euthanasia. We have complied with all relevant ethical regulations for animal use; all rodent procedures were approved by the UC San Diego Institutional Animal Care and Use Committee, and were conducted in accordance with guidelines from the National Institute of Health and the Association for Assessment and Accreditation of Laboratory Animal Care.

Rodent EEG surgery and data acquisition

Prior to testing, rats were anesthetized with a 2% isoflurane/oxygen vapor mixture and secured on a stereotaxic frame (Kopf Instruments; Tujunga, CA, USA). In order to best approximate human EEG recordings, we implanted three different electrode types over and within the rats’ brains: (1) A 1/8” diameter fine silver disc (Hauser and Miller; St. Louis, MO, USA) soldered to a 0.01” diameter PFA-coated stainless steel wire (#792400; A-M Systems; Sequim, WA, USA) was placed on the surface of the skull immediately rostral to bregma; (2) a stainless steel jeweler’s screw soldered to the wire described above was implanted in the skull over the frontal cortex (AP + 3.7 mm, ML ± 2.5 mm) and parietal cortex (AP −4.5 mm, ML ± 4.9 mm); (3) a stainless steel wire described above was inserted into the ACC (AP + 2.7 mm, ML ± 0.8 mm, DV −2.3, from bregma), lateral orbitofrontal cortex, nucleus accumbens shell, caudate nucleus, and primary auditory cortex; notably, electrode implantation angles were unified across animals. Recordings from sites other than the ACC will be reported separately. Reference and ground skull screws were implanted bilaterally over the cerebellum. All electrodes were secured initially with Denmat cement, then completely covered with dental acrylic. The wires from all electrodes were secured with gold pins into an electrode interface board (EIB-16; Neuralynx; Bozeman, MT, USA) that attached to a removable amplifier board during data acquisition. Rats were monitored for at least one-week post-surgery before EEG recording. Unifying electrode coordinates, materials, and implantation angles across animals was done to minimize variability in signal orientation^62,63.

During testing, rats were connected to a 16-channel amplifier board (RHD2132; Intan Technologies; Los Angeles, CA, USA) that transmitted electrophysiological data to a USB interface board connected to a computer running RHD2000 interface software (Intan Technologies). Data were continuously recorded at a 1 kHz sampling rate and filtered between 0.1 and 300 Hz. While LFP data were being continuously collected during the PRL task, TTL event markers were recorded to identify presentation of reward feedback. Audio signals were recorded during testing and connected to the EEG acquisition system to confirm the accuracy of the time-lock between tones and neurophysiological signals.

Rodent PRL task procedure

Rats were trained and tested in a Plexiglas operant conditioning box (24 × 30 × 29 cm; Med Associates, St Albans, VT, USA) enclosed in a Faraday cage (Med Associates). It consisted of two retractable levers, a food receptacle positioned between the two levers, a stimulus light above each lever, a speaker above the food receptacle, and a house light placed 2 cm below the ceiling on the opposite wall. Tones were created by an audio generator. All programs and collection of data were done on MED-PC V software (Med Associates, St Albans, VT, USA).

The rodent PRL task was designed to be as similar to a human task as possible (Supplementary Fig. 1B). Briefly, rats responded for one of two colored light stimuli (that were illuminated for up to 5 s) by pressing one of two levers (presented 1 s after illumination) under the two lights. Target responses resulted in positive feedback (100 ms tone, 5 or 15 kHz, counterbalanced) on 80% of trials 500–1000 ms after the response and preceded the delivery of a 45 mg (for male rats) or 20 mg (for female rats) sucrose pellet (5TUT, Test Diet), or negative feedback on 20% of trials (other tone) followed by no pellet. Non-target responses resulted in negative and positive feedback on 80% and 20% of trials, respectively. Eight consecutive target responses, regardless of feedback, resulted in the target stimulus switching to the other light. A “reversal” was recorded when a rat successfully made 8 consecutive target responses after a switch. Rats completed 300 trials per session.

EEG recordings were obtained on the 21st day of testing (i.e., after rats had sufficient time to learn the PRL procedure). After one week, EEG recordings were obtained during testing after administration of one of the following doses of modafinil using a within-subjects Latin-square design: 0, 4, 8, 16, 32, 64 mg/kg. The vehicle for modafinil dosing was DMSO, administered at a volume of 1 ml/kg. There was a minimum one-week washout period between tests.

Cross-species Task Performance and ERP/LFP derivation

In both species, in addition to reversals, target win-stay probabilities were calculated as the number of responses repeated after a rewarded target response divided by the total number of rewarded target responses. Target lose-shift probabilities were calculated as the number of responses not repeated after an unrewarded target response divided by the total number of unrewarded target responses. Q-learning parameters, including Q, PE, alpha, beta, and forget values, were also calculated identically in both species (see Supplementary Methods).

All EEG data were analyzed with BrainVision Analyzer 2.1 (Brain Products GmbH, Gilching, Germany) in the following steps: human data were visually inspected to identify gross muscle activity and artifactual channels, and rodent data were checked for polarity inversions. Following this, data were bandpass filtered from 0.1 (12 dB/oct) to 30 Hz (24 dB/Oct Human cohort 1, 48 dB/oct, Human cohort 2, and Rats) using zero-phase Butterworth IIR filters. Human data were then subjected to independent component analysis to remove eye movement and EKG sources, spherical spline interpolation to replace corrupted channels, and finally re-referenced to the common average. Rodent data were re-referenced to the electrode implanted above the left cerebellum. Processed data for both species were then segmented into −1500 to 2000 ms epochs around the feedback stimulus, and segments were rejected channel-wise as artifact if any of these criteria were met: (1) a voltage exceeding ±75 µV (humans) or ±800 µV (rats); (2) a maximum voltage difference of less than 0.5 µV for more than 100 ms within a trial. Human recordings were also checked against two extra criteria: (1) a voltage step exceeding 50 µV, and (2) a maximum voltage difference of more than 150 µV across 200 ms time intervals within a trial.

In both species, feedback-locked data were segmented into individual epochs spanning from 200 ms before and 600 ms after the feedback tone, baseline-corrected (described below), and averaged. In humans, feedback-locked ERPs were quantified at channel 2 (approximating electrode FCz), and baseline-corrected to the −200 to 0 ms pre-feedback window. The RewP was quantified as the average amplitude between 165 and 225 ms post-feedback. In rats, feedback-locked LFPs were baseline-corrected to the −300 to −100 ms pre-feedback window and quantified at the ACC LFP channel as the average activity across the 60–160 ms post-feedback window.

Statistics and reproducibility

Across all cohorts (i.e., 1 rodent (n = 11) and 2 humans (n = 34 and n = 30)), the associations between task performance and behavioral parameters were assessed using Pearson’s correlation. In turn, between-condition comparisons for all parameters, at both the trial- and session-level, were evaluated using 1- and 2-way ANOVAs. Finally, the association between the ERP/LFP voltage and PE values was evaluated using a series of generalized linear models (GLMs) with a Gaussian distribution and identity link function. The general structure of these models was as follows:

$$E\left({{Voltage}}_{{dt}}\right)={\beta }_{0}+{\beta }_{1}{{PE}}_{t}$$

(1)

where E(Voltage_dt) corresponds to the ERP/LFP amplitude at each data point, d, within a given trial, t, and PE_t is the signed PE value on that given trial.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Numerical source data used in the present manuscript are hosted on OSF and available at: osf.io/gxkdv. All other data (i.e., raw EEG recordings) that support the findings of this study are available from the corresponding author upon reasonable request.

Code availability

The following software were used in preprocessing raw data and subsequent analysis: BrainVision Analyzer (v2.0), Python (v3.7.1), NumPy Python library (v1.21.5), pandas Python Library (v1.1.5), SciPy Python Library (v1.4.1), matplotlib Python library (v3.5.3), statsmodels Python Library (v0.13.5), IBM SPSS Statistics (v24), and GraphPad Prism (v8). Where possible, Python and R code used in the present analysis are available on OSF at: osf.io/gxkdv. EEG preprocessing templates are available from the corresponding author on reasonable request.

References

Scott, W. A. Cognitive complexity and cognitive flexibility. Sociometry 25, 405 (1962).
Article Google Scholar
Uddin, L. Q. Cognitive and behavioural flexibility: neural mechanisms and clinical considerations. Nat. Rev. Neurosci. 22, 167–179 (2021).
Article PubMed PubMed Central Google Scholar
Zühlsdorff, K., Dalley, J. W., Robbins, T. W. & Morein-Zamir, S. Cognitive flexibility: neurobehavioral correlates of changing one’s mind. Cereb. Cortex 33, 5436–5446 (2023).
Article PubMed Google Scholar
Genet, J. J. & Siemer, M. Flexible control in processing affective and non-affective material predicts individual differences in trait resilience. Cogn. Emot. 25, 380–388 (2011).
Article PubMed Google Scholar
Chen, Q. et al. Association of creative achievement with cognitive flexibility by a combined voxel-based morphometry and resting-state functional connectivity study. Neuroimage 102, 474–483 (2014).
Article PubMed Google Scholar
Cools, R. Neuropsychopharmacology of Cognitive Flexibility. In Brain Mapping 349–353 (Elsevier, 2015).
Gabrys, R. L., Tabri, N., Anisman, H. & Matheson, K. Cognitive control and flexibility in the context of stress and depressive symptoms: the cognitive control and flexibility questionnaire. Front. Psychol. https://doi.org/10.3389/fpsyg.2018.02219 (2018).
Davis, J. C., Marra, C. A., Najafzadeh, M. & Liu-Ambrose, T. The independent contribution of executive functions to health related quality of life in older women. BMC Geriatr. 10, 16 (2010).
Article PubMed PubMed Central Google Scholar
Robinson, O. J., Cools, R., Carlisi, C. O., Sahakian, B. J. & Drevets, W. C. Ventral striatum response during reward and punishment reversal learning in unmedicated major depressive disorder. Am. J. Psychiatry 169, 152–159 (2012).
Article PubMed PubMed Central Google Scholar
Taylor Tavares, J. V. et al. Neural basis of abnormal response to negative feedback in unmedicated mood disorders. Neuroimage 42, 1118–1126 (2008).
Article PubMed Google Scholar
Veiel, H. O. F. A preliminary profile of neuropsychological deficits associated with major depression. J. Clin. Exp. Neuropsychol. 19, 587–603 (1997).
Article PubMed Google Scholar
Reddy, L. F., Waltz, J. A., Green, M. F., Wynn, J. K. & Horan, W. P. Probabilistic Reversal Learning in Schizophrenia: Stability of Deficits and Potential Causal Mechanisms. Schizophr. Bull. 42, 942–951 (2016).
Article PubMed PubMed Central Google Scholar
Waltz, J. A. & Gold, J. M. Probabilistic reversal learning impairments in schizophrenia: further evidence of orbitofrontal dysfunction. Schizophr. Res. 93, 296–303 (2007).
Article PubMed PubMed Central Google Scholar
Chamberlain, S. R., Fineberg, N. A., Blackwell, A. D., Robbins, T. W. & Sahakian, B. J. Motor inhibition and cognitive flexibility in obsessive-compulsive disorder and trichotillomania. Am. J. Psychiatry 163, 1282–1284 (2006).
Article PubMed Google Scholar
Remijnse, P. L. et al. Reduced orbitofrontal-striatal activity on a reversal learning task in obsessive-compulsive disorder. Arch. Gen. Psychiatry 63, 1225 (2006).
Article PubMed Google Scholar
Camchong, J. et al. Frontal hyperconnectivity related to discounting and reversal learning in cocaine subjects. Biol. Psychiatry 69, 1117–1123 (2011).
Article PubMed PubMed Central Google Scholar
Ragland, J. D. et al. CNTRICS final task selection: long-term memory. Schizophr. Bull. 35, 197–212 (2009).
Article PubMed Google Scholar
Izquierdo, A., Brigman, J. L., Radke, A. K., Rudebeck, P. H. & Holmes, A. The neural basis of reversal learning: an updated perspective. Neuroscience 345, 12–26 (2017).
Article PubMed Google Scholar
Robbins, T. W. Cross-species studies of cognition relevant to drug discovery: a translational approach. Br. J. Pharmacol. 174, 3191–3199 (2017).
Article PubMed PubMed Central Google Scholar
Murphy, F. C., Michael, A., Robbins, T. W. & Sahakian, B. J. Neuropsychological impairment in patients with major depressive disorder: the effects of feedback on task performance. Psychol. Med. 33, 455–467 (2003).
Article PubMed Google Scholar
Chamberlain, S. R. et al. Orbitofrontal dysfunction in patients with obsessive-compulsive disorder and their unaffected relatives. Science (1979). 321, 421–422 (2008).
Article PubMed Google Scholar
Kehagia, A. A., Murray, G. K. & Robbins, T. W. Learning and cognitive flexibility: frontostriatal function and monoaminergic modulation. Curr. Opin. Neurobiol. 20, 199–204 (2010).
Article PubMed Google Scholar
Proudfit, G. H. The reward positivity: from basic research on reward to a biomarker for depression. Psychophysiology 52, 449–459 (2015).
Article PubMed Google Scholar
Kehrer, P., Brigman, J. L. & Cavanagh, J. F. Depth recordings of the mouse homologue of the reward positivity. Cogn. Affect Behav. Neurosci. 24, 292–301 (2024).
Article PubMed Google Scholar
Miltner, W. H. R., Braun, C. H. & Coles, M. G. H. Event-related brain potentials following incorrect feedback in a time-estimation task: evidence for a “generic” neural system for error detection. J. Cogn. Neurosci. 9, 788–798 (1997).
Article PubMed Google Scholar
Holroyd, C. B. & Coles, M. G. H. The neural basis of human error processing: reinforcement learning, dopamine, and the error-related negativity. Psychol. Rev. 109, 679–709 (2002).
Article PubMed Google Scholar
Yasuda, A., Sato, A., Miyawaki, K., Kumano, H. & Kuboki, T. Error-related negativity reflects detection of negative reward prediction error. Neuroreport 15, 2561–2565 (2004).
Article PubMed Google Scholar
Chase, H. W., Swainson, R., Durham, L., Benham, L. & Cools, R. Feedback-related negativity codes prediction error but not behavioral adjustment during probabilistic reversal learning. J. Cogn. Neurosci. 23, 936–946 (2011).
Article PubMed Google Scholar
Holroyd, C. B., Pakzad-Vaezi, K. L. & Krigolson, O. E. The feedback correct-related positivity: sensitivity of the event-related brain potential to unexpected positive feedback. Psychophysiology 45, 688–697 (2008).
Article PubMed Google Scholar
Holroyd, C. B., Krigolson, O. E. & Lee, S. Reward positivity elicited by predictive cues. Neuroreport 22, 249–252 (2011).
Article PubMed Google Scholar
Hauser, T. U. et al. The feedback-related negativity (FRN) revisited: new insights into the localization, meaning and network organization. Neuroimage 84, 159–168 (2014).
Article PubMed Google Scholar
Gruendler, T. O. J., Ullsperger, M. & Huster, R. J. Event-related potential correlates of performance-monitoring in a lateralized time-estimation task. PLoS ONE 6, e25591 (2011).
Article PubMed PubMed Central Google Scholar
Hyman, J. M., Holroyd, C. B. & Seamans, J. K. A novel neural prediction error found in anterior cingulate cortex ensembles. Neuron 95, 447–456.e3 (2017).
Article PubMed Google Scholar
Montague, P., Dayan, P. & Sejnowski, T. A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J. Neurosci. 16, 1936–1947 (1996).
Article PubMed PubMed Central Google Scholar
Schultz, W. Predictive reward signal of dopamine neurons. J. Neurophysiol. 80, 1–27 (1998).
Article PubMed Google Scholar
Bayer, H. M. & Glimcher, P. W. Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron 47, 129–141 (2005).
Article PubMed PubMed Central Google Scholar
Brown, J. W. & Braver, T. S. Learned predictions of error likelihood in the anterior cingulate cortex. Science 307, 1118–1121 (2005).
Article PubMed Google Scholar
Cohen, M. X. & Ranganath, C. Reinforcement learning signals predict future decisions. J. Neurosci. 27, 371–378 (2007).
Article PubMed PubMed Central Google Scholar
Cavanagh, J. F. et al. Electrophysiological biomarkers of behavioral dimensions from cross-species paradigms. Transl. Psychiatry 11, 482 (2021).
Article PubMed PubMed Central Google Scholar
Bari, A. et al. Serotonin modulates sensitivity to reward and negative feedback in a probabilistic reversal learning task in rats. Neuropsychopharmacology 35, 1290–1301 (2010).
Article PubMed PubMed Central Google Scholar
Tranter, M. M., Aggarwal, S., Young, J. W., Dillon, D. G. & Barnes, S. A. Reinforcement learning deficits exhibited by postnatal PCP-treated rats enable deep neural network classification. Neuropsychopharmacology 48, 1377–1385 (2023).
Article PubMed Google Scholar
Cavanagh, J. F. et al. Amphetamine alters an EEG marker of reward processing in humans and mice. Psychopharmacology 239, 923–933 (2022).
Article PubMed PubMed Central Google Scholar
Volkow, N. D. et al. Effects of modafinil on dopamine and dopamine transporters in the male human brain. JAMA 301, 1148 (2009).
Article PubMed PubMed Central Google Scholar
Tranter, M. M. et al. Postnatal phencyclidine-induced deficits in decision making are ameliorated by optogenetic inhibition of ventromedial orbitofrontal cortical glutamate neurons. Biol. Psychiatry Glob. Open Sci. 4, 264–274 (2024).
Article PubMed Google Scholar
Cohen, J. D., McClure, S. M. & Yu, A. J. Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration. Philos. Trans. R. Soc. B Biol. Sci. 362, 933–942 (2007).
Article Google Scholar
Schultz, W., Dayan, P. & Montague, P. R. A neural substrate of prediction and reward. Science (1979). 275, 1593–1599 (1997).
Article PubMed Google Scholar
Daw, N. D. & Doya, K. The computational neurobiology of learning and reward. Curr. Opin. Neurobiol. 16, 199–204 (2006).
Article PubMed Google Scholar
Sharma, V. & McNeill, J. H. To scale or not to scale: the principles of dose extrapolation. Br. J. Pharmacol. 157, 907–921 (2009).
Article PubMed PubMed Central Google Scholar
Sambrook, T. D. & Goslin, J. A neural reward prediction error revealed by a meta-analysis of ERPs using great grand averages. Psychol. Bull. 141, 213–235 (2015).
Article PubMed Google Scholar
Rychlik, M., Bollen, E. & Rygula, R. Ketamine decreases sensitivity of male rats to misleading negative feedback in a probabilistic reversal-learning task. Psychopharmacology 234, 613–620 (2017).
Article PubMed Google Scholar
Barnes, S. A. et al. Modulation of ventromedial orbitofrontal cortical glutamatergic activity affects the explore-exploit balance and influences value-based decision-making. Cereb. Cortex 33, 5783–5796 (2023).
Article PubMed Google Scholar
Millard, S. J., Bearden, C. E., Karlsgodt, K. H. & Sharpe, M. J. The prediction-error hypothesis of schizophrenia: new data point to circuit-specific changes in dopamine activity. Neuropsychopharmacology 47, 628–640 (2022).
Article PubMed Google Scholar
Warren, C. M., Hyman, J. M., Seamans, J. K. & Holroyd, C. B. Feedback-related negativity observed in rodent anterior cingulate cortex. J. Physiol. Paris 109, 87–94 (2015).
Article PubMed Google Scholar
Young, M. K. et al. Activity in the dorsomedial striatum underlies serial reversal learning performance under probabilistic uncertainty. Biol. Psychiatry Glob. Open Sci. 3, 1030–1041 (2023).
Article PubMed Google Scholar
Minzenberg, M. J. & Carter, C. S. Modafinil: a review of neurochemical actions and effects on cognition. Neuropsychopharmacology 33, 1477–1502 (2008).
Article PubMed Google Scholar
Basu, A. et al. Frontal norepinephrine represents a threat prediction error under uncertainty. Biol. Psychiatry 96, 256–267 (2024).
Article PubMed PubMed Central Google Scholar
Luo, Q. et al. Comparable roles for serotonin in rats and humans for computations underlying flexible decision-making. Neuropsychopharmacology 49, 600–608 (2024).
Article PubMed Google Scholar
Riba, J., Rodríguez-Fornells, A., Morte, A., Münte, T. F. & Barbanoj, M. J. Noradrenergic stimulation enhances human action monitoring. J. Neurosci. 25, 4370–4374 (2005).
Article PubMed PubMed Central Google Scholar
Der-Avakian, A., Barnes, S. A., Markou, A. & Pizzagalli, D. A. Translational assessment of reward and motivational deficits in psychiatric disorders. Curr. Top. Behav. Neurosci. 28, 231–262 (2015).
Article Google Scholar
First, M. B., Williams, J. B. W., Karg, R. S. & Spitzer, R. L. Structured clinical interview for DSM-5 — research version (SCID-5 for DSM-5, Research Version; SCID-5-RV). (Arlington, VA, American Psychiatric Association, 2015).
Robble, M. A. et al. Concordant neurophysiological signatures of cognitive control in humans and rats. Neuropsychopharmacology 46, 1252–1262 (2021).
Article PubMed PubMed Central Google Scholar
Einevoll, G. T., Kayser, C., Logothetis, N. K. & Panzeri, S. Modelling and analysis of local field potentials for studying the function of cortical circuits. Nat. Rev. Neurosci. 14, 770–785 (2013).
Article PubMed Google Scholar
Buzsáki, G., Anastassiou, C. A. & Koch, C. The origin of extracellular fields and currents — EEG, ECoG, LFP and spikes. Nat. Rev. Neurosci. 13, 407–420 (2012).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

D.A.P. discloses support for the research and publication of this work from the National Institute of Mental Health grants UH2 MH109334 and UH3 MH109334. The authors would like to acknowledge the members of our scientific advisory board, Dr. Cindy Ehlers, Dr. Stan Floresco, Dr. Patricio O’Donnell, and Dr. Steven Siegel, for their assistance in the development and execution of these studies. We also thank Ms. Jessica Dally for technical assistance.

Author information

These authors contributed equally: Andre Der-Avakian, Samuel A. Barnes, Ty Lees.

Authors and Affiliations

Department of Psychiatry, University of California San Diego, La Jolla, CA, USA
Andre Der-Avakian, Samuel A. Barnes & Victoria B. Risbrough
Department of Psychiatry, McLean Hospital & Harvard Medical School, Belmont, MA, USA
Ty Lees, Hans S. Schroder, Brian D. Kangas, Samantha R. Linton, Stefanie Nickels, Mykel A. Robble, Micah Breiger, Ann M. Iturra-Mena, Rachel Lobien, Sarah Perlo, Emilia F. Cárdenas, Genevieve P. Nowicki, Daniel G. Dillon, Jack Bergman, William A. Carlezon Jr & Diego A. Pizzagalli
Department of Cognitive Science, University of California San Diego, La Jolla, CA, USA
Zeyun Wu, Hongyi Pan & Eran Mukamel
Queensland Centre for Mental Health Research, Wacol, QLD, Australia
James P. Kesby
Queensland Brain Institute, The University of Queensland, St. Lucia, QLD, Australia
James P. Kesby
Veterans Affairs Center of Excellence for Stress and Mental Health, La Jolla, CA, USA
Victoria B. Risbrough
Neurobiology Section and Center for Neural Circuits and Behavior, University of California San Diego, La Jolla, CA, USA
Stefan Leutgeb
Noel Drury, M.D. Institute for Translational Depression Discoveries, University of California, Irvine, CA, USA
Diego A. Pizzagalli

Authors

Andre Der-Avakian
View author publications
Search author on:PubMed Google Scholar
Samuel A. Barnes
View author publications
Search author on:PubMed Google Scholar
Ty Lees
View author publications
Search author on:PubMed Google Scholar
Hans S. Schroder
View author publications
Search author on:PubMed Google Scholar
Brian D. Kangas
View author publications
Search author on:PubMed Google Scholar
Samantha R. Linton
View author publications
Search author on:PubMed Google Scholar
Stefanie Nickels
View author publications
Search author on:PubMed Google Scholar
Mykel A. Robble
View author publications
Search author on:PubMed Google Scholar
Micah Breiger
View author publications
Search author on:PubMed Google Scholar
Ann M. Iturra-Mena
View author publications
Search author on:PubMed Google Scholar
Rachel Lobien
View author publications
Search author on:PubMed Google Scholar
Sarah Perlo
View author publications
Search author on:PubMed Google Scholar
Emilia F. Cárdenas
View author publications
Search author on:PubMed Google Scholar
Genevieve P. Nowicki
View author publications
Search author on:PubMed Google Scholar
Zeyun Wu
View author publications
Search author on:PubMed Google Scholar
Hongyi Pan
View author publications
Search author on:PubMed Google Scholar
Daniel G. Dillon
View author publications
Search author on:PubMed Google Scholar
James P. Kesby
View author publications
Search author on:PubMed Google Scholar
Jack Bergman
View author publications
Search author on:PubMed Google Scholar
William A. Carlezon Jr
View author publications
Search author on:PubMed Google Scholar
Victoria B. Risbrough
View author publications
Search author on:PubMed Google Scholar
Eran Mukamel
View author publications
Search author on:PubMed Google Scholar
Stefan Leutgeb
View author publications
Search author on:PubMed Google Scholar
Diego A. Pizzagalli
View author publications
Search author on:PubMed Google Scholar

Contributions

A.D., J.B., W.A.C, V.B.R, S.L., D.A.P. contributed to the study design, A.D., H.S.S., B.D.K., S.L., S.N., M.A.R., M.B., A.M.I., R.L., S.P., E.F.C., G.P.N., J.P.K. conducted the experiments, S.A.B., Z.W., H.P., D.G.D., E.M. performed data analyses, A.D., S.A.B., T.L., H.S.S. wrote the manuscript.

Corresponding author

Correspondence to Diego A. Pizzagalli.

Ethics declarations

Competing interests

Over the past 3 years, B.D.K. has had sponsored research agreements with BlackThorn Therapeutics, Compass Pathways, Delix Therapeutics, Engrail Therapeutics, Neurocrine Biosciences, and Takeda Pharmaceuticals. Over the past 3 years, V.B.R. has received consulting fees from Engrail Pharmaceuticals, Jazz Pharmaceuticals, and Cohen Veterans Biosciences. Over the past 3 years, D.A.P. has received consulting fees from Arrowhead Pharmaceuticals, Boehringer Ingelheim, Compass Pathways, Engrail Therapeutics, Neumora Therapeutics, Neurocrine Biosciences, Neuroscience Software, and Takeda; he has received honoraria from the American Psychological Society, Psychonomic Society and Springer (for editorial work) and from Alkermes; he has received research funding from the Bird Foundation, Brain and Behavior Research Foundation, Dana Foundation, DARPA, Millennium Pharmaceuticals, the National Institute of Mental Health, and Wellcome Leap; he has received stock options from Ceretype Neuromedicine, Compass Pathways, Engrail Therapeutics, Neumora Therapeutics, and Neuroscience Software. No funding or any involvement from these entities was used to support the current work, and all views expressed are solely those of the authors. All other authors have no conflicts of interest or relevant disclosures.

Peer review

Peer review information

Communications Biology thanks Clay B Holroyd, James Cavanagh, and Markus Ullsperger for their contribution to the peer review of this work. Primary Handling Editors: Christian Beste and Benjamin Bessieres. [A peer review file is available.]

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Transparent Peer Review file

Supplementary Material

Reporting Summary

Python Code for Human Data Analysis

R Code for analyses and plots

Python code for RL model fitting and testing

Python code for Rodent data analysis

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Der-Avakian, A., Barnes, S.A., Lees, T. et al. Identification of conserved frontal neurophysiological markers of cognitive flexibility in humans and rats. Commun Biol 8, 1268 (2025). https://doi.org/10.1038/s42003-025-08729-x

Download citation

Received: 18 June 2024
Accepted: 14 August 2025
Published: 23 August 2025
DOI: https://doi.org/10.1038/s42003-025-08729-x