Adaptive learning from outcome contingencies in eating-disorder risk groups

Pike, Alexandra C.; Sharpley, Ann L.; Park, Rebecca J.; Cowen, Philip J.; Browning, Michael; Pulcu, Erdem

doi:10.1038/s41398-023-02633-w

Download PDF

Article
Open access
Published: 04 November 2023

Adaptive learning from outcome contingencies in eating-disorder risk groups

Translational Psychiatry volume 13, Article number: 340 (2023) Cite this article

3284 Accesses
10 Citations
12 Altmetric
Metrics details

Subjects

Abstract

Eating disorders are characterised by altered eating patterns alongside overvaluation of body weight or shape, and have relatively low rates of successful treatment and recovery. Notably, cognitive inflexibility has been implicated in both the development and maintenance of eating disorders, and understanding the reasons for this inflexibility might indicate avenues for treatment development. We therefore investigate one potential cause of this inflexibility: an inability to adjust learning when outcome contingencies change. We recruited (n = 82) three groups of participants: those who had recovered from anorexia nervosa (RA), those who had high levels of eating disorder symptoms but no formal diagnosis (EA), and control participants (HC). They performed a reinforcement learning task (alongside eye-tracking) in which the volatility of wins and losses was independently manipulated. We predicted that both the RA and EA groups would adjust their learning rates less than the control participants. Unexpectedly, the RA group showed elevated adjustment of learning rates for both win and loss outcomes compared to control participants. The RA group also showed increased pupil dilation to stable wins and reduced pupil dilation to stable losses. Their learning rate adjustment was associated with the difference between their pupil dilation to volatile vs. stable wins. In conclusion, we find evidence that learning rate adjustment is unexpectedly higher in those who have recovered from anorexia nervosa, indicating that the relationship between eating disorders and cognitive inflexibility may be complex. Given our findings, investigation of noradrenergic agents may be valuable in the field of eating disorders.

Neural activation of regions involved in food reward and cognitive control in young females with anorexia nervosa and atypical anorexia nervosa versus healthy controls

Article Open access 23 June 2023

Exaggerated frontoparietal control over cognitive effort-based decision-making in young women with anorexia nervosa

Article Open access 28 August 2024

Altered social cognition in a community sample of women with disordered eating behaviours: a multi-method approach

Article Open access 19 July 2021

Introduction

Eating disorders (EDs) are a cluster of psychiatric disorders characterised by altered eating attitudes and behaviours, alongside over-valuation of the control of eating, weight and/or shape [1]. EDs are relatively common [2,3,4], often severely disabling [5], and can become chronic [6, 7], with high rates of mortality [3, 8, 9] and relatively low rates of treatment response or remission [10,11,12,13]. Psychological treatments can help, but only in some cases [14], and there are few efficacious pharmacological treatments [15]. To improve treatment success, greater knowledge of the cognitive differences that precipitate and maintain the ritualistic, rigid behaviours that characterise many EDs would be valuable [16]. Furthermore, understanding cognitive mechanisms underlying a disorder may combine with our knowledge of the actions of pharmacological agents on those mechanisms to indicate new treatment directions.

‘Cognitive inflexibility’ is frequently observed in EDs [17, 18], and can be defined as an inability to adjust or adapt cognitive functions (e.g. learning, and decision-making) in response to changes in the requirements of the task, outcome contingencies, or the goals that the individual is pursuing. This is often probed using set-shifting tasks such as the Wisconsin Card Sort Task [19], where the ‘rule’ that governs correct behaviour changes without warning. On this and similar tasks, those with EDs show worse set-shifting performance, broadly indicating difficulty in flexibly altering responses given changes in the requirements of the task [20]. This is thought to be present as a ‘trait’, i.e. not just a product of the disease-state [21, 22].

‘Set-shifts’ induce unexpected uncertainty or volatility – changes in the underlying probabilistic structure that has been learnt by the individual [23, 24]. This type of uncertainty is common in everyday life. For example, you may have a local restaurant you really like but have recently found the food to be less good than it had been. Where you go to eat in the future will depend on whether the recent bad meals occurred by chance (in which case you should continue going to the same restaurant), or whether there has been a reduction in the underlying quality of the food (i.e. the quality is volatile, in which case you should switch restaurant). The question addressed in this paper is whether people with EDs are able to perceive and change their behaviour appropriately in response to outcome volatility, and whether difficulties with this might underpin neuropsychological findings of poor cognitive flexibility.

Recent work in this area has used a task [25] in which the volatility of outcomes is manipulated independently between blocks. Computational modelling allows us to estimate a parameter referred to as a 'learning rate', which can be understood as the extent to which each outcome influences the learnt value of particular options. When outcomes are volatile, learning rates should be higher, as more recent outcomes are more predictive of the actual value of an option than outcomes that occurred further back in history. Healthy participants performing this task are able to adjust their learning rate in response to volatility in an approximately optimal way [26]. A recent adaptation of this task has shown that participants are also able to maintain separate estimates of different valences of outcomes (‘wins’ and ‘losses’), and track the volatility of each of these, adjusting learning rates for wins and losses independently [25]. Importantly, using computational modelling allows us to adjudicate between competing hypotheses regarding poor set-shifting performance in eating disorders: greater noise in behavioural decision-making, generally reduced learning rates, or reduced learning rate adjustment in response to volatility.

We hypothesised that adjustment in learning rates in response to volatility would be reduced in individuals with EDs, in line with the cognitive neuroscience evidence suggesting that those with EDs may experience difficulties with cognitive flexibility, and struggle to adjust their behaviour [17, 20,21,22]. A finding that those in ED groups adjusted their learning rates less would also correspond with a similar finding in anxiety disorders [27], which would be unsurprising given the high comorbidity between anxiety disorders and EDs [28].

Biologically, noradrenaline (NA) may signal unexpected uncertainty [24]. It is possible to indirectly measure the response of the central noradrenergic system using pupillometry [29, 30]; phasic changes in pupil diameter have been observed to correlate with volatility [25, 27, 29], and pupil dilation may also be linked to surprise [31]. Pharmacological manipulations that increase the release of NA are able to improve performance on (attentional) set-shifting tasks in rats [32]. Furthermore, NA deafferentation in rats impairs set-shifting performance [33]. In humans, propranolol, which attenuates NA transmission, reduces volatility-related increases in learning rates [34]. However, not all of the evidence paints such a clear picture of the role of NA in signalling unexpected uncertainty. Jepma et al. [35] found that atomoxetine (a NA transporter blocker) caused an increase in learning rate after an alteration in outcome contingencies if the baseline learning rate was low, but otherwise, atomoxetine caused a reduction in learning rate. There has been early evidence for the efficacy of atomoxetine in the treatment of binge-eating disorder, and anorexia nervosa with binge-purge features [36, 37]. Additionally, research has indicated that there is reduced NA functioning in ED patients, including those who have recovered [38,39,40]. We therefore recorded pupil dilation to examine whether any changes in learning rate adjustment were reflected in altered pupil dilation changes, which could in turn reflect differences in noradrenergic transmission. We hypothesised that corresponding to a reduction in learning rate adjustment, those with EDs might show a reduced pupil dilation response to volatility. This may indicate a lack of sensitivity to changes in outcome volatility.

Methods and materials

This study was preregistered on clinicaltrials.gov, with the identifier NCT03450291. This study was approved by the University of Oxford’s Central University Research Ethics Committee (reference R51898). Open data and code are not available as not all participants consented to have their data shared, even if they could not be identified after anonymisation.

Participants

Three groups of female participants, selected to cover a range of different ED-relevant phenotypes, were recruited (82 in total): those who had recovered from anorexia nervosa (RA, n = 25), those who were highly concerned about their eating, shape and weight, but did not have a formal ED diagnosis (EA, n = 25), and control participants who had never had an eating disorder and were below various markers on self-report questionnaires about eating, shape and weight (HC, n = 32). The criteria for each of these groups, along with further details of the inclusion and exclusion criteria, may be found in the Supplementary Material.

General procedure

The study involved the completion of pre-screening questionnaires: the Eating Attitudes Test (EAT-26; [41]), Eating Disorders Examination Self-Report Questionnaire (EDE-Q; [42]), and Clinical Impairment Assessment for eating disorders (CIA; [43]). If eligible, participants were invited for a single study visit. During that visit, the Structured Clinical Interview for the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) (Revised Version) was performed. Subsequently, participants completed additional self-report questionnaires online (see Table 1). They then completed the learning task [25] described below, alongside eye-tracking.

Table 1 Demographic details.

Full size table

The volatility task

This study utilises a task which has been shown to produce adaptive learning in human participants [25,26,27]. The volatility of outcomes (here defined as the frequency at which the associations between the stimuli and outcomes alter) is manipulated between blocks (Fig. 1). In brief, the task consisted of three blocks (each consisting of 80 trials), and on any trial, two stimuli were presented (kept constant within blocks), and associated with win and/or loss outcomes. Participants learned using trial and error to select the stimulus that was associated with wins and avoid the stimulus associated with losses. Importantly, the associations between stimuli and outcomes were not necessarily fixed: they could vary within a block (i.e. were ‘volatile’). A stimulus could be associated with both win and loss outcomes on a given trial (or only one of the two outcomes, or neither): therefore, we were able to independently manipulate the volatility of wins and losses between the three blocks. Participants generally display an elevated learning rate for volatile outcomes [26, 27], akin to overweighting more recent outcomes compared to more distal outcomes, and participants have been shown to adjust their learning rates for wins and losses separately as their volatility changes [27]. The reader should note that wherever we analyse behaviour or pupil dilation in response to volatility, we include data from both the first block (block 1) and the block where only that outcome was volatile (i.e. the ‘win volatile’ block for wins, and the ‘loss volatile’ block for losses), to maximise our power to detect differences.

General statistical approach

Wherever applicable we used a Greenhouse-Geisser correction to adjust for lack of sphericity. To further clarify significant effects from the mixed ANOVAs, we used post-hoc Welch’s t-tests, which conservatively assume unequal variance between groups. We did not correct for multiple comparisons in these analyses: the primary (adjustment of learning rate) and secondary outcomes (valence-specific effects in learning rate adjustment, and differences in pupil dilation under volatility) were pre-registered, and all other analyses are exploratory and designed to aid the interpretation of the results.

Behavioural analysis

Data quality

Participants who showed no evidence of learning during the task (those whose total money won was the same or less than the £1.50 they started with) were removed from further analysis (n = 4; comprised of two HC participants, and one from each of the other two groups).

Switch/stay analysis

The trial-by-trial tendency of participants to ‘switch’ (i.e. choose a different stimulus on the next trial) or ‘stay’ (i.e. repeat their choice on the next trial) may provide some insight into participants’ behaviour in this volatility task. Those who heavily weigh the most recent outcome against the long-run probability of each outcome associated with each stimulus are more likely to choose to ‘switch’ after a loss and a ‘stay’ after a win. In general, we expected that participants would ‘switch’ more after a negative outcome in blocks where losses are volatile, as the receipt of a loss would indicate a potential change in loss probability. Similarly, we would expect participants to ‘stay’ more after a rewarding outcome in blocks where wins are volatile. Cognitive inflexibility, in particular reduced adjustment to changing outcome probabilities (which we expected to observe in both the RA and EA groups), may manifest as a reduction in this pattern of choices.

We, therefore, analysed the proportion of times that participants chose to switch after receiving an outcome when that outcome was either volatile or stable. We used a logit transform on the proportion of times that participants chose to stay after each outcome to ensure that this was on the real, infinite number line. We subsequently used a repeated-measures ANOVA, with between-subject effects of group and block presentation order and within-subject effects of outcome volatility (volatile vs. stable) and valence of outcome (win vs. loss), to investigate whether there were any between-group differences.

Reinforcement learning analysis

The switch/stay analysis described above relies on summary statistics, which are less suitable for assessing the latent variables of interest (e.g. learning, performance, stochasticity) than more principled model-based analyses [44, 45]. In particular, computational models of learning are able to capture how performance evolves as a result of feedback, rather than simply assessing average performance [46]. Furthermore, previous work has suggested that behaviour in this learning task is modulated by three separate computational factors, which all contribute to the obtainable summary statistics: learning rates, unexplained biases in favour of one shape, and choice stochasticity [47].

We therefore fit reinforcement-learning models linked to stochastic choice models based on previous work using variants of this task [25,26,27]. Models were fit using Markov-Chain Monte-Carlo sampling. The models used and further methodological details of the model-fitting procedure can be found in the Supplementary Information.

Subsequently, we selected the model that (a) best fit the participants’ behaviour (according to the total integrated BIC score [48]), (b) showed good parameter recovery, and (c) was able to faithfully reproduce participant behaviour. Finally, we conducted statistical analyses on the relevant parameters from the best-fitting model.

Statistical analysis

The preregistered primary analysis of this study aimed to see if there were group differences in ability to alter learning rate between blocks, which we examined using a repeated-measures ANOVA on the difference between learning rates in blocks (sets of 80 trials) in which outcomes were volatile (see above: in block 1 both win and loss outcomes were volatile, and in blocks 2 and 3 win outcomes were volatile and loss outcomes stable, or vice versa, in an order counterbalanced between participants). We used counterbalance order and group as between-participant factors, and valence (win and loss learning rate adjustment) as the within-participant factor. We also analysed the learning rates in block 1, in which both outcomes were volatile, using an ANOVA with group and order as between-participant factors, and valence as a within-participant factor.

For completeness, we also explored the effects of group on the other parameters from the winning computational model, with between-subject factors of group and order, and within-subject factor of block.

Pupillometry analysis

Preprocessing

Detailed information on the preprocessing of eye tracking data is provided in the supplementary materials. Notably, trials were excluded from analysis if >50% of the data in that trial was interpolated, and two participants who had >50% interpolation on >50% of the task trials were removed. One participant was removed as a power outage corrupted their pupillometry data.

After preprocessing, we had a time-series for each trial displaying pupil dilation to rewards, and to punishments, spanning 1 s before to 6 s after the outcome presentation. We subsequently calculated four mean time-series: for each combination of volatility and outcome (win volatile, win stable, loss volatile, and loss stable). As elsewhere, ‘block 1’ data was included in the volatile time-series. We then created a subtraction time-series from these: the difference between pupil dilation to receipt (subtracted from non-receipt) of volatile wins/losses minus stable rewards/losses.

Statistical analysis

We ran a cluster-based permutation mixed effects model using the ‘permutes’ package in R [49], with 1000 permutations and a random slope specified as (valence | id), to identify any time-points in which there were significant effects of group, valence, or interactions. The model had the following equation:

$${value} \sim {valence}* {Group}+\left({valence}|{id}\right),$$

where time was added as a continuous time-series variable. The package used shuffles the labels of the fixed effects, using a simplified version of the algorithm from Lee & Braun [50]. Note that the data included in the permutation test represent averaged time series for each individual for each valence. Subsequently, we ran post-hoc permutation linear mixed-effects models on relevant time-windows to obtain robust p-values.

Results

Participant characteristics

Demographic information and participant questionnaire scores (and group differences if found) are shown in Table 1.

Switch/stay analyses do not discriminate between ED groups and healthy controls

In a repeated-measures ANOVA, there were main effects of volatility (F_1,76 = 16.21, p < 0.001) and valence (F_1,76 = 364.60, p < 0.001), and an interaction effect between the two (F_1,76 = 40.64, p < 0.001). This is an expected task effect: participants tended to ‘stay’ more when receiving a reward outcome when that reward was presented in a volatile block (M = 0.91, SD = 0.11) than when presented in a stable block (M = 0.87, SD = 0.18). Participants also tended to stay less when receiving a loss outcome when that loss was presented in a volatile block (M = 0.62, SD = 0.14) compared to a stable block (M = 0.75, SD = 0.16). There was not a main effect of group (F_2,76 = 0.16, p = 0.851; Fig. 2) nor any interaction effects including group (group and volatility: F_2,76 = 0.62, p = 0.542; group and valence: F_2,76 = 0.04, p = 0.965; group, volatility and valence: F_2,76 = 0.03, p = 0.971). There was an interaction effect between volatility, valence and order (F_1,76 = 4.32, p = 0.041). The full results of this analysis can be seen in the Supplementary material.

**Fig. 2: Difference in the mean proportion of times participants repeated a choice (‘stay’ choices) depending on whether the outcome they received was volatile (i.e. informative) or stable.**

At first sight, therefore, the different groups seem to show comparable learning rates (i.e., their tendency to switch or stay is not affected by group membership). In order to validate these results and explore the other computational factors that may govern behaviour in this task (overall choice stochasticity and unexplained preference biases), we modelled choice behaviour by linking different reinforcement learning and stochastic choice models.

Best-fitting reinforcement learning model

We examined a set of reinforcement learning models which incorporated separate learning rates for rewards and losses, as in Pulcu & Browning [27] - see supplementary materials. The best-fitting reinforcement learning model (according to the integrated BIC [48]) had a single inverse temperature term, which captures choice stochasticity, or the extent to which participants do not act in accordance with the learnt values of the stimuli. We present results in the Supplementary Material which show that this model has good parameter recovery and was able to faithfully reproduce salient features of our participant data. We therefore use this model for subsequent inference.

RA group show greater learning rate adjustment

We performed an ANOVA on the parameters from our best-fitting reinforcement-learning model, and found a significant group effect on the difference in learning rates between volatile and stable blocks (including block 1; F_2,76 = 3.42, p = 0.038; Fig. 3A). This was driven by the RA group (groupwise RA vs. HC: F_1,53 = 7.13, p = 0.010; groupwise EA vs. HC: F_1,53 = 0.53, p = 0.468; groupwise RA vs. EA: F_1,46 = 2.72, p = 0.106), who show elevated learning rate adjustment (t_110.78 = 2.43, p = 0.017). When we examined the learning rates themselves in both the volatile block and the stable block, there was no effect of group (volatile block: F_2,76 = 0.56, p = 0.574; stable block: F_2,76 = 0.12, p = 0.888). Notably, there was no interaction between group and valence, so we do not investigate this further (F_2,76 = 0.02, p = 0.985), nor any interaction between group and order (F_2,76 = 0.91, p = 0.407). We also replicated a previous finding, of elevated win learning rate for wins when these are volatile compared to stable (t_161.65 = 3.31, p = 0.001, M = 0.371 vs. 0.246), and the same for loss outcomes (t_161.57 = 2.99, p = 0.003, M = 0.291 vs. 0.183). This demonstrates that, as expected, participants are adapting to volatility. The learning rate adjustment in all three groups was significantly different to 0 (RA: t₄₉ = 8.00, p < 0.001, M(sd)=0.191(0.169); EA: t₄₉ = 5.29, p < 0.001, M(sd)=0.134(0.180); HC: t₆₃ = 4.43, p < 0.001, M(sd)=0.107(0.195)).

**Fig. 3: Learning rates, shown using violin plots, with mean and standard error shown using black dots and bars.**

We also examined participants’ learning rates in block 1, in which both rewards and punishments were volatile. This block is particularly well-suited to the detection of baseline negative and positive biases in learning, as both of the outcomes are volatile and thus equally informative. In this exploratory analysis, we did not find a group effect (F_2,.79 = 0.78, p = 0.463; Fig. 3B). There was also no effect of group on the other computational model parameter: inverse temperature (exploratory analysis: F_2,76 = 1.04, p = 0.360; Fig. 3C).

RA group also shows reduced effect of reward volatility on pupil dilation

In our cluster-based permutation mixed-effects model, we observed significant clusters for all contrasts. In particular, we observed a significant interaction effect between group and valence, from 342 ms to 4904 ms after outcome presentation (F₂ = 3.85, p < 0.001, see Supplementary Figure 6). We subsequently performed post-hoc exploratory analyses using permutation mixed-effects models on subsets of the data (separated back into four time courses) to identify the source of this effect. In the reward domain, there were main effects of group, condition, and an interaction effect; this was also true in the punishment domain. Further investigation showed no significant effect of group on pupil responses to volatile rewards (F₂ = 0.01, p = 0.949), but there was a group effect in response to stable rewards (F₂ = 2.73, p = 0.036), driven by the difference between RA and both other groups (vs. HC F₁ = 3.26, p = 0.036, vs. EA F₁ = 5.17, p = 0.029; Fig. 4C). There was no group effect when just including EA and HC (F₁ = 0.38, p = 0.345). Similarly, there was no effect when examining volatile losses (F₂ = 0.156, p = 0.682), but there was when including data for stable losses (F₂ = 2.94, p = 0.022). This was driven by a significant difference between RA and EA (F₁ = 4.86, p = 0.040); and RA and HC (F₁ = 3.03, p = 0.043; Fig. 4B). There was no significant difference between EA and HC (F₁ = 0.85, p = 0.185). The full results of these permutation tests can be observed in the Supplementary material.

**Fig. 4: Results of pupillometry analysis.**

Learning rate adjustment and pupil volatility adjustment correlate in the RA group

The average of the pupil response to rewards, across the time period of the significant group*valence interaction, was positively correlated with learning rate adjustment for rewards in the RA group in an exploratory correlation analysis (r₂₂ = 0.512, p = 0.011; Fig. 4C), but this was not true in any other group (EA: r₂₃ = 0.067, p = 0.751; HC: r₂₈ = −0.176, p = 0.351). We compared these correlation coefficients using a Fisher’s r-to-z transform, and found a significant difference between the RA and HC coefficients (z = 2.56, p = 0.011), but not between the RA and EA coefficients (z = 1.71, p = 0.087). There was no significant correlation between pupil response to losses and learning rate adjustment for losses in any group (ps > 0.2).

Discussion

Contrary to our hypotheses, we found that a group of participants who had recovered from Anorexia Nervosa (RA group) showed greater learning rate adjustment when outcomes changed from volatile to stable (Fig. 3A). There was no difference between healthy control participants (HC) and those with high levels of symptoms but no diagnosed eating disorder (EA). In parallel, we found that the RA group showed greater pupil dilation to volatile vs. stable loss outcomes, soon after outcome delivery (Fig. 4B), and lower pupil dilation in response to volatile rewards compared to stable rewards soon after outcome delivery (Fig. 4B). These effects were driven by elevated pupil dilation to stable win outcomes, and reduced pupil dilation to stable loss outcomes. The pupil response difference for volatile win vs. stable win outcomes was positively correlated with learning rate adjustment in this group (Fig. 4C).

The finding that learning rate adjustment is greater in the RA group (Fig. 3A), is somewhat surprising, as much previous eating disorder research has focused on the trait of cognitive inflexibility as a possible marker for eating disordered behaviour [17, 18, 20,21,22]. Indeed, we hypothesised that we would observe the opposite result: that learning rate adjustment would be reduced. Reduced adjustment has been observed previously in anxiety disorders [27], and in those with high levels of internalizing symptoms [51], although findings in autism are similar to those we observe [52].

This result was specific to those in the RA group, and no differences were found between the EA group and HC group. This may reflect greater premorbid vulnerability to EDs: all of those in the RA group had a previously diagnosed eating disorder, whilst none of the EA group had a current ED diagnosis and were recruited purely on the basis of elevated scores on a symptom questionnaire (though some may be formally diagnosed in the future, but we do not know how many). They may thus not be an appropriate ‘risk’ group, and indeed, perhaps should be considered resilient to eating disorders, by virtue of the combination of elevated symptoms but no formal diagnosis. Importantly, the RA group included only those who had previously been diagnosed with one eating disorder – Anorexia Nervosa – whereas the EA group could include those with symptoms consistent with other eating disorders, so our finding may reflect different cognition between these disorders. Alternatively, it is possible that the EA group is an appropriate choice of risk group, but that differences in cognition are more subtle than would be observed in a group of participants with diagnosed eating disorders. The results presented in Fig. 3 suggest that perhaps this is the best explanation – this group have numerically greater learning rate adjustment values than the HC group, though less than the RA group. Future research should consider this, and use a more conservative estimate of effect sizes for power calculations for risk groups compared to clinically-diagnosed groups.

We also found an effect of group on the pupil dilation during volatile and relative to stable outcome receipt. This effect was driven by differences in the RA groups’ response to stable outcomes. Interestingly, the RA group shows greater pupil dilation to stable rewards and reduced pupil dilation to stable losses (compared to both other groups). This is further evidence that the RA group may be processing volatile outcomes unusually: in fact, the mean pupil dilation to stable rewards was greater than the mean pupil dilation to volatile outcomes (Fig. 4A). It may be significant that the effects are opposite for wins and losses – perhaps this reflects asymmetric processing of outcomes that are noisy but not volatile [53]. This type of imbalance was not, however, reflected either in the ‘baseline’ learning rates in response to the first block or by any effect of valence in the learning rate adjustment analyses. This discrepancy is not necessarily a contradiction: pupil dilation is thought to reflect many underlying computations, including volatility, but also surprise, salience, and mental effort [54, 55]. This discrepancy could thus be due to a fundamental between-groups difference in processing different types of uncertainty, or due to a mismatch in how participants from the different groups experience task difficulty or outcome salience. Future experiments should attempt to control for these other variables to further disentangle this effect. This pupil dilation time course for rewards was positively associated with learning rate adjustment, such that participants who adjusted their learning rates more (i.e., were further away from the HC group) showed more typical pupil dilation patterns (Fig. 4C), which is also consistent with the theory that greater pupil dilation reflects greater noradrenergic activity in response to increased volatility. This allowed us to link our model-based results (learning rate adjustment) with a model-free, physiological measure (pupil dilation). Speculatively, this may be a marker of improved cognitive flexibility: individuals who recover from eating disorders adjust their learning rates further, and their pupil responses are closer to those observed in the control group. It is also surprising that there is no correlation between learning rate adjustment and pupil dilation within the other groups studied, or in the loss domain, as has been observed previously [25].

Previous cognitive neuroscience studies have shown that changes in pupil diameter may reflect a number of different influences, from cognitive effort, to uncertainty, to surprise, to changes in the world [29, 31]. Further work could attempt to manipulate these independently in different ED groups to ascertain which is reflected in our findings. Importantly, various noradrenergic agents such as propanalol [34] or atomoxetine [35] may be able to alter responses to volatility. Above, we note that noradrenaline has been implicated in EDs using measurements of noradrenaline metabolites [38,39,40], with emerging evidence that atomoxetine may be effective in eating disorders featuring binging behaviours [36, 37]. Notably, however, our RA group (in whom we observed pupillometry differences) were not selected for high levels of binging, which may suggest that noradrenaline differences may be more broadly present across ED groups. In light of our findings, experimental medicine studies or early-stage clinical trials of noradrenergic compounds in other, non-binging eating disorders may be a promising new avenue for exploration.

Limitations

This study has several limitations. Firstly, ‘expected’ or ‘irreducible’ uncertainty was not manipulated independently from volatility, and may have its own effects on behaviour and pupil dilation [23, 24, 53]. Specifically, expected uncertainty is high when an outcome is probabilistic rather than deterministic, and is maximal when the probabilities of all outcomes are equiprobable – as was the case in our ‘stable’ blocks, where outcomes were associated with stimuli with 50% probability. This may mean that our results relate to differences in uncertainty estimation more generally, rather than volatility. Future work should use a task in which other versions of uncertainty are held stable whilst volatility is manipulated. On a related note, both outcomes were always volatile in the first block. This design choice allowed us to check for the existence of baseline differences in learning rate. Given that no differences were observed, in future research it might be more beneficial to have a fully randomised block order to avoid any ‘priming’ with expectations of high volatility, and to balance the number of times that participants would be expected to increase (compared to decrease) their learning rates.

Secondly, we did not recruit any individuals who were currently unwell with eating disorders. Part of our rationale for this was to ensure that malnutrition and underweight status did not drive results in this cognitively-demanding task. However, we therefore cannot claim our findings are necessarily representative of those with a current eating disorder.

Conclusions

In conclusion, we find evidence for differences in the processing of outcome volatility between a group who had recovered from Anorexia Nervosa, and healthy control participants. The RA group showed greater adjustment of their learning rates to volatility than controls. In this same group, we also observed an atypical lower pupil dilation to volatile than stable outcomes, particularly in those participants who showed learning rate adjustment closest to control participants. Importantly, given these findings, manipulation of noradrenaline levels using pharmacological agents may be an interesting future direction for eating disorder research.

Disclaimer

The views expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health.

Data availability

Open data and code are not available as not all participants consented to have their data shared, even if they could not be identified after anonymisation.

References

American Psychiatric Association: Diagnostic and Statistical Manual of Mental Disorders, 5th Edition. American Psychiatric Publishing, Inc. 2013 https://doi.org/10.1176/appi.books.9780890425596.744053
Galmiche M, Déchelotte P, Lambert G, Tavolacci MP. Prevalence of eating disorders over the 2000–2018 period: a systematic literature review. Am J Clin Nutr. 2019;109:1402–13.
Article PubMed Google Scholar
Smink FRE, van Hoeken D, Hoek HW. Epidemiology of eating disorders: incidence, prevalence and mortality rates. Curr Psychiatry Rep. 2012;14:406–14.
Article PubMed PubMed Central Google Scholar
Lindvall Dahlgren C, Wisting L, Rø Ø. Feeding and eating disorders in the DSM-5 era: a systematic review of prevalence rates in non-clinical male and female samples. J Eat Disord. 2017;5:56.
Article PubMed PubMed Central Google Scholar
Winkler LA-D, Christiansen E, Lichtenstein MB, Hansen NB, Bilenberg N, Støving RK. Quality of life in eating disorders: a meta-analysis. Psychiatry Res. 2014;219:1–9.
Article PubMed Google Scholar
Steinhausen H-C. The outcome of anorexia nervosa in the 20th century. AJP. 2002;159:1284–93.
Article Google Scholar
Fichter MM, Quadflieg N, Crosby RD, Koch S. Long-term outcome of anorexia nervosa: results from a large clinical longitudinal study: FICHTER et al. Int J Eat Disord. 2017;50:1018–30.
Article PubMed Google Scholar
Arcelus J, Mitchell AJ, Wales J, Nielsen S. Mortality rates in patients with anorexia nervosa and other eating disorders: a meta-analysis of 36 studies. Arch Gen Psychiatry. 2011;68:724.
Article PubMed Google Scholar
Fichter MM, Quadflieg N. Mortality in eating disorders - results of a large prospective clinical longitudinal study: MORTALITY IN EATING DISORDERS. Int J Eat Disord. 2016;49:391–401.
Article PubMed Google Scholar
Keel PK, Brown TA (2010): Update on course and outcome in eating disorders. Int J Eat Disord.
Bergh C, Callmar M, Danemar S, Hölcke M, Isberg S, Leon M, et al. Effective treatment of eating disorders: Results at multiple sites. Behav Neurosci. 2013;127:878–89.
Article PubMed Google Scholar
Agüera Z, Sánchez I, Granero R, Riesco N, Steward T, Martín-Romera V, et al. Short-term treatment outcomes and dropout risk in men and women with eating disorders: treatment outcome in males with ED. Eur Eat Disord Rev. 2017;25:293–301.
Article PubMed Google Scholar
Helverskov JL, Clausen L, Mors O, Frydenberg M, Thomsen PH, Rokkedal K. Trans-diagnostic outcome of eating disorders: a 30-month follow-up study of 629 patients. Eur Eat Disord Rev. 2010;18:453–63.
Article CAS PubMed Google Scholar
Wilson GT. Psychological treatment of eating disorders. Annu Rev Clin Psychol. 2005;1:439–65.
Article PubMed Google Scholar
Himmerich H, Kan C, Au K, Treasure J. Pharmacological treatment of eating disorders, comorbid mental health problems, malnutrition and physical health consequences. Pharmacol Ther. 2021;217:107667.
Article CAS PubMed Google Scholar
Godier LR, Park RJ (2014): Compulsivity in anorexia nervosa: a transdiagnostic concept. Front Psychol. 5. https://doi.org/10.3389/fpsyg.2014.00778
Tchanturia K, Anderluh MB, Morris RG, Rabe-Hesketh S, Collier DA, Sanchez P, et al. Cognitive flexibility in anorexia nervosa and bulimia nervosa. J Int Neuropsychol Soc. 2004;10:513–20.
Article PubMed Google Scholar
Wang SB, Gray EK, Coniglio KA, Murray HB, Stone M, Becker KR, et al. Cognitive rigidity and heightened attention to detail occur transdiagnostically in adolescents with eating disorders. Eat Disord. 2021;29:408–20.
Article PubMed Google Scholar
Grant DA, Berg E. A behavioral analysis of degree of reinforcement and ease of shifting to new responses in a Weigl-type card-sorting problem. J Exp Psychol. 1948;38:404–11.
Article CAS PubMed Google Scholar
Roberts ME, Tchanturia K, Stahl D, Southgate L, Treasure J. A systematic review and meta-analysis of set-shifting ability in eating disorders. Psychol Med. 2007;37:1075–84.
Article PubMed Google Scholar
Holliday J, Tchanturia K, Landau S, Collier D, Treasure J. Is impaired set-shifting an endophenotype of anorexia nervosa. Am J Psychiatry. 2005;162:2269–75.
Article PubMed Google Scholar
Tchanturia K, Harrison A, Davies H, Roberts M, Oldershaw A, Nakazato M, et al. Cognitive flexibility and clinical severity in eating disorders. PLoS ONE. 2011;6:1–5.
Article Google Scholar
Pulcu E, Browning M. The misestimation of uncertainty in affective disorders. Trends Cogn Sci. 2019;23:865–75.
Article PubMed Google Scholar
Yu AJ, Dayan P. Uncertainty, neuromodulation, and attention. Neuron. 2005;46:681–92.
Article CAS PubMed Google Scholar
Pulcu E, Browning M. Affective bias as a rational response to the statistics of rewards and punishments. eLife. 2017;6:e27879.
Article PubMed PubMed Central Google Scholar
Behrens TEJ, Woolrich MW, Walton ME, Rushworth MFS. Learning the value of information in an uncertain world. Nat Neurosci. 2007;10:1214–21.
Article CAS PubMed Google Scholar
Browning M, Behrens TE, Jocham G, O’Reilly JX, Bishop SJ. Anxious individuals have difficulty learning the causal statistics of aversive environments. Nat Neurosci. 2015;18:590–6.
Article CAS PubMed PubMed Central Google Scholar
Kaye WH, Bulik CM, Ph D, Thornton L, Barbarich N, Masters K. Comorbidity of anxiety disorders with anorexia and bulimia nervosa. Psychiatry. 2004;161:2215–21.
Google Scholar
Nassar MR, Rumsey KM, Wilson RC, Parikh K, Heasly B, Gold JI. Rational regulation of learning dynamics by pupil-linked arousal systems. Nat Neurosci. 2012;15:1040–6.
Article CAS PubMed PubMed Central Google Scholar
Joshi S, Li Y, Kalwani RM, Gold JI. Relationships between pupil diameter and neuronal activity in the locus coeruleus, colliculi, and cingulate cortex. Neuron. 2016;89:221–34.
Article CAS PubMed Google Scholar
Preuschoff K, ’T Hart BM, Einhäuser W. Pupil dilation signals surprise: evidence for noradrenaline’s role in decision making. Front Neurosci. 2011;5:1–12.
Article Google Scholar
Devauges V, Sara SJ. Activation of the noradrenergic system facilitates an attentional shift in the rat. Behav Brain Res. 1990;39:19–28.
Article CAS PubMed Google Scholar
McGaughy J, Ross RS, Eichenbaum H. Noradrenergic, but not cholinergic, deafferentation of prefrontal cortex impairs attentional set-shifting. Neuroscience. 2008;153:63–71.
Article CAS PubMed Google Scholar
Lawson RP, Bisby J, Nord CL, Burgess N, Rees G. The computational, pharmacological, and physiological determinants of sensory learning under uncertainty. Curr Biol. 2021;31:163–.e4.
Article CAS PubMed PubMed Central Google Scholar
Jepma M, Murphy PR, Nassar MR, Rangel-Gomez M, Meeter M, Nieuwenhuis S. Catecholaminergic regulation of learning rate in a dynamic environment. PLoS Comput Biol. 2016;12:1–24.
Article Google Scholar
McElroy SL, Guerdjikova A, Kotwal R, Welge JA, Nelson EB, Lake KA, et al. Atomoxetine in the treatment of binge-eating disorder: a randomized placebo-controlled trial. J Clin Psychiatry. 2007;68:390–8.
Article CAS PubMed Google Scholar
Wilfahrt RP, Wilfahrt LG, Matthews Hamburg A. Atomoxetine reduced binge/purge symptoms in a case of anorexia nervosa binge/purge type. Clin Neuropharm. 2021;44:68–70.
Article Google Scholar
Pirke KM. Central and peripheral noradrenalin disorders. Psychiatry Res. 1996;62:43–49.
Article CAS PubMed Google Scholar
Kaye WH, Jimerson DC, Lake CR, Ebert MH. Altered norepinephrine metabolism following long-term weight recovery in patients with anorexia nervosa. Psychiatry Res. 1985;14:333–42.
Article CAS PubMed Google Scholar
Kaye WH, Ebert MH, Raleigh M, Lake CR. Abnormalities in CNS monoamine metabolism in anorexia nervosa. Arch Gen Psychiatry. 1984;41:350–5.
Article CAS PubMed Google Scholar
Garner DM, Olmsted MP, Bohr Y, Garfinkel PE. About the eating attitudes test - the eating attitudes test (EAT-26). 1979;1–23 http://www.Eat-26.Com/Screening.Php.
Fairburn CG, Beglin SJ. Eating disorder examination. Cognitive behavior therapy and eating disorders. New York: Guilford Press. 2008;265–308.
Bohn K, Fairburn CG. The clinical impairment assessment questionnaire. Cognitive behavior therapy and eating disorders. New York: Guilford Press. 2008.
Haines N, Kvam PD, Irving LH, Smith C, Beauchaine TP, Pitt MA, et al. Learning from the reliability paradox: how theoretically informed generative models can advance the social, behavioral, and brain sciences. PsyArXiv. 2020. https://doi.org/10.31234/osf.io/xr7y3
Wiecki TV, Poland J, Frank MJ. Model-based cognitive neuroscience approaches to computational psychiatry: clustering and classification. Clin Psychol Sci. 2015;3:378–99.
Article Google Scholar
Daw ND. Trial-by-trial data analysis using computational models. Decis Mak, affect, Learn: Atten Perform XXIII. 2011;23:26.
Google Scholar
Pulcu E, Shkreli L, Holst CG, Woud ML, Craske MG, Browning M, et al. The effects of the angiotensin II receptor antagonist losartan on appetitive versus aversive learning: a randomized controlled trial. Biol Psychiatry. 2019;86:397–404.
Article CAS PubMed Google Scholar
Huys QJM, Cools R, Gölzer M, Friedel E, Heinz A, Dolan RJ, et al. Disentangling the roles of approach, activation and valence in instrumental and pavlovian responding ((A Rangel, editor)). PLoS Comput Biol. 2011;7: e1002028.
Voeten CC. Permutes: permutation tests for time series data, version 2.6. Retrieved October 26, 2022, from https://CRAN.R-project.org/package=permutes
Lee OE, Braun TM. Permutation tests for random effects in linear mixed models. Biometrics. 2012;68:486–93.
Article PubMed Google Scholar
Gagne C, Zika O, Dayan P, Bishop SJ. Impaired adaptation of learning to contingency volatility in internalizing psychopathology ((A Shackman, JI Gold, A Stringaris, & SJ Gershman, editors)). eLife. 2020;9: e61387.
Lawson RP, Mathys C, Rees G. Adults with autism overestimate the volatility of the sensory environment. Nat Neurosci. 2017;20:1293–9.
Article CAS PubMed PubMed Central Google Scholar
Piray P, Daw ND. A model for learning based on the joint estimation of stochasticity and volatility [no. 1]. Nat Commun. 2021;12:6587.
Article CAS PubMed PubMed Central Google Scholar
van der Wel P, van Steenbergen H. Pupil dilation as an index of effort in cognitive control tasks: A review. Psychon Bull Rev. 2018;25:2005–15.
Article PubMed PubMed Central Google Scholar
Zénon A. Eye pupil signals information gain. Proc R Soc B: Biol Sci. 2019;286:20191593.
Article Google Scholar

Download references

Acknowledgements

Many thanks to the participants who took part in this study – this work would not be possible without their time and energy. Thanks are also due to the groups ACP was affiliated with during her DPhil: the Psychopharmacology Research Group and the Oxford Brain-Body Research into Eating Disorders group, both based in the Department of Psychiatry in Oxford, and the many collaborators who have given feedback on these results.

Funding

This research was funded by the Medical Research Council and the Department of Psychiatry at the University of Oxford via a studentship to ACP (ref: 1650420), and supported by the National Institute of Health Research (NIHR) Oxford Health Biomedical Research Centre (OHBRC). ACP’s salary was previously supported by an MRC Senior Non-Clinical Award to her advisor, Professor Oliver Robinson (MR/R020817/1). The funders had no role in the study design, collection, analysis, or interpretation of data, or the decision to submit this manuscript for publication.

Author information

These authors jointly supervised this work: Michael Browning, Erdem Pulcu.

Authors and Affiliations

Department of Psychology and York Biomedical Research Institute, University of York, Heslington, York, YO10 5DD, UK
Alexandra C. Pike
Anxiety Laboratory, Neuroscience and Mental Health Group, Institute of Cognitive Neuroscience, University College London, 17-19 Queen Square, London, WC1N 3AR, UK
Alexandra C. Pike
Department of Psychiatry, University of Oxford, Warneford Hospital, Oxford, OX3 7JX, UK
Alexandra C. Pike, Ann L. Sharpley, Rebecca J. Park, Philip J. Cowen, Michael Browning & Erdem Pulcu
Oxford Health NHS Foundation Trust, Warneford Hospital, Oxford, UK
Alexandra C. Pike, Ann L. Sharpley, Rebecca J. Park, Philip J. Cowen, Michael Browning & Erdem Pulcu

Authors

Alexandra C. Pike
View author publications
Search author on:PubMed Google Scholar
Ann L. Sharpley
View author publications
Search author on:PubMed Google Scholar
Rebecca J. Park
View author publications
Search author on:PubMed Google Scholar
Philip J. Cowen
View author publications
Search author on:PubMed Google Scholar
Michael Browning
View author publications
Search author on:PubMed Google Scholar
Erdem Pulcu
View author publications
Search author on:PubMed Google Scholar

Contributions

ACP: Conceptualization, methodology, software, formal analysis, investigation, data curation, writing – original draft, writing – review and editing, visualization, project administration. ALS: Investigation, data curation, writing – review and editing, project administration, supervision. RJP: Resources, writing – review and editing, supervision. PJC: Conceptualization, resources, writing – review and editing, supervision, project administration, funding acquisition. MB: Conceptualization, methodology, software, validation, resources, writing – review and editing, supervision. EP: Conceptualization, methodology, software, validation, formal analysis, writing – review and editing, supervision.

Corresponding author

Correspondence to Alexandra C. Pike.

Ethics declarations

Competing interests

ACP has received funding from the Wellcome Trust (226694/Z/22/Z). She currently sits on the Council of the British Association of Psychopharmacology. She was also the named secondee on an MRC-Proximity to discovery award (PI: Dr Oliver Robinson) with Roche (who provide in-kind contributions and have sponsored travel) regarding work on heart-rate variability and anxiety. MB has acted as a consultant for Janssen Research, P1vital Ltd and CHDR, owns shares in P1vital Products Ltd and was previously a paid employee of P1vital Ltd. No other authors report any conflicts of interest.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Pike, A.C., Sharpley, A.L., Park, R.J. et al. Adaptive learning from outcome contingencies in eating-disorder risk groups. Transl Psychiatry 13, 340 (2023). https://doi.org/10.1038/s41398-023-02633-w

Download citation

Received: 06 December 2021
Revised: 13 October 2023
Accepted: 18 October 2023
Published: 04 November 2023
Version of record: 04 November 2023
DOI: https://doi.org/10.1038/s41398-023-02633-w

This article is cited by

Examining the biological causes of eating disorders to inform treatment strategies
- Claire J. Foldi
- Kristi R. Griffiths
Nature Reviews Neuroscience (2025)
Reinforcement Learning and Decision Making in Anorexia Nervosa
- Christina E. Wierenga
- Carina S. Brown
- Erin E. Reilly
Current Psychiatry Reports (2025)

Subjects

Abstract

Similar content being viewed by others

Neural activation of regions involved in food reward and cognitive control in young females with anorexia nervosa and atypical anorexia nervosa versus healthy controls

Exaggerated frontoparietal control over cognitive effort-based decision-making in young women with anorexia nervosa

Altered social cognition in a community sample of women with disordered eating behaviours: a multi-method approach

Introduction

Methods and materials

Participants

General procedure

The volatility task

General statistical approach

Behavioural analysis

Data quality

Switch/stay analysis

Reinforcement learning analysis

Statistical analysis

Pupillometry analysis

Preprocessing

Statistical analysis

Results

Participant characteristics

Switch/stay analyses do not discriminate between ED groups and healthy controls

Best-fitting reinforcement learning model

RA group show greater learning rate adjustment

RA group also shows reduced effect of reward volatility on pupil dilation

Learning rate adjustment and pupil volatility adjustment correlate in the RA group

Discussion

Limitations

Conclusions

Disclaimer

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Examining the biological causes of eating disorders to inform treatment strategies

Reinforcement Learning and Decision Making in Anorexia Nervosa

Search

Quick links