Introduction

A central cognitive mechanism in the onset and progression of social anxiety may involve the impact of socio-evaluative stress on working memory (WM). WM can be defined as a multicomponent system that updates and manipulates temporarily stored information for higher cognitive functions, such as reasoning or learning1,2. This system consists of the subcomponents verbal working memory, visual-spatial working memory, attentional control system and a temporary storage system that modulates and integrates sensory information (“episodic buffer”)3. The functioning of these WM systems can be affected by stress through multiple pathways4,5. Stress can activate the sympathetic nervous system (SNS) and an increased activity of the hypothalamic–pituitary–adrenal axis (HPA axis)6. These processes lead to the release of glucocorticoids from the adrenal cortex (GCs), which can affect prefrontal cortex functioning, such as the capacity to hold and manipulate information temporarily, a core component of working memory (WM)7,8. High levels of cortisol can impair the prefrontal cortex (PFC), with noradrenaline influencing PFC function in an inverted U-shaped manner, ranging from fatigue (too little) to alertness (medium) to stress (too much)9. This pattern has been reliably demonstrated in both human and animal studies10.

Moreover, stress induces a shift in neural activity toward more emotionally driven brain regions, such as the amygdala, which plays a key role in fear conditioning and the consolidation of emotionally salient information11. That is, attention is diverted away from working memory tasks and instead directed toward threat-relevant cues in the environment. Consequently, some studies have reported that stress can enhance the consolidation of rather declarative long-term memories, particularly those with emotional significance12. Finally, stress can impair WM by increasing cognitive load through stress-related interference13. This includes intrusive thoughts and the intentional suppression of these distressing thoughts, both of which compete for the brain’s limited attentional resources, ultimately decreasing WM performance14,15. In sum, stress impairs working memory by disrupting prefrontal cortex (PFC) functioning, shifting neural processes toward brain regions involved in processing emotions, and consuming limited cognitive resources available to the PFC (see Fig. 1). Such stress-related WM disruptions may be associated with distorted or misremembered social experiences, which could play a role in the development and maintenance of social anxiety16,17,18.

Fig. 1
figure 1

The Pathway of how stress affects working memory performance.

Impaired recall of social information may help explain why individuals with social anxiety continue to perceive everyday interactions as threatening, even when evidence suggests otherwise. According to learning theory, a person with SAD should, over time, accumulate evidence that their feared outcomes do not materialize, given the inevitability of social interactions in daily life19. Distorted working memory processes might hinder people with SAD to remember, and thus learn from positive social interaction, which could replace overly negative expectations17. The Attentional Control Theory13 posits that individuals with high social anxiety excessively put resources to process socially threatening information at the cost of task-relevant cognitive processes, which leads to lower task performance. For example, a socially anxious person might have the intrusive and recurrent thought “I will embarrass myself in front of an audience.” when they must perform in a social situation20. The resulting negative cognitions then consume attentional resources, while participants attempt to complete a task involving executive functioning. In addition, information that could disconfirm maladaptive beliefs (e.g., noticing that someone is reacting positively to their performance) is less likely to be processed and recalled. In sum, social evaluation is likely to more strongly impair executive control, specifically memory updating, in individuals with elevated social anxiety.

In line with this, previous studies have shown that socially anxious individuals, recall fewer details of positive feedback after holding a presentation21 and display a less accurate recall of interpersonal information22. Yet, there are only a few studies, with mixed findings, investigating the impact of acute socio-evaluative stress on WM in socially anxious individuals. Two studies using modified version of the Trier Social Stress Task (TSST) found that socio-evaluative stress induction does not lead to lower memory performance in socially anxious individuals23,24. Another study, using videotaping as a social stress induction, found decreased memory performance in socially anxious individuals25.

The effect of stress on WM is strongly dependent on the intensity, timing and the type of stress4,12,26. This paper focuses on the induction of psychosocial stress, as it is particularly relevant for investigating social anxiety, a condition fundamentally characterized by the fear of negative evaluation by others27. To successfully evoke psychosocial stress, a laboratory task must1 create an uncontrollable situation2, create a forced failure context in which one cannot succeed and .3 include socio-evaluative threat, for example by having an observer judge the task performance27. Prominent paradigms to investigate psychosocial stress induction are the Trier Social Stress Task28, Montreal Imaging Stress Task (MIST)29 and the ScanSTRESS paradigm30. In the following section we will review these paradigms and explain the reasoning for creating the SENT paradigm.

The Trier Social Stress Task28, consists of 2 phases: an anticipation phase (10 min) and the task phase (10 min). The task often encompasses to perform a free speech (e.g., job interview) and mental arithmetic in front of an evaluative audience. The TSST has been adapted to measure WM performance; however, WM is measured only after the psychosocial stress induction phase12,26,31,32. Consequently, the existing modifications of the TSST paradigm do not allow examining how transient WM processes, such as ongoing memory retrieval and encoding processes, are impacted by acute socio-evaluative stress.

Sociopsychological stress paradigms that allow testing the effects of acute psychophysiological stressors are the ScanSTRESS paradigm30 and the Montreal Imaging Stress Task (MIST)29. During the MIST, participants are asked to solve mental arithmetic problems (e.g., 27 × 13 − 5) presented on a screen. Socio-evaluative stress is induced by telling participants that they are being evaluated by experimenters or observers (sometimes via camera or live feedback), and by giving negative textual feedback between mental arithmetic tasks (e.g. “Too slow”). Moreover, the task adapts in real time to ensure a high failure rate (e.g., shortening the response time or increasing difficulty). While this method effectively elicits stress responses, it does not include a working memory task. Moreover, it can compromise the reliability of behavioral data, as enforced task failure does not result in comparable working memory performance across individuals (i.e. some participants will have more difficult tasks than others).

During the ScanSTRESS paradigm30 participants are required to perform cognitively demanding tasks, a mental rotation and serial subtraction task, under time pressure, all while being observed by a panel of investigators visible via live video feed within the scanner environment. To heighten the social-evaluative threat, the panel delivers standardized negative feedback in real time. Both task difficulty and response speed are dynamically adjusted based on the participant’s performance, ensuring a high rate of failure and a strong sense of uncontrollability. To our awareness, no published study has incorporated a working memory task, such as the N-back, within the ScanSTRESS paradigm. Furthermore, the paradigm’s enforced task failure and dynamic difficulty adjustment compromises the comparability of working memory performance. Finally, both the MIST and ScanSTRESS paradigms allow participants to divert their attention away from the social-evaluative threat, as the threatening cues are either positioned above the task display or, in some cases, not shown at all.

Building on these previous paradigms, we developed the Socio-evaluative N-back Task (SENT), which integrates real-time webcam feedback and adaptive textual and verbal evaluation during a cognitively demanding task. SENT advances previous stress paradigms in three key ways: Firstly, this paradigm allows measuring the impact of acute socio-evaluative stress on ongoing memory retrieval and encoding processes associated with working memory. This is achieved by creating a social stress scenario in which the individual must perform in front of a judgmental audience in real time, rather than performing a cognitive task after stress induction. Secondly, the task prevents participants from diverting their attention away from the social threat. This is accomplished by projecting the task directly in front of the judging audience’s faces, forcing participants to maintain visual contact with the evaluators while completing the task, thereby eliminating the possibility of avoiding eye contact or looking only at the task to reduce discomfort. Finally, this task allows for the collection of comparable working memory performance data during exposure to social-evaluative stress. Unlike adaptive paradigms, the task maintains a consistent level of difficulty, using a demanding working memory task (i.e., 3- N-back task). Furthermore, it provides adaptive feedback by displaying the number of errors or emphasizing when reaction times are too slow, thereby sustaining performance pressure without modifying the task’s level of difficulty.

In sum, this study aims to evaluate the effectiveness of a newly developed social stress paradigm, the Socio-evaluative N-back Task (SENT). Moreover, we examine whether individuals with varying levels of social anxiety exhibit differences in stress response and working memory performance. Task performance is defined as working memory accuracy and response speed. In addition, the study investigates the physiological impact of the SENT task by examining stress-related indicators, including tonic electrodermal activity and heart rate variability. These markers reflect sympathetic and parasympathetic nervous system activity, respectively and are used as indicators of heightened stress33. In line with previous studies, we hypothesize that all participants will exhibit heightened tonic electrodermal activity and reduced heart rate variability28,34, prolonged reaction times35, and reduced memory performance12,31, in response to negative social evaluation. In addition, compared to low anxious individuals, we expect that socially anxious individuals will demonstrate heightened physiological stress responses, increased reaction times, and diminished memory performance25 in response to socio-evaluative threat.

Methods

Sampling and participants

Participants were recruited via bulletins at supermarkets and psychotherapeutic centers, and online advertisements on eBay classifieds and universities’ websites. Participants had to fill in an online pre-questionnaire on the UniPark assessment platform. Participants were recruited if they fall below 13 (i.e., low social anxiety) or above 25 (high social anxiety) on the Social Phobia Inventory (SPIN)36. The cut-off scores are based on previous validation studies using the SPIN questionnaire37,38. Inclusion criteria encompassed age > 18, ability to consent to the study, and fluent German language skills. Participants were excluded if they suffer from respiratory or cardiac disease, neurological disorders, lifetime-prevalence of Schizophrenia, severe depression, substance use disorder, antisocial and/or borderline personality disorder according to DSM-539 criteria. Moreover, exclusion criteria were pregnancy, heavy speaking impairment, non-correctible vision and hearing impairments, regular drug intake and irregular intake of psychopharmacology. In addition, participants in the control group were excluded if any prior or current diagnosis of mental disorder according to ICD-10/DSM-5 was given. If participants were eligible to participate in the study, they were contacted and we conducted a clinical interview on the German version of SCID-5-CV and SCID-5-PD40. A total of 13 participants were diagnosed with social anxiety disorder. All experimental protocols were approved by the Ethics Committee of Freie Universität Berlin (ref: 033/2020), and all methods were carried out in accordance with the relevant guidelines and regulations. We obtained informed consent from all subjects and/or their legal guardian(s) for publication of identifying information/images in an online open-access publication.

Socio-evaluative N-back task (SENT)

During the N-back task, participants continuously observed a screen displaying a live video of one male and one female observer, both of whom had no prior contact to the participants and were seated in a “monitoring room.“. The N-back numbers were then displayed “on top” of the faces of the two observers (see Fig. 2). Therefore, attending to the social evaluative threat could not be avoided if one wants to solve that task. During this task, participants observed a series of 16 briefly presented numbers and decide in each trial (i.e., press a button) if the currently presented number is the same as the number presented 3-trials ago. Each number was presented for 500 ms, followed by an interstimulus interval of 1500 ms during which the participant could give a response. If participants made a mistake, a red box was displayed surrounding the number for 250ms (see Fig. 3). As shown in Fig. 4, there were 2 randomized control and stress conditions, each condition consists of 6 blocks of N-back tasks, which presented 16 numbers, with 5 randomized targets (pressing button is correct) and 11 randomized non-targets (pressing button is incorrect). The task was programmed using Python (Python Software Foundation, https://www.python.org/) and is freely available on https://osf.io/h29rp/.

Fig. 2
figure 2

Depiction of the Socio-evaluative N-back Task. During the stress phase, participants were observed by a female and male experimenter (upper picture). During the control phase, the experimenter were faced to the other direction (lower picture).

Fig. 3
figure 3

The inter-trial interval, which consisted of number display (500 ms), inter-stimulus interval (ISI) (1500) and an additional 250ms in case a mistake was made. The reaction time window is 2000 ms in total.

Fig. 4
figure 4

Schematic depiction of the Socio-evaluative N-back task.

In total three experimenters are involved during the task. Person A, the experimenter, explains that the study is a memory performance test examining how effort and concentration relate to physiological responses and whether these can predict performance. Participants are given a chance to practice the N-back task and ask questions. The experimenter also introduces the setup, including the observer room, where a panel of psychologists monitors performance, and the experiment room, where participants complete the task under webcam observation. Persons B and C, a male and a female panel member, introduce themselves, over a webcam from the observer room, as part of the psychological staff responsible for monitoring participants’ performance. They explain the task and tell participants to put in full effort during all sessions. For detailed instructions to the experimenter and the cover story, please refer to the Supplementary Information online.

To ensure comparability of the N-back task performance between subjects, the difficulty of the N-back task stayed even across each N-back blockand all participants. The difficulty level of the N-back task (including factors such as the number of N-backs and inter-stimulus interval between numbers) was set to ensure it was neither overly challenging, which could compromise the validity of the memory performance measure, nor too easy, which may not elicit stress.

Each participant completed the N-back task in both a control and stress condition (see Fig. 4). In stress conditions, any errors, whether false alarms or misses, were visually signaled by a red box appearing around the displayed number. Additionally, participants received both textual and verbal negative feedback. Textual feedback was provided following every second N-back block (each consisting of 16 numbers). When participants committed more than 2 errors during the preceding two blocks a negative text message was displayed, for instance: “You made 3 errors. On average, participants made 2 errors. Specifically, you responded once when a response was not required and missed two required responses.”. Conversely, if participants made no more than 2 errors, the average reaction time minus 80-100ms was automatically calculated and presented as textual negative feedback, such as, “Your reaction time is 621ms, whereas the average participant reacted within 514ms. Your reaction time needs improvement.” This dynamic adjustment of feedback was adopted to enhance the credibility of the negative feedback.

Additionally, participants received verbal negative feedback twice following every third N-back block. A member of the opposite sex, acting as an observer, entered the room and conveyed the following to the participant: (1) expressing dissatisfaction with their performance, (2) emphasizing the necessity for improved performance to ensure data quality, and (3) urging an increase in performance during the subsequent block. In contrast, during the control condition, participants were not observed, and they did not receive any form of negative feedback. Additionally, the observers turned around so that it became clear that they would not be able to see the participant (see Fig. 2).

Study procedure, measurements and data preparation

After signing an informed consent, participants filled in a questionnaire consisting of demographic questions and the Beck Depression Inventory (BDI-II)41. Next, participants had electrodes attached for recording of the electrocardiogram (ECG) and electrodermal activity (EDA). Then, they were seated for a 7-minute resting phase during which they were instructed to look at a fixation cross. Thus, participants had approximately a total of 30-minute resting period before the experiment, to exclude stress caused by arrival to the laboratory. Afterwards, participants were instructed in the N-back task and performed a test trial. The entire study took approximately 55 min to complete.

N-back performance

Performance of the N-back task was divided into reaction time and WM accuracy. Reaction time is the time interval between the display of the number and a button press. Memory accuracy was calculated via the following formula:

$$Memory accuracy = (Hits/Possible Hits) - (False Alarms/Possible False Alarms).$$

This was done to penalize participants who press indiscriminately. Participants could correctly press a button in response to targets (= hits), 5 times in each block. Moreover, participants could incorrectly press a button in response to non-targets (= false alarms) 11 times in each N-back block.

Psychophysiological stress response

Tonic electrodermal activity

Electrodermal activity (EDA) was recorded via two sintered Ag/AgCl electrodes attached to the hypothenar of the participant’s non-dominant hand, with a constant voltage of 0.5 V. The signal was recorded using the GSR MR Module by BrainProducts. The incoming raw signal was filtered with a low cutoff time constant (10s) and high cutoff frequency (250 Hz). Skin conductance data was analyzed with the Matlab analysis software Ledalab V3.2.3 (www.ledalab.de). The SCR data were visually inspected and down-sampled to 100 Hz, we applied Butterworth – lowpass filter and the data was separated into phasic and tonic activity via a Decomposition Analysis42. This method returns a continuous measure of tonic EDA. This method aims at retrieving the activity properties of the underlying sudomotor nerve activity, avoiding estimation biases produced by overlapping responses. A response window of 36 s was used to cover the N-back trial and the minimum amplitude criterion was set to 0.01 µS, finally the SCR data was z-standardized.

Heart rate variability

Electrocardiography (ECG) was recorded via three Ag/AgCl electrodes. Two Ag/AgCl electrodes were attached under the right clavicle and the lowest rib on the left. An additional Ag/AgCl electrode was attached to the left clavicle and served as ground electrode. The signal (1025 Hz) was recorded using the GSR MR Module by BrainProducts. Custom-made Python-scripts (Python Software Foundation, https://www.python.org/) were developed to analyze relevant time frames, including the toolbox BioSSPy43, which was used to band-pass filter the data (3 to 45 Hz; finite impulse response filter) and detect R-spikes based on the approach of Hamilton44. The detected R-spikes were visually inspected and manually corrected. Moreover, we applied Kamath ectopic beats removal method to the resulting RR-intervals. Using the BioSSPY package, we calculated the root mean square of successive differences between normal heartbeats (RMSSD) for a 36s interval45. Thus, RMSSD reflects the beat-to-beat variance in heart rate and is the principal time-domain measure to estimate vagally mediated changes in heart rate variability46.

Statistical analysis

All statistical analyses were conducted in R (Version 3.5.3, http://cran.r-project.org/). Working memory accuracy and reaction time were averaged for each 3-back block (i.e. 16 trials). That is, we accounted for the multilevel structure of the data by analyzing the effects of stress and control conditions on 24 N-back blocks for each participant. To consider the hierarchical data structure, we build 4 multilevel random intercept models with memory accuracy, reaction time, tonic electrodermal activity and heart rate variability (RMSSD) as outcomes. First level was the individual participant, and second level was their score on the outcome variable. We included stress condition, as well as social anxiety as categorical predictors and their interaction term. Moreover, we included the following control variables (covariates): age, gender, and depressiveness (BDI scores). To determine whether a random intercept or random slope model is more appropriate for our data, we followed a model selection procedure using the Akaike Information Criterion47. A more detailed description of the multilevel model, including the random effects can be found online (Supplementary Table S1-S4).

To identify outliers, we visually checked a boxplot of each outcome variable as well employed the z-score method to detect extreme values in normally distributed data (see Supplementary Fig. S1-S4). That is, each value was standardized by calculating its z-score, which measures the number of standard deviations it lies from the mean. Observations with an absolute z-score greater than 3 were considered outliers, as they deviate significantly from the expected range. To assess the robustness of our results, we repeated the analyses after excluding extreme outliers. The results were consistent with those reported in the Results section, suggesting that the observed effects were not attributable to extreme values (see Supplementary Table S5-S8). The analysis script and task can be found on https://osf.io/h29rp/.

Results

In total, 960 participants initiated the online eligibility questionnaire. A considerable proportion either did not meet the required SPIN criteria (< 13 or > 25; 35%) or discontinued the questionnaire before completion (22%). Of the total 65 invited participants, 7 (3 LSA, 4 HSA; 11% of total sample) stopped participating during the study, because it was too stressful. Due to a technical problem one participant had invalid EDA and ECG data and was subsequently excluded from the analyses. The final sample size was 57. Recruitment flow is shown in Fig. 5, sample characteristics are shown in Table 1.

Fig. 5
figure 5

Recruitment flow.

Table 1 Demographics and sample characteristics.

Manipulation check

To assess physiological changes before and during the experiment, we compared heart rate during a 7-minute resting state measurement prior to the task with heart rate during the experimental condition. As shown in Fig. 6, there was an increase in heart rate from the resting state to the task phase. A paired samples t-test revealed a statistically significant increase in heart rate from the Resting State (M = 72.3 bpm) to the SENT (M = 78.5 bpm), t(49) = -5.20, p < .001. The mean difference was 6.24 bpm, 95% CI [-8.65, -3.83]. This is comparable to the task effect observed in the ScanSTRESS paradigm, which elicited an increase in heart rate of 8.98 bpm during stress conditions30.

Fig. 6
figure 6

Manipulation check. Comparison of mean heart rate during a 7-minute resting phase and the average heart rate recorded during the Socio-evaluative N-back Task (SENT).

In general, the feedback provided during the task was accurate, as it was adaptively linked to the participants’ performance. During the post-experiment debriefing, no participant indicated that they were certain the task was deliberately designed to induce stress or that the feedback was false.

The effects of socio-evaluative stress on reaction time and memory accuracy

Figure 7 illustrates the impact of the between-subject factor social anxiety, as well as the within-subject factor stress and control conditions, on reaction time and memory accuracy.

Fig. 7
figure 7

Mean plot of memory accuracy (A) and reaction time (B) according to the two conditions (Stress/Control), as well as according to the anxiety groups (High/Low Social Anxiety).

Memory accuracy

Memory accuracy was significantly lower in the stress condition compared to the control condition, b = -0.06, t(1309) = -3.70, p < .001. When comparing social anxiety groups, there was no significant difference in memory accuracy between individuals with low and high social anxiety, b = -0.03, t(52) = -0.59, p = .559. Additionally, there was no significant interaction between stress condition and social anxiety group, b = 0.01, t(1309) = 0.47, p = .64. Among the covariates tested, only depressiveness was significantly associated with memory accuracy, b < 0.01, t(52) = -2.11, p = .039. All other covariates were non-significant. See Table 2 for full results.

Table 2 Multiple regression results with memory accuracy as the dependent variable.

Reaction time

Participants with low social anxiety exhibited significantly longer reaction times compared to those with high social anxiety, b = 138.85, t(52) = 2.31, p = .025. In contrast, the main effect of stress condition was not significant; participants in the stress condition did not differ significantly in reaction time from those in the control condition, b = -3.17, t(1299) = -0.22, p = .787. However, there was a significant interaction between stress condition and social anxiety group, b = -35.80, t(1299) = -2.10, p = .036. As shown in Fig. 7, the high social anxiety group responded faster in the stress condition. In the low social anxiety group, reaction times did not significantly differ between the stress and control conditions. Reaction time was not significantly associated with any of the covariates (all p’s > 0.05). See Table 3 for detailed statistics.

Table 3 Multiple regression results with reaction time as the dependent variable.

The effects of socio-evaluative stress on electrodermal activity and heart rate variability

Figure 8 illustrates the impact of the between-subject factor social anxiety, as well as the within-subject factor stress and control conditions, on electrodermal activity and heart rate variability.

Fig. 8
figure 8

Mean plot of electrodermal activity (A) and heart rate variability (RMSSD) (B) according to the two conditions (Stress/Control), as well as according to the anxiety groups (High/Low Social Anxiety).

Tonic electrodermal activity

Tonic electrodermal activity (EDA) was significantly higher in the stress condition compared to the control condition, b = 0.70, t(1337) = 9.73, p < .001. Social anxiety group was not significantly associated with tonic EDA, b = 0.04, t(1337) = 0.53, p = .559, and there was no significant interaction between stress condition and social anxiety group, b = -0.08, t(1337) = -0.73, p = .468. None of the covariates were significantly associated with tonic EDA (all p’s > 0.05). See Table 4 for full results.

Table 4 Multiple regression results with tonic EDA as the dependent variable.

Heart rate variability (RMSSD)

Root mean square of successive differences (RMSSD) was significantly lower in the stress condition compared to the control condition, b = -3.54, t(1286) = -4.79, p < .001. Social anxiety group was not significantly associated with RMSSD, b = -3.54, t(51) = -0.67, p = .505. There was also no significant interaction between stress condition and social anxiety group, b = 0.91, t(1286) = 0.85, p = .393. None of the covariates were significantly associated with RMSSD (all p’s > 0.05). See Table 5 for detailed results.

Table 5 Multiple regression results with tonic EDA as the dependent variable.

Exploratory analyses

In an exploratory analysis, we used the social anxiety disorder (SAD) classification as a categorical predictor, rather than relying on the high/low socially anxious group classification. Our study encompasses 13 participants diagnosed with SAD, and 44 individuals who did not meet the criteria. Using WM performance, reaction time and RMSSD as an outcome, analysis revealed the same pattern as described above, except for tonic EDA (see Supplementary Table S9−S12). Therefore, we present the findings of the analysis with the outcome variable tonic EDA.

Tonic electrodermal activity (EDA) was significantly higher in the stress condition compared to the control condition, b = 0.73, t(1337) = 12.23, p < .001. Individuals with social anxiety disorder (SAD) exhibited significantly higher overall tonic EDA than those without SAD, b = 0.20, t(1337) = 2.07, p = .038. There was a significant interaction between stress condition and SAD diagnosis, b = -0.25, t(1337) = -2.02, p = .043. As illustrated in Fig. 9, individuals without SAD displayed a steeper increase in tonic EDA during the stress condition than those with SAD. Additionally, the SAD group exhibited a higher level of tonic EDA in the control condition. None of the covariates were significantly associated with tonic EDA (all p’s > 0.05).

Fig. 9
figure 9

Mean plot of interaction between Stress Condition and Social Anxiety Disorder group. The x-axis shows the Control and Stress Condition. The y-axis is the z-standardized tonic EDA. The red line is the no SAD group, whereas the blue line shows the SAD group.

Discussion

Although impaired working memory performance under socio-evaluative stress may serve as a key cognitive mechanism underlying mental health difficulties such as social anxiety17, there is a lack of paradigms that can effectively induce social stress while simultaneously measuring working memory performance. This study sets out to test a newly developed socio-evaluative paradigm, called the Socio-evaluative N-back Task (SENT). We found that the stress condition compared to the control condition successfully increased psychophysiological stress responses and decreased WM accuracy. Contrary to our hypothesis, participants with low social anxiety did not exhibit reduced working memory accuracy or slower reaction times during the stress condition. Instead, they demonstrated faster reaction times in the control condition compared to the control group. Thus, individuals with high social anxiety consistently exhibit fast reaction times, regardless of exposure to socio-evaluative threat. In sum, the SENT can be used to investigate the effects of acute socio-evaluative stress on psychophysiological stress responses on working memory.

We found that the social evaluative stress condition resulted in reduced working memory (WM) accuracy compared to the control condition. These findings support the previously established link between psychosocial stress and working memory4,12,26. In addition, our study supports the previously found link between socio-evaluative stress and diminished parasympathetic nervous system activity, indicated by decreased heart rate variability46. In addition, our results suggests that socio-evaluative stress can activate the sympathetic nervous system, indexed by increased electrodermal activity34. In sum, during the socio-evaluative stress condition, compared to a control condition, all participants displayed decreased WM performance, as well as heightened physiological stress responses.

Contrary to our hypothesis, we found that individuals with high social anxiety consistently responded quickly and maintained memory accuracy, regardless of whether they were under social evaluation or not. Moreover, contrary to our hypothesis, individuals with social anxiety did not exhibit heightened physiological arousal in response to the socio-evaluative stress condition. This suggests that individuals with low social anxiety may not distinguish between control and stress conditions, as their responses to the control condition closely resemble those under social evaluation. Similarly, exploratory analyses indicate that individuals diagnosed with social anxiety disorder (SAD) exhibited heightened physiological arousal, as measured by electrodermal activity (EDA), even during the control task. This effect might lead to a “ceiling effect”, as people with social anxiety already showed a higher baseline physiological arousal, which led to a less steep increase of physiological reaction during the control phase. This finding is in line with previous findings, showing a higher baseline physiological stress reaction and flattened increase in people with social anxiety48,49.

As a result, the difference in arousal between the control and stress conditions was less pronounced compared to individuals without SAD. Over time, this lack of differentiation could place an additional burden on cognitive resources of socially anxious individuals, ultimately leading to reduced working memory performance, given that they are limited15. According to the “Yerkes–Dodson law”, performance increases with physiological or mental arousal, but only up to a point. When levels of arousal become too high, performance decreases50. The underlying reasons for this should be explored in future studies; however, one possibility is that individuals with social anxiety may be more motivated to please the experimenter, which could lead to heightened performance even during the control condition. Less differentiation between socio-evaluative threats from neutral contexts may, over time, contribute to cognitive strain and could explain the mixed evidence on WM impairments in individuals with high social anxiety18,24,51,52,53,54.

Furthermore, this inconsistency in findings could be attributed to laboratory setups that evaluate WM only after stress-induction tasks, potentially diminishing the effects of stress on working memory. In contrast to other socio-evaluative stress paradigms such as the Trier Social Stress Task with N-back12, ScanSTRESS30, or the Montreal Imaging Stress Task29, the Socio-evaluative N-back Task (SENT) enables the assessment of the impact of acute socio-evaluative stress on ongoing working memory (WM) performance. Moreover, this task cannot be solved when participants shift their attention away from the socio-evaluative situation. Finally, this task enables the collection of comparable working memory (WM) performance data, as the task’s difficulty remains constant throughout trials and across all participants.

Limitations and future studies

In this study, we primarily investigated individuals with subclinical social anxiety, only 43% of the highly socially anxious individuals were diagnosed with SAD. The findings of the exploratory analysis must be interpreted with caution due to marked group size differences. In particular, small sample sizes in physiological measures increase the risk of regression to the mean, which may artificially drive observed interactions55. To understand the impact of socio-evaluative stress on working memory in SAD, future research should aim to investigate these effects in a larger sample with individuals diagnosed with SAD.

Additionally, our use of an artificial laboratory setting might potentially limit the external validity of our findings. Moreover, we found that depressiveness decreased working memory accuracy. Prior research suggests that working memory impairment is a cognitive feature of major depressive disorder (MDD)6,56,57. Notably, major depressive disorder and social anxiety disorder commonly co-occur, with prevalence rates ranging from 15% to 33%58,59. To advance our understanding, future investigations should distinguish more precisely between the impacts of depression and anxiety on working memory impairments in SAD60,61.

Another limitation of this study is that we cannot fully distinguish between the effects of socio-evaluative stress and distraction caused by visual cues, such as facial stimuli. Therefore, it is possible that the observed task effects were driven less by stress per se and more by competing cognitive demands arising from emotional processing. Previous research suggests that negative facial expressions (e.g., fear, anger) are particularly disruptive to working memory, likely due to their threat-related salience62. In addition, this task includes multiple forms of negative evaluation, such as textual and verbal feedback, all of which are likely to influence physiological stress markers and task performance.

Future studies could disentangle these effects by systematically varying the presence of facial, textual, or no evaluative stimuli across experimental conditions, allowing for direct comparison of their individual impacts. Additionally, incorporating eye-tracking methods63 could help clarify the attentional mechanisms underlying responses to emotional cues during the SENT paradigm. A key strength of the SENT is its ability to assess how socio-evaluative stress influences both working memory and physiological responses in real time. This makes it a valuable tool for capturing dynamic, event-related changes as they unfold. Future studies could build on this by examining how specific task events elicit stress responses and whether participants show distinct learning curves or adaptation patterns throughout the task.

Conclusion

This study set out to test a newly developed laboratory socio-evaluative stress paradigm, called the Socio-evaluative N-back Task (SENT). We found that the socio-evaluative stress condition, compared to a control condition, increased psychophysiological stress response (i.e., increased tonic electrodermal activity and decreased heart rate variability), and decreased WM performance. In contrast to individuals with low social anxiety, highly socially anxious individuals show no difference in reaction time between the control and socio-evaluative stress conditions. This indicates that socially anxious individuals tend to not differentiate socio-evaluative threat from a neutral situation. In sum, we can conclude that the Socio-evaluative N-back Task (SENT) has the potential to induce socio-evaluative stress responses while simultaneously measuring working memory.