Early or late distractions hurt working memory differently depending on how long you look

Ren, Guofang; Liu, Ruyi; Guo, Lijing; Liu, Penglan; Nie, Dan; Chen, Jinru; Ye, Chaoxiong

doi:10.1038/s41598-025-18699-z

Download PDF

Article
Open access
Published: 09 October 2025

Early or late distractions hurt working memory differently depending on how long you look

Guofang Ren¹,
Ruyi Liu^2,3,
Lijing Guo^1,2,
Penglan Liu²,
Dan Nie²,
Jinru Chen³ &
…
Chaoxiong Ye^1,2

Scientific Reports volume 15, Article number: 35274 (2025) Cite this article

209 Accesses
Metrics details

Subjects

Abstract

Visual Working Memory (VWM) is essential for temporarily retaining goal-relevant visual information, yet its limited capacity renders it vulnerable to distraction. While previous studies have examined the effects of distractors occurring during encoding or maintenance, it remains unclear how stimulus presentation duration modulates these effects and whether different types or quantities of distractors exert similar costs. Across three experiments, we systematically investigated how distraction timing, distractor content, and distraction load interact with encoding duration to influence VWM performance. In Experiment 1, participants performed continuous recall and change detection tasks under four distraction conditions (no-, encoding-, delay-, and full-distraction) and two encoding durations (short: 200 ms; long: 1000 ms). Encoding-stage distractions impaired performance only in the short-duration, high-precision task, whereas delay-stage distractions consistently disrupted memory regardless of duration or task type. Experiment 2 manipulated distractor-target similarity (same-category vs. different-category distractors) and revealed that homogeneous distractors exerted stronger disruption, particularly when presented during the delay period. In contrast, heterogeneous distractors could be effectively suppressed when sufficient encoding time allowed for robust consolidation. Experiment 3 examined perceptual load (low vs. high) and showed that increasing distractor quantity did not amplify interference, suggesting that once memory consolidation is complete, delay-stage distractions disrupt VWM representations regardless of distractor load. Together, these findings reveal an asymmetry in how VWM handles distractions at different processing stages. While extended encoding supports resistance to early distraction, maintenance-stage distractions exert persistent effects—especially when distractors are similar to targets. Our results highlight the importance of presentation duration and distractor similarity in shaping VWM robustness, and suggest that distinct cognitive mechanisms may underlie suppression at encoding and maintenance stages.

All identical objects reduce memory load at the late maintenance stage in working memory

Article Open access 14 May 2025

Learning by task repetition enhances object individuation and memorization in the elderly

Article Open access 17 November 2020

Decoding the impact of aging on the interaction between visual attention and working memory

Article Open access 28 May 2025

Introduction

Visual working memory (VWM) is essential for everyday cognitive processes, as it allows us to temporarily hold and manipulate visual information. This system is crucial for activities like reading, navigation, decision-making, and even social interactions, as it helps us remember and process visual information about our environment¹. One of the critical aspects of VWM is its limited capacity—typically only 3–4 items can be stored simultaneously^2,3,4. Although strategies such as grouping and attention resource allocation can improve VWM performance to some extent^5––12, the fundamental capacity limit appears unavoidable. This limited storage space means that the brain must prioritize important information and ignore irrelevant details to maximize efficiency¹³. This is where the mechanism of distractor filtering in VWM becomes essential. Filtering out distractions allows the brain to focus only on relevant stimuli, enhancing VWM’s efficiency and enabling us to concentrate on the visual information that is most pertinent to the task at hand. Thus, there is a growing body of research investigating how individuals process distractor information and its influence on VWM mechanisms^14,15,16,17.

In traditional research of VWM, a typical experimental paradigm involves first presenting participants with a memory array containing several visual stimuli, which they are instructed to remember. After the stimuli disappear, there is a blank interval delay during which participants are required to maintain the memory targets in VWM. Following this delay, a test array appears on the screen, and participants respond based on the information held in VWM¹⁸. Previous research on distractor filtering within VWM can be categorized according to the stage at which distractor stimuli appear: some studies present distractors concurrently with the memory array (encoding-stage distraction)^{13,19,20,21,22}, while others introduce distractors only after the memory array has disappeared, during the delay interval (delay-stage distraction)^16,23. In the encoding-stage distraction paradigm, distractor processing occurs simultaneously with the encoding of memory targets. We refer to this condition as “encoding-distraction.” Conversely, in delay-stage distraction studies, distractor processing occurs after the stimuli have disappeared and during the blank interval delay, which we term this condition “delay-distraction.”

Research on encoding-distraction often uses event-related potentials (ERPs) to investigate how participants manage distractors during VWM encoding. The contralateral delay activity (CDA) ERP component^{24,25,26,27,28,29,30,31,32}, which reflects the VWM load, has been widely used to examine the relationship between distraction resistance and VWM capacity. Vogel, et al.¹³, for instance, demonstrated that individuals with lower VWM capacity tend to encode simple distractors (e.g., color or orientation), while those with higher capacity more effectively ignore these distractors, suggesting a link between VWM capacity and distractor resistance during encoding. Thus, the degree of disruption from encoding-stage distractors appears to correlate with individual VWM capacity.

Substantial evidence has also emerged from studies focusing on delay-distraction. For instance, Hakim, et al.²³ conducted a change detection task in which participants memorized six simple stimuli, with distractors presented during the delay period—after the memory array and before the test array. Their findings showed reduced task performance under delay-stage distraction, emphasizing that distractions during this stage can significantly impair VWM performance.

However, the effects of encoding- versus delay-stage distractors on VWM may differ markedly. Duan, et al.³³, for example, conducted a systematic investigation examining individual resilience against distractors at both stages. Using a continuous recall task, they assessed the effects of distractors presented during encoding versus delay on the recall of simple stimuli (e.g., teardrop orientations). Their findings indicated that VWM performance was significantly impaired only by delay-stage distractors, with encoding-stage distractions not adversely impacting performance. In our recent study³⁴, we extended this by using facial stimuli in a change detection task, presenting neutral face distractors either during encoding or delay stages, to examine the impact of complex distractors at each stage on VWM processing. Results similarly showed significant impairment from delay-stage distractors but not encoding-stage distractors. Thus, these findings suggest that stage-specific mechanisms underlie distractor influence on VWM maintenance, with delay-stage distractors exerting a significant impact on performance.

Additionally, our previous research suggests that the length of stimulus presentation influences the representational state of VWM³⁵. By manipulating presentation duration of memory stimuli, it is possible to place VWM consolidation at different temporal stages. VWM consolidation can be broadly divided into early and late stages, each relying on distinct mechanisms for allocating memory resources to VWM representations^36,37,38,39. In a recent ERP study on encoding-stage distraction²², we investigated distractor suppression by analyzing the distractor-induced ERP component (PD)^40,41. By manipulating the presentation duration of target and distractor stimuli, we examined differences in how participants process encoding-distractors across different VWM consolidation stages. Results indicated that with sufficient time to consolidate target stimuli, participants could more effectively suppress distractors, suggesting that distractor filtering may depend on the presentation duration, at least in encoding-distraction contexts.

Notably, previous encoding-distraction studies typically used brief stimulus presentations (e.g., 100–200 ms)^13,20,42. In contrast, both Duan, et al.³³ and Ye, et al.³⁴’s studies used longer presentation duration (1000 ms) for target and distractor stimuli. This experimental setup variation may account for conflicting findings in prior encoding-distraction research, where some evidence suggests that encoding-distractors impair VWM performance²⁰— a result not replicated in recent studies^33,34. However, no research has yet directly manipulated presentation duration to examine its effect on encoding- and delay-distraction processing within VWM.

This study aims to examine how the presentation duration of stimuli affects the impact of distractors presented at different stages of VWM processing. We manipulate the presentation duration for memory and distractor stimuli and investigate how variations in duration of memory targets and distractors influence VWM performance when distractions occur during the encoding versus the delay stage. This approach enables us to explore whether the length of stimulus consolidation affects the mechanisms by which individuals filter distractors at different VWM processing stages. We hypothesize two potential outcomes. First, if presentation duration indeed modulates the distractor filtering mechanism, then the extent of VWM performance impairment caused by distractors should vary according to duration. Specifically, we expect delay-stage distractors to substantially impair VWM performance regardless of presentation duration. In contrast, encoding-stage distractors should only impair performance at shorter duration, where consolidation is insufficient. With longer duration allowing for full consolidation, encoding-stage distractors are less likely to impact VWM performance. Alternatively, if presentation duration does not significantly modulate distractor filtering, the qualitative difference between encoding- and delay-stage distractors on VWM performance should remain consistent across presentation duration.

Additionally, the studies by Duan, et al.³³ and Ye, et al.³⁴ varied not only in visual stimuli but also in task type. Duan, et al.³³ used a continuous recall task requiring participants to memorize the orientations of three targets and then recall one target’s orientation angle accurately at test, thus demanding high memory precision for each item. In contrast, Ye, et al.³⁴ used a change detection task that required participants to determine if the test array exactly matched the memory array, allowing for successful task completion even with lower memory precision for each item. Although both studies found consistent results, each used a longer presentation duration (1000 ms), leaving it unclear whether the same stage-specific distractor effects would hold under shorter presentation duration across tasks with different precision demands. Therefore, in Experiment 1, participants complete both a continuous recall task and a change detection task to examine how memory precision requirements interact with presentation duration to influence distractor effects across different VWM stages.

Experiment 1

To examine whether stimulus presentation duration modulates individual processing of encoding-distraction and delay-distraction, participants completed a continuous recall task and a change detection task. In both tasks, participants were instructed to memorize three target orientations while ignoring the potential presence of three orientation distractors. We manipulated the factors of distractor presentation condition and presentation duration of stimuli. For the distractor presentation condition, four different distractor conditions were included: a no-distraction condition, an encoding-distraction condition, a full-distraction condition, and a delay-distraction condition. In the no-distraction condition, no distractors appeared. In the encoding-distraction condition, three distractors were presented alongside the targets in the memory array and disappeared simultaneously with the targets at the end of the encoding stage. In the full-distraction condition, three distractors appeared with the targets during the memory array presentation; however, unlike the encoding-distraction condition, the distractors persisted after the encoding stage until the test array appeared. In the delay-distraction condition, no distractors were present during the memory array, but three distractors appeared during the delay stage after the memory array disappeared. This experimental setup allowed us to compare VWM performance under different distraction conditions (during either the encoding stage, the delay stage, or both stages) against a no-distraction baseline. If the presence of distractors at a specific stage induced a significant distraction effect, we expected VWM performance in that condition to be significantly worse than in the no-distraction condition. Additionally, for the presetnation duration manipulation, we selected a short presentation duration of 200 ms, consistent with previous encoding-distraction research²⁰, and a long presentation duration of 1000 ms, as used in the studies by Duan, et al.³³ and Ye, et al.³⁴. It is worth noting that in these previous studies, the duration of distractor presentation was typically matched to the duration of the memory array, regardless of whether distraction occurred during encoding or delay stage. For instance, when Duan, et al.³³ and Ye, et al.³⁴ presented distractors only during the delay period, the distractors remained on-screen for the same 1000 ms duration as the memory array. To maintain consistency with this approach, we also adjusted the delay-stage distractor duration to match the memory array duration in each condition. Specifically, in the short presentation condition, delay distractors were shown for 200 ms; in the long presentation condition, they were presented for 1000 ms. This ensured that the durations of distraction during encoding and delay were equivalent within each level of presentation duration.

Methods

Participants

To ensure sufficient statistical power for the t-test comparisons, we conducted an a priori power analysis using G*Power 3.1.9.2. This analysis was informed by the expected effect size based on the study by Duan, et al.³³. Assuming a large effect size (Cohen’s d = 0.80) for our design, with a power of 80% and an alpha level of 0.05, the analysis indicated a minimum required sample size of 15 participants.

Our study adhered to the principles of the Declaration of Helsinki and received ethics approval from the Ethics Committee of Sichuan Normal University. Thirty-one college students participated in the study in exchange for compensation. However, one participant was excluded due to a program crash during the task, and two additional participants were removed due to accuracy in the change detection task below chance level (0.5), resulting in a final sample of 28 valid participants (2 males and 26 females; mean age = 20.29 years, SD = 1.212, age range 19–23 years) included in the data analyses. This sample size closely aligns with that used in the studies by Duan, et al.³³ (N = 24) and Ye, et al.³⁴ (N = 26). All participants reported normal or corrected-to-normal vision, normal color vision, and no history of neurological conditions. Written informed consent was obtained from each participant prior to the study.

Materials

We used arrows as stimuli (1.0° × 0.5° visual angle) for both targets and distractors. Each arrow’s orientation was randomly selected between 0° and 359°, with at least a 30° orientation difference between any two arrows to prevent overlap or similar orientations. Targets and distractors were distinguished by color (red [RGB: 255, 0, 0] or blue [RGB: 0, 0, 255]). Stimuli appeared on a gray (RGB: 128, 128, 128) background, with arrows distributed within an invisible rectangle (4.0° × 6.0°), ensuring a minimum of 1.6° spacing between any two arrows. The experiment was programmed using E-Prime software (E-prime 2.0, Psychology Software Tools, Inc.), and participants were seated 70 cm from a 17-inch screen in a dark, soundproof room.

Procedure

To examine the effect of target encoding time on distraction processing, we manipulated two factors: target presentation duration (short and long) and the type of distraction (no-distraction, encoding-distraction, full-distraction, and delay-distraction). Each participant completed both a continuous recall task and a change detection task, always performing the continuous recall task first.

The trial structure of Experiment 1 is shown in Fig. 1. In both tasks, participants memorized three target arrows in a memory array while ignoring distractor arrows. Target and distractor colors (red or blue) were assigned and counterbalanced across participants. In the baseline condition (no-distraction), each trial began with a fixation cross (1.5° × 1.5°) presented for 300–500 ms, followed by three arrows with varying orientations displayed for either 200 ms or 1000 ms. Participants were asked to remember these orientations. After a blank delay (1200 ms blank for the short presentation duration condition or 2000 ms blank for the long presentation duration condition), a test array was presented. In the continuous recall task, the test array contained a single arrow pointing vertically upward (always presented at 0° orientation) at one of the original target locations. Participants were instructed to adjust the arrow’s orientation using the computer mouse to match the orientation of the corresponding target from the memory array. After each trial, participants received feedback on the orientation offset (difference from the target). The next trial began 400–600 ms after participants acknowledged the feedback. In the change detection task, this test array contained one arrow at one of the original target positions. In half the trials, the test arrow’s orientation differed by 30–60° from the target’s, while in the remaining trials, it matched the target exactly. Participants indicated whether the test arrow’s orientation matched the target’s. The next trial began 100 ms after participants responsed.

In the encoding-distraction condition, the procedure was identical to the no-distraction condition, except that, during the memory array presentation, three distractor arrows appeared alongside the targets in a different color. Participants were instructed to remember only the target arrows (red for half the participants, blue for the other half). In the short presentation condition, distractor arrows appeared for 200 ms alongside the target arrows, while in the long presentation condition, distractors appeared for 1000 ms and disappeared at the same time as the targets.

In the full-distraction condition, the setup was similar to the encoding-distraction condition, except that the distractors remained visible during the delay period after the targets disappeared, up until the test array presentation. For the short presentation condition, distractors remained for an additional 1200 ms after the memory array; for the long presentation condition, they remained for an additional 2000 ms.

In the delay-distraction condition, the setup was similar to the no-distraction condition except during the delay period. Here, after the memory array disappeared, a 500 ms blank interval was followed by three distractor arrows appearing in new positions for 200 ms (short presentation condition) or 1000 ms (long presentation condition). A second 500 ms blank interval then preceded the test array.

Both tasks used a within-subject design with factors for presentation duration (short and long) and distraction condition (no-distraction, encoding-distraction, full-distraction, delay-distraction). The continuous recall task included 100 trials per condition, totaling 800 trials, while the change detection task comprised 48 trials per condition, totaling 384 trials. Participants took a 5-minute break between the two tasks. The entire experiment lasted approximately 90 min, and each task included 18 practice trials to ensure participants understood the procedure.

Data analysis

We analyzed the results separately for the continuous recall task and the change detection task. For the continuous recall task, memory performance was indexed by the absolute angular error between the reported and actual orientation of the target, referred to as the offset. Larger offsets indicated poorer memory performance. For the change detection task, the primary dependent measure was response accuracy (ACC) for each condition.

For both tasks, we conducted a two-way repeated-measures ANOVA with presentation duration (short vs. long) and distraction condition (no-distraction, encoding-distraction, full-distraction, delay-distraction) as within-subject factors.

To assess the effects of distraction, we performed planned pairwise t-tests comparing each distraction condition with the no-distraction baseline. Additionally, we compared performance between short and long duration within each distraction condition to examine how encoding time modulated distraction effects.

We also calculated distraction costs by computing the performance difference between each distraction condition and the no-distraction baseline. Specifically:

For the continuous recall task (offset):

$$\:{\text{O}\text{f}\text{f}\text{s}\text{e}\text{t}}_{\text{E}\text{n}\text{c}\text{o}\text{d}\text{i}\text{n}\text{g-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}\:\text{c}\text{o}\text{s}\text{t}}={\text{O}\text{f}\text{f}\text{s}\text{e}\text{t}}_{\text{E}\text{n}\text{c}\text{o}\text{d}\text{i}\text{n}\text{g-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}} - \:{\text{O}\text{f}\text{f}\text{s}\text{e}\text{t}}_{\text{N}\text{o-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}}$$

$$\:{\text{O}\text{f}\text{f}\text{s}\text{e}\text{t}}_{\text{F}\text{u}\text{l}\text{l-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}\:\text{c}\text{o}\text{s}\text{t}}={\text{O}\text{f}\text{f}\text{s}\text{e}\text{t}}_{\text{F}\text{u}\text{l}\text{l-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}}-\:{\text{O}\text{f}\text{f}\text{s}\text{e}\text{t}}_{\text{N}\text{o-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}}$$

$$\:{\text{O}\text{f}\text{f}\text{s}\text{e}\text{t}}_{\text{D}\text{e}\text{l}\text{a}\text{y-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}\:\text{c}\text{o}\text{s}\text{t}}={\text{O}\text{f}\text{f}\text{s}\text{e}\text{t}}_{\text{D}\text{e}\text{l}\text{a}\text{y-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}}-\:{\text{O}\text{f}\text{f}\text{s}\text{e}\text{t}}_{\text{N}\text{o-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}}$$

For the change detection task (ACC):

$$\:{\text{A}\text{C}\text{C}}_{\text{E}\text{n}\text{c}\text{o}\text{d}\text{i}\text{n}\text{g-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}\:\text{c}\text{o}\text{s}\text{t}}={\text{A}\text{C}\text{C}}_{\text{N}\text{o-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}}-\:{\text{A}\text{C}\text{C}}_{\text{E}\text{n}\text{c}\text{o}\text{d}\text{i}\text{n}\text{g-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}}$$

$$\:{\text{A}\text{C}\text{C}}_{\text{F}\text{u}\text{l}\text{l-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}\:\text{c}\text{o}\text{s}\text{t}}={\text{A}\text{C}\text{C}}_{\text{N}\text{o-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}}-\:{\text{A}\text{C}\text{C}}_{\text{F}\text{u}\text{l}\text{l-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}}$$

$$\:{\text{A}\text{C}\text{C}}_{\text{D}\text{e}\text{l}\text{a}\text{y-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}\:\text{c}\text{o}\text{s}\text{t}}={\text{A}\text{C}\text{C}}_{\text{N}\text{o-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}}-\:{\text{A}\text{C}\text{C}}_{\text{D}\text{e}\text{l}\text{a}\text{y-}\text{d}\text{i}\text{s}\text{t}\text{r}\text{a}\text{c}\text{t}\text{i}\text{o}\text{n}}$$

A positive distraction cost—whether in offset or accuracy—indicates performance impairment caused by the distractor, with larger values reflecting greater disruption.

To further explore the pattern of distraction costs, we performed an additional two-way repeated-measures ANOVA with presentation duration (short vs. long) and cost type (encoding, full, delay) as within-subject factors. Planned comparisons were also conducted to examine differences in distraction cost across conditions and presentation duration.

Effect sizes were reported as partial eta squared (η²_p) for ANOVAs and Cohen’s d for t-tests. In addition, we conducted Bayes factor analyses to quantify the strength of evidence for the alternative versus the null hypothesis⁴³. The Bayes factor (BF₁₀) provides an odds ratio for the likelihood of the alternative versus the null hypothesis, where values < 1 favor the null hypothesis, and values > 1 favor the alternative hypothesis. For example, a BF₁₀ of 0.25 would suggest the null hypothesis is four times more likely than the alternative.

Results

Continuous recall task

Offset

The mean offset for each distraction condition (no-distraction vs. encoding-distraction vs. full-distraction vs. delay-distraction) under short or long presentation duration is presented in Fig. 2a. The ANOVA on offset revealed a significant main effect of presentation duration, F (1,27) = 51.659, p < 0.001, η²_p = 0.657, and a significant main effect of the distraction condition, F (3,81) = 12.754, p < 0.001, η²_p = 0.321. However, no significant interaction on offset was found between the presentation duration and distraction condition, F (3,81) = 1.520, p = 0.220, η²_p = 0.053.

Planned pairwise comparisons revealed that, under the short presentation duration, the offset in the no-distraction condition was significantly lower than in the encoding-distraction condition, t(27) = 2.529, p = 0.018, Cohen’s d = 0.478, BF₁₀ = 2.871; the full-distraction condition, t(27) = 2.644, p = 0.013, Cohen’s d = 0.500, BF₁₀ = 13.587; and the delay-distraction condition, t(27) = 5.719, p < 0.001, Cohen’s d = 1.081, BF₁₀ > 1000. These results indicate that all forms of distraction, regardless of when they occurred, impaired VWM performance when encoding time was limited—with delay-stage distractors causing the greatest disruption. However, under the long presentation duration, no significant difference in offset was observed between the no-distraction and encoding-distraction conditions, t(27) = 0.650, p = 0.521, Cohen’s d = 0.123, BF₁₀ = 0.243. In contrast, the offset in the no-distraction condition was significantly lower than in the full-distraction condition, t(27) = 2.648, p = 0.013, Cohen’s d = 0.500, BF₁₀ = 3.610, and the delay-distraction condition, t(27) = 4.730, p < 0.001, Cohen’s d = 0.894, BF₁₀ = 399.497. This pattern suggests that when encoding time was sufficient, participants were able to resist encoding-stage distraction, but full and especially delay-stage distractions continued to impair VWM performance.

Additionally, the offset for the long presentation duration was significantly lower than that for the short presentation duration across all conditions: no-distraction, t(27) = 3.861, p < 0.001, Cohen’s d = 0.730, BF₁₀ = 50.12; encoding-distraction, t(27) = 5.448, p < 0.001, Cohen’s d = 1.029, BF₁₀ > 1000; full-distraction, t(27) = 4.436, p < 0.001, Cohen’s d = 0.838, BF₁₀ = 196.34; and delay-distraction, t(27) = 6.624, p < 0.001, Cohen’s d = 1.252, BF₁₀ > 1000. These results confirm that longer encoding time generally enhances memory performance.

Distraction cost (offset)

The mean distraction cost (offset) for each distraction condition (encoding-distraction, full-distraction, and delay-distraction) under short and long presentation durations is shown in Fig. 2b. A two-way repeated-measures ANOVA on distraction cost revealed a significant main effect of distraction type, F (2,54) = 20.168, p < 0.001, η²_p = 0.428. The distraction cost was significantly smaller in the encoding-distraction condition (M = 1.677, SD = 3.697) compared to the delay-distraction condition (M = 4.884, SD = 3.823), t(27) = 3.554, p < 0.001, Cohen’s d = 0.672, BF₁₀ = 24.794. Similarly, the distraction cost in the full-distraction condition (M = 3.008, SD = 4.437) was also significantly smaller than in the delay-distraction condition, t(27) = 2.048, p = 0.05, Cohen’s d = 0.387, BF₁₀ = 1.221. However, no significant difference was found between the encoding- and full-distraction conditions, t(27) = 1.644, p = 0.112, Cohen’s d = 0.311, BF₁₀ = 0.661. There was no main effect of presentation duration, F (1,27) = 0.236, p = 0.631, η²_p = 0.009, nor a significant interaction between presentation duration and distraction type, F (2,54) = 1.617, p = 0.208, η²_p = 0.057.

Planned pairwise comparisons revealed that under short presentation duration, participants showed significantly larger distraction costs in the delay-distraction condition compared to the encoding-distraction condition, t(27) = 2.54, p = 0.017, Cohen’s d = 0.480, BF₁₀ = 2.927 and the full-distraction condition, t(27) = 2.41, p = 0.023, Cohen’s d = 0.455, BF₁₀ = 2.292. The encoding- and full-distraction conditions did not significantly differ from each other, t(27) = 0.37, p = 0.713, Cohen’s d = 0.070, BF₁₀ = 0.214.

Under long presentation duration, a different pattern emerged. Distraction cost in the encoding-distraction condition was significantly smaller than in both the full-distraction condition, t(27) = 2.73, p = 0.011, Cohen’s d = 0.516, BF₁₀ = 4.233 and the delay-distraction condition, t(27) = 2.97, p = 0.006, Cohen’s d = 0.561, BF₁₀ = 6.902. However, no significant difference was observed between the full- and delay-distraction conditions, t(27) = 0.81, p = 0.424, Cohen’s d = 0.153, BF₁₀ = 0.271. This result suggests that sufficient encoding time enabled participants to better resist early distractors but did not fully protect against interference from later ones.

In addition, we compared distraction costs across presentation durations within each condition. None of these comparisons reached significance: encoding-distraction, t(27) = 1.53, p = 0.138, Cohen’s d = 0.289, BF₁₀ = 0.566; full-distraction, t(27) = 0.28, p = 0.782, Cohen’s d = 0.053, BF₁₀ = 0.208; delay-distraction, t(27) = 2.04, p = 0.052, Cohen’s d = 0.385, BF₁₀ = 1.198. These results overall suggest that presentation duration had limited influence on the size of distraction effects.

Change detection task

Accuracy

The mean accuracy for each distraction condition (no-distraction vs. encoding-distraction vs. full-distraction vs. delay-distraction) under short or long presentation duration is presented in Fig. 3a. The ANOVA revealed a significant main effect of presentation duration, F (1,27) = 9.479, p = 0.005, η²_p = 0.260, and a significant main effect of the distraction condition, F (3,81) = 16.969, p < 0.001, η²_p = 0.386. However, no significant interaction was found between the presentation duration and distraction condition, F (3,81) = 1.101, p = 0.352, η²_p = 0.039.

Planned pairwise comparisons indicated that, under the short presentation duration, accuracy in the no-distraction condition was significantly higher than in the full-distraction condition, t(27) = 2.830, p = 0.009, Cohen’s d = 0.535, BF₁₀ = 5.194, and the delay-distraction condition, t(27) = 3.845, p < 0.001, Cohen’s d = 0.727, BF₁₀ = 48.281. However, no significant difference in accuracy was observed between the no-distraction and encoding-distraction conditions, t(27) = 0.707, p = 0.486, Cohen’s d = 0.134, BF₁₀ = 0.252. This suggests that brief encoding-stage distractors did not impair performance, whereas later or sustained distractors did reduce accuracy under limited exposure. Under the long presentation duration, accuracy in the no-distraction condition remained significantly higher than in the delay-distraction condition, t(27) = 43.570, p < 0.001, Cohen’s d = 0.675, BF₁₀ = 25.659, but showed no significant differences with the encoding-distraction condition, t(27) = 0.134, p = 0.894, Cohen’s d = 0.025, BF₁₀ = 0.202, or the full-distraction condition, t(27) = 1.430, p = 0.164, Cohen’s d = 0.270, BF₁₀ = 0.499. This pattern indicates that with sufficient encoding time, participants were largely resilient to distraction occurring during or immediately following encoding, but still vulnerable to distraction during the delay period.

Additionally, accuracy for the long presentation duration was significantly higher than that for the short presentation duration only in the full-distraction condition, t(27) = 3.491, p = 0.002, Cohen’s d = 0.660, BF₁₀ = 21.486. In contrast, no significant differences in accuracy were found between long and short presentation durations in the no-distraction condition, t(27) = 1.043, p = 0.306, Cohen’s d = 0.197, BF₁₀ = 0.328; encoding-distraction condition, t(27) = 0.699, p = 0.491, Cohen’s d = 0.132, BF₁₀ = 0.251; or delay-distraction condition, t(27) = 1.300, p = 0.205, Cohen’s d = 0.246, BF₁₀ = 0.428.

Distraction cost (ACC)

The mean distraction cost (ACC) for each distraction condition (encoding-distraction, full-distraction, and delay-distraction) under short and long presentation durations is shown in Fig. 3b. The ANOVA on distraction cost (ACC) revealed a significant main effect of the distraction condition, F (2,54) = 6.747, p = 0.003, η²_p = 0.200. The distraction cost in the encoding-distraction condition (M = -0.006, SD = 0.060) was significantly lower than that in the full-distraction condition (0.03482 ± 0.05753), t(27) = 3.811, p < 0.001, Cohen’s d = 0.72, BF₁₀ = 44.62, and also significantly lower than that in the delay-distraction condition (M = 0.063, SD = 0.060), t(27) = 6.766, p < 0.001, Cohen’s d = 1.279, BF₁₀ > 1000. Moreover, the distraction cost in the full-distraction condition was significantly lower than in the delay-distraction condition, t(27) = 2.384, p = 0.024, Cohen’s d = 0.45, BF₁₀ = 2.189. However, no significant main effect of the presentation duration, F (1,27) = 2.170, p = 0.152, η²_p = 0.074., and no significant interation was found between the presentation duration and distraction condition, F (2,54) = 1.198, p = 0.309, η²_p = 0.042.

To better understand these effects, planned comparisons were conducted separately for each presentation duration condition. Under short presentation duration, the distraction cost in the encoding-distraction condition was significantly lower than in the full-distraction condition, t(27) = 3.85, p = 0.001, Cohen’s d = 0.727, BF₁₀ = 48.539, and was also significantly lower than in the delay-distraction condition, t(27) = 6.22, p < 0.001, Cohen’s d = 1.176, BF₁₀ > 1000. However, the difference between the full- and delay-distraction conditions did not reach significance, t(27) = 0.99, p = 0.331, Cohen’s d = 0.187, BF₁₀ = 0.313, suggesting that delay-stage distractors were not reliably more disruptive than full-interval distractors when encoding time was limited.

Under long presentation duration, the distraction cost in the delay-distraction condition remained significantly greater than in the encoding-distraction condition, t(27) = 3.62, p = 0.001, Cohen’s d = 0.684, BF₁₀ = 28.626, and also exceeded that of the full-distraction condition, t(27) = 2.46, p = 0.020, Cohen’s d = 0.466, BF₁₀ = 2.537. In contrast, the encoding- and full-distraction conditions did not significantly differ, t(27) = 1.53, p = 0.138, Cohen’s d = 0.289, BF₁₀ = 0.566, indicating that when stimulus presentation was extended, full-interval distraction no longer produced reliably more impairment than encoding-only distraction.

In addition, comparisons across presentation durations revealed no significant differences in distraction cost between short and long exposure durations for any of the three conditions: encoding-distraction, t(27) = 0.40, p = 0.694, Cohen’s d = 0.075, BF₁₀ = 0.216; full-distraction, t(27) = 1.32, p = 0.198, Cohen’s d = 0.249, BF₁₀ = 0.438; and delay-distraction, t(27) = 0.24, p = 0.812, Cohen’s d = 0.045, BF₁₀ = 0.206. This suggests that prolonging stimulus exposure did not systematically reduce distraction-related performance costs in any specific condition.

Discussion

In Experiment 1, participants performed both a continuous recall task and a change detection task. The continuous recall task required participants to recall the orientation of specified targets with high precision, making it a task that demands high visual VWM precision for optimal performance. In contrast, the change detection task only required participants to have a low-precision memory of target items to determine if a change had occurred, resulting in lower VWM precision demands. Accordingly, the results of Experiment 1 allowed us to observe whether different stages of distractor presentation impaired VWM performance under conditions requiring high versus low memory precision.

In the continuous recall task, the offset results indicate that any form of distraction, regardless of when it occurred, impaired VWM performance under the short presentation duration condition. This suggests that when VWM precision demands are high and encoding time is limited, both encoding-stage and delay-stage distractors disrupt memory performance. The presence of encoding-distraction effects aligns with previous findings showing that distractions during encoding can impair VWM performance²⁰. Moreover, we observed that delay-stage distraction caused significantly greater impairment than both encoding- and full-distraction conditions. However, with longer stimulus presentation duration, only delay-distraction effects were evident, while encoding- stage distraction effects were not observed, consistent with previous results from the study by Duan, et al.³³ which used a continuous recall task. Additionally, both full- and delay-distraction conditions led to significantly greater impairment than the encoding-distraction condition. Taken together, these results suggest that in tasks with high precision demands, delay-stage distractors consistently impair VWM and produce the greatest performance cost, while the effect of encoding-stage distraction depends on presentation duration—emerging only under shorter exposure durations.

In contrast, the accuracy results for the change detection task suggest that during short presentation duration, encoding-only distractions did not impair VWM, whereas both full- and delay-distractions did. With longer exposures, only delay distractions continued to impair VWM performance. This indicates that in tasks with lower memory precision demands, delay-stage distractors consistently impair VWM performance. During shorter stimulus presentations, individuals can only effectively resist the impact of encoding-stage distractions, while during longer presentations, they can resist both encoding-stage and continuously present distractions. These findings are consistent with those reported by Ye, et al.³⁴, which also used a change detection task. Across both presentation duration conditions, delay-stage distraction caused the most pronounced impairment in memory performance, mirroring the pattern observed in the continuous recall task.

The differing result patterns between the continuous recall and change detection tasks highlight that task demands on memory precision influence an individual’s ability to resist distractor effects. For high-precision memory tasks, filtering distractions may be more challenging. When high memory precision is required, distractions presented during shorter stimulus presentation duration in the encoding stage may be more difficult to filter, while during longer presentation duration, distractions appearing both in the encoding and delay stages may become increasingly challenging to resist.

Additionally, our results demonstrate that for high-precision memory tasks, longer exposure presentation duration enhance VWM performance. In contrast, for low-precision memory tasks, we did not observe an overall improvement in VWM performance with longer exposure presentation duration, except for a significant increase in accuracy under the full-distraction condition compared to shorter presentation duration. This suggests that the presentation duration of stimulus encoding is critical for forming high-precision memory representations, consistent with previous findings using continuous recall tasks that demonstrate VWM performance improvements with extended encoding time⁴⁴. This finding also aligns with our previously proposed two-phase model of VWM resource allocation, which posits that forming VWM representations involves early and late consolidation stages^36,37,38. During early consolidation, individuals form low-precision representations of as many target items as possible. Only after sufficient encoding time can individuals complete early consolidation and enter the late consolidation stage, where high-precision representations are formed as needed by the task. Consequently, extending stimulus encoding time significantly enhances the formation of high-precision representations, whereas for tasks requiring lower memory precision, prolonged exposure has a negligible impact on VWM performance.

Thus, in Experiment 1, by controlling stimulus presentation duration, we observed results that appeared to support seemingly contradictory findings from previous encoding-distraction^13,19,20,21 and the studies by Duan, et al.³³ and Ye, et al.³⁴. This underscores the crucial role of stimulus presentation duration in determining whether individuals can effectively resist distractor interference.

Furthermore, it is noteworthy that previous studies by Duan, et al.³³ and Ye, et al.³⁴ found that individuals could relatively easily resist full-distraction, similar to their resistance to encoding-distraction. In their studies, the presentation duration of full-distraction was twice as long as the target presentation. However, in our study, we observed evidence that full-distraction impairs VWM performance, especially during shorter presentation duration. This may be due to the fact that in our study, under short presentation duration, the presentation duration of full-distraction was seven times longer than that of target presentation, whereas under long presentation duration, it was only three times longer. This difference provided participants with relatively longer exposure to distractions under the short presentation condition, increasing susceptibility to impairment from full-distraction. Therefore, the prolonged presence of distractors in our full-distraction condition may account for the inability of some participants to effectively suppress full-distraction.

Although existing findings and our current results consistently support the notion that delay-stage distractors significantly impair VWM performance, these studies often involve distractors that are of the same type as the targets^33,34. For example, in Experiment 1, in the encoding-distraction condition, participants needed to remember three orientations, they could compare targets and distractors to identify and suppress the irrelevant items during the encoding stage. However, in the delay-distraction condition, when the target orientations disappeared and only distractors remained, participants may have automatically consolidated the new distractor orientations during the delay stage, creating competing VWM representations that impaired performance. Therefore, if distractors are of a different type than the targets (e.g., faces instead of orientations), participants may more effectively filter these heterogeneous distractors, even during the delay stage. However, no previous delay-distraction studies have examined the influence of distractor-target similarity on the delay-distraction effect.

In Experiment 2, we will further explore how stimulus presentation duration affects distractor interference at different stages, while also controlling for the similarity between distractor and target stimuli. This will allow for a more detailed examination of how individuals process and suppress distractors at different stages and how stimulus presentation duration modulates these processes.

Experiment 2

To further investigate whether the similarity between distractor and target stimuli modulates the effect of presentation duration on VWM performance under different distraction conditions, participants completed a change detection task similar to Experiment 1. Since our research focus was on the impact of stimulus presentation duration on distractors appearing at different stages, we aimed to minimize the influence of presentation duration on the maintenance of VWM representations. Thus, we selected a change detection task for Experiment 2, which is less affected by stimulus duration in terms of VWM performance.

We controlled the duration of stimulus presentation duration and retained the three distraction conditions from Experiment 1: no-distraction, full-distraction, and delay-distraction. Given that the results of the change detection task in Experiment 1 showed no evidence of differences between the no-distraction and encoding-distraction conditions, we did not include encoding-distraction condition in Experiment 2. However, in both the full-distraction and delay-distraction conditions, we introduced two types of distractor stimuli. In the full-orientation-distraction and delay-orientation-distraction conditions, orientation stimuli (same category as the target) were used as distractors. In the full-face-distraction and delay-face-distraction conditions, face stimuli (different category from the target) served as distractors.