Introduction

The Trail Making Test (TMT) is one of the most widely used tests in clinical and research settings as a tool to assess executive functions and brain impairments quickly and easily1,2,3,4,5. The original TMT is a paper-and-pencil task in which participants graphically connect, in ascending order, a series of 25 circles labeled with numbers and/or letters as quickly and accurately as possible. If errors occur, the examiner highlights them in real time, allowing the examinee to make corrections. The TMT is divided into two parts: the first part (TMT-A) consists of 25 circles randomly arranged on the page and numbered from 1 to 25. The second part (TMT-B) also contains 25 circles, but half are numbers (from 1 to 13) and the other half are letters (from A to L); the participants’ task is to connect the circles by alternating between numbers and letters in ascending order (i.e., 1 to A, A to 2, 2 to B, etc.2,4,6. In both TMT-A and TMT-B, the primary measure of performance is the total time required to connect all 25 circles6. The paper-and-pencil version was originally included in the Army Individual Test Battery7: researchers have found that TMT scores correlate strongly with intelligence, specifically the results have shown that individuals with lower IQ scores take longer to complete the TMT8.

Although the TMT appears to be an excellent tool for indicating the presence of brain impairment, poor performance on the task is a non-specific result, as it does not accurately pinpoint which aspects of the performance are deficient in an individual with cognitive impairment. Due to the complexity and the diverse nature of the cognitive processes involved in the TMT, poor performance can be attributed to different causes (e.g. motor slowing, poor coordination, visual scanning difficulties2. Therefore, although most of the studies agree on the multifactorial nature of the processes involved in the TMT, there is a lack of consensus regarding their exact roles and influence. Sànchez-Cubillo and collaborators9 presented an overview of 24 studies that aimed to clarify the processes underlying TMT scores and found that visual search, motor skills, working memory, and general intelligence are the most cited processes that contribute to TMT performance. Moreover, other studies highlight that the TMT-B involves additional executive function resources2,10,11,12 such as cognitive flexibility, inhibition/interference control, working memory, mental tracking, and attentional set-shifting13,14,15,16,17,18.

In recent years, two paradigms in visual cognition have emerged as particularly relevant to understanding performance in the Trail Making Test: The Multiple Item Localization (MILO) task and hybrid foraging. Both paradigms involve serial visual search under constraints of memory and target al.ternation conditions that closely mirror those found in the TMT, especially in part B. In hybrid foraging tasks19, participants search for multiple targets from two or more categories simultaneously. A consistent finding is the tendency to select items from the same category in bursts before switching reflecting the cognitive cost of alternating between target types. These switch costs suggest that participants optimize their search by minimizing category changes, revealing a trade-off between memory load and attentional control. The MILO task20 was designed to simulate sequential search across interleaved target categories, much like the letter-number alternation in TMT-B. In MILO, participants must select targets in a fixed order (e.g., 1-A-2-B.), resulting in longer response times at category-switch points. Although the so-called “saw-tooth” pattern has not always been directly reported, the RT dynamics in MILO have been interpreted in terms of switch costs and inhibitory tagging, a mechanism whereby recently selected items are suppressed in memory to prevent their re-selection21.

These experimental paradigms have substantially advanced our understanding of the cognitive architecture underlying the TMT, offering insights into how visual search, category switching, and memory interact. Their relevance is further supported by a long-standing body of research that has consistently highlighted visual search as a central component of TMT performance. Ehrenstein and collaborators22, for example, investigated the role of visual search in Part A of the TMT, by correlating the TMT-A performance with different verbal and non-verbal tests, such as the Objects Finding Test (OFT). The results of the study showed a significant correlation between performance in TMT-A and the visual search skills required by the OFT, leading the researchers to conclude that the primary process underlying the TMT-A performance is visual search22. Later, Crowe23 investigated the differential contributions of visual search and attentional functions on TMT performance, using a modified version of the TMT-A and TMT-B without numbers and letters. The results of the study demonstrated that TMT-A uniquely measured visual search and motor speed, while TMT-B measured visual search and cognitive alternation of operations23.

To better understand the mechanisms underlying visual search in the TMT, it is useful to turn to the extensive body of research on visual search paradigms, which has long examined how task and display properties affect performance. It has been long known that, in visual search tasks, some factors can affect performance in terms of reaction times (RTs), including the number of distractors displayed and the target that can have unique features or conjunctions of features with other distractors24. The demonstration that visual search response time frequently increases with the number of distractors has been a fundamental aspect of attention theories24,25,26,27,28. In other words, it is well known that the number of distractors affects serial visual search producing the so-called Display Set-size Effect(DSE, e.g29.). This happens because the level of interference in encoding a stimulus may increase with the quantity of stimuli, possibly as a result of the dispersal of attentional resources30,31. This phenomenon, often labeled as “resource limitation” or “limited capacity”32, does not restrict the number of stimuli encoded, but results in a more rapid decline in performance as a function of set-size. However, in some visual search tasks, the number of distractors has minimal impact, and the target seems to “pop out” from the display33. This can even capture attention without being the search target34,35. These patterns underlie Anne Treisman’s Feature Integration Theory (FIT)24. Treisman proposed an initial “preattentive” processing stage in which basic features like color and orientation are processed in parallel across the visual field.

While the number of distractors in a visual display can significantly impact search efficiency, it is also important to consider how individuals engage with the display. In other words, the strategy adopted to visually scan the environment can influence performance independently of stimulus quantity. Boot et al.36, for instance, demonstrated that different scanning strategies ranging from highly active to more passive, can lead to significantly different outcomes. Interestingly, participants who made fewer eye movements, relying instead on a radial “sit and wait” strategy, achieved better results. These findings are relevant not only in the broader context of visual search research but also for understanding performance in tasks such as the TMT, where visual scanning plays a central role. Such strategic components may therefore offer a useful lens through which to interpret how individuals handle varying display complexities and demands an aspect further explored in our experimental investigation.

In addition to strategic variations, perceptual features of the stimuli, particularly their salience, also play a critical role in modulating visual attention. Traditional interactions of TMT-A exhibit a uniform level of target salience, given the perceptual homogeneity of all display elements. However, experimental modifications of the TMT like the one we are going to introduce in this study can induce differential salience between the current target and previously acquired targets. Specifically, the alteration of prior target colors can result in a prominent “pop-out” phenomenon, thereby creating irrelevant singletons distinguished by their immediate physical dissimilarity from the present target. In this conceptual framework, TMT-A can be conceived as a series of single target searches where every time a target is found it gets turned into a distractor and one of the distractors becomes the new target. Lamy and Zoaris37, in a study that does not use the TMT, show that stimulus salience influences attention even when it is task-irrelevant. These results offer evidence that contradicts Theeuwes’38 proposal, which posits that attentional capture by a salient distractor does not occur in the context of serial search. Also, Lamy and Tsal39 showed that distractors with salient features did not interfere with performance when the participant was engaged in searching for the target. Additionally, the hypothesis of salient-signal suppression, proposed that individuals inhibit signals originating from items that are salient but irrelevant while searching the target40,41,42. Another study in a similar direction is by Proulx and Egeth43, who investigated whether the similarity between the target and the distractors can influence performance. In this case, they revealed that distractors dissimilar to the target facilitate visual search.

In line with Lamy and Zoaris findings, salient items in the TMT could facilitate participants attention, since salient items are dissimilar from the subsequent targets and easy to filter out from future search43. In contrast, other studies show that salient distractors are not able to capture attention during serial search38, because individuals tend to narrow their attentional focus and inhibit signals originating from items that are salient but irrelevant while searching for the target40,41,42. However, Burnham44 showed that in a dual-task condition, attention is captured by a task-irrelevant color, but this happens in a single-task condition. This suggests that manipulating the display set-size might affect performance more in TMT part B than in TMT part A. In Part B, participants need to not only search visually for targets but also to switch between target types (i.e., letters and numbers) during the search and keep track of the last item reached for each target type. Such a switching has been shown to be time-consuming, with longer response times when successive selections involve different target types compared to when they involve the same type. For example, studies on Hybrid Foraging tasks have demonstrated robust effects of category switching. Responses tend to be slower when switching between target types than when selecting successive items from the same category45. Further support for the impact of switching between target categories comes from research using the MILO task20,46,47. In MILO, participants must search for a fixed sequence of targets from two or more categories, often revealing increased response times following category switches. This has been interpreted as evidence for switch costs and mechanisms such as inhibitory tagging21. In a variant of the MILO task, participants search for interleaved sequences of targets belonging to different categories, similar to the alternation required in TMT-B. Trial by trial results from the MILO task showed a characteristic “saw-tooth” pattern of response times, with slower responses when switching to a different target type and faster responses on the following targets, suggesting a cost when switching target and a preliminary pianification of the search of the upcoming targets. Moreover, the results obtained with the MILO seem to suggest the interleaving targets also disrupts retrospective inhibitory tagging, a mechanism that helps participants suppress interference from previously selected targets20. Disruption of inhibitory tagging may contribute to the difficulties observed in TMT-B performance, as switching between categories appears to interfere with the ability to chunk upcoming targets and to exert inhibitory control over previously selected items. Finally, based on Burnham’s findings, it can be hypothesized that participants’ attention in Part B will be more influenced by irrelevant information. This means their attention could be drawn more to salient irrelevant items, making it easier to recognize them as distractors and exclude them from the search43. Just like in a dual-task condition where irrelevant items are quickly excluded, heightened attentional sensitivity could help in Part B, but might also make it harder when no items can be excluded. Therefore, the comparison between Parts A and B will be interesting precisely due to the presence of the switch task and the influence of the attentional and perceptual components.

Digitized TMT and eTMT

The digitization of paper-and-pencil tests has the advantage of increasing the ease of administration, standardizing the presentation of the task, but it also allows the acquisition of additional parameters for a more thorough evaluation of cognitive processes. Digital versions enable the collection of data and the processing of single items in customizable research settings. In the last decade, numerous digitized versions of the original paper TMT have been implemented and have successfully addressed these aspects48,49,50. For example, Reynolds created the Comprehensive Trail Making Test (CTMT51; that was designed to expand the task and address any issues of norming and standardization of the TMT. The CTMT consists of a sequence of five trails: Trails 1–3 are similar to TMT-A, but Trails 2 and 3 introduce increasing numbers of distractor circles that participants must ignore. Trails 4 and 5 are more similar to TMT-B: in Trail 4, participants are asked to draw a line between Arabic numerals in circles and number words in rectangles; in Trail 5, similar to TMT Part B, there are empty circles provided as distractors. Results show that the task gets progressively more difficult with the addition of distractors51. However, this study did not analyze what might happen instead with the gradual elimination of distractors in terms of a possible facilitation effect.

Custom versions of the TMT were created digitally to test specific hypotheses, for example, the Connection Test52. This test involves an array of circles where the targets are numbers, letters, or alternating numbers and letters. The task is to draw lines connecting the circles in the given sequence, with targets positioned above, below, to the left, to the right, or diagonally from the previous one. The study helps to understand individual differences in alternating and non-alternating versions of the TMT. Perceptual speed accounts for most of these differences, particularly age-related ones. Although alternating versions take more time, both versions share significant variance, indicating minimal age-related effects on switching processes.

Beyond the display set-size effect, other factors seem to influence participants’ performance. In 2015, Woods and colleagues53 proposed a new version of the Computerized TMT (C-TMT) with the aim, among others, of reducing the interference caused by visual clutter of the lines drawn. The authors hypothesized that participants who draw imprecisely or make errors will add visual clutter, which complicates the detection of subsequent targets and, consequently, increases completion times. In their C-TMT, errors were automatically corrected, and erroneous paths were removed to decrease visual cluttering. Results showed that completion times on both C-TMT-A (numbers only) and C-TMT-B (alternating letters and numbers) were significantly influenced by age. However, this study does not clarify whether the correct pen-drawn line may also produce visual cluttering, making the task more difficult or, conversely, whether the line drawn with the pen can facilitate the task. In other words, once a line has been drawn with the pen on and between the items, it is possible to hypothesize that they are functionally “canceled” and therefore no longer taken into consideration in the visual serial search task, effectively reducing the display set-size.

In summary, while various digital versions of the TMT have explored the role of visual clutter, it remains unclear under which specific conditions it helps or hinders performance. No studies so far have looked at how the number of distractors specifically affects TMT performance. Reynolds51 noted that adding distractors makes the task harder but did not examine what happens when they are reduced. Woods et al.53 studied the number of distractors, but the exact role of visual clutter remains uncertain. Salthouse et al.52 addressed visual disturbance and motor interference but excluded distance as a variable. Therefore, a study is needed to disentangle factors like strategy, the number of distractors, and whether the lines drawn connecting targets might actually help by “canceling” them. The main aim of this study is to verify how the item set-size affects performance in the TMT task. We intend to test set-size effects by manipulating the appearance of the already-found targets. Additionally, we will investigate the influence of visuospatial memory by adding a condition in which all items change position after each correct target-reaching action. Furthermore, we aim to delve deeper into the cognitive processes involved in this task by controlling for the influence of inter-target distance and path continuity. To do so, we implemented a new digital version of the TMT called eTMT to test the influence of the following factors on visual search:

  1. 1)

    item set-size

  2. 2)

    visual cluttering produced by the lines drawn

  3. 3)

    the permanence/cancellation of already-reached targets

  4. 4)

    visuospatial memory

  5. 5)

    Spatial arrangement of subsequent targets

To investigate these properties, in Experiment 1 we created three different versions of the TMT:

  1. 1.

    Standard version: replicates the original paper-and-pencil task.

  2. 2.

    No-Line (NL) condition: participants still draw lines with the mouse between target circles, but as soon as the next target is reached, the previously drawn line disappears, allowing all distractors to remain fully visible throughout the task. This condition aims to verify whether the line drawn by participants contributes to “canceling” items, thereby reducing the display set-size and the number of items among which participants must search. We hypothesize that this version of the TMT will be more difficult than the original, as the absence of a persistent line may make the visual search task more demanding.

  3. 3.

    Black Discs (BD) condition: once the next target is reached, the line disappears, and the just-reached circle turns black. This condition allows us to disentangle contrasting results and interpretations. Following Lamy and Zoaris37, black discs may influence participants’ attention by facilitating the task43. Conversely, attentional capture by a salient distractor might not occur in the context of serial search38. With this condition, we aim to understand whether transforming previous targets into black discs—making them more salient but also more dissimilar from other targets facilitates or interferes with the serial visual search task in the TMT.

Moreover, we speculated that if the main influence on the visual search task of the TMT is given by the display set-size, then we should get a facilitation by the transformation of previous targets into black discs, consequently reducing the set-size. If this were the case, we should observe a progressive decrease in response time as the task progresses due to the increasingly narrower display of distractors.

To summarize, the aim of this study is to verify the hypothesis that display set-size is a variable that could significantly affect TMT performance. Hence, we expect the first condition (Standard) to replicate the classic results of the TMT; the second condition (No-Line), is expected to be more difficult as the just-drawn line disappears preventing the participant from “canceling” the targets already reached and therefore forcing them to search for the next target throughout the screen without being able to neglect those already analyzed; finally, the third condition (Black Discs), the one in which the circles are transformed into black discs after they have been reached, could be the simplest condition because it allows the participant to easily filtering the black disks out of the search. This should happen for both parts A and B of TMT, since they are both based on a visual search task. Burnham44 found that attention is drawn to task-irrelevant colors in dual-task conditions, but not in single-task conditions. Based on this, we suggest that in Part B of the TMT, which involves a dual task, our manipulations could have a more pronounced impact. Drawing from Burnham’s study, we hypothesize that participants’ attention in part B will be more susceptible to task-irrelevant information, such as the black discs used in our study43. This heightened attentional bias towards the black discs may facilitate their quick identification as distractors or non-targets, aiding in their exclusion from the search. Conversely, this heightened attentional sensitivity might pose challenges in conditions where items cannot be excluded from the search, such as when connecting lines between successive targets are removed after drawing. Therefore, comparing parts A and B will be particularly insightful due to the dual-task nature of part B and its impact on attentional and perceptual processes43,44. Considering all previous results, we expect:

  1. 1.

    To replicate results of previous experiment with our Standard condition (part A easier than part B);

  2. 2.

    To find slower reaction time in the No-Line condition, since the absence of the line that passes and cancels items makes the visual search task more challenging;

  3. 3.

    To find a facilitation in the Black-Discs condition37,43;

  4. 4.

    To obtain different influences of our manipulations (NL and BD) on the performance of Part A and Part B, given that the two parts involve different cognitive processes, and Part B is considered more challenging.

Experiment 1

Method

Power analysis. This study was powered to detect moderate-to-large differences in test performance between test variants. The power to detect differences between testing variants was examined using the program Gpower 354. This indicated that detecting an effect size of 0.2, at 80% power, would require a sample of minimum 68 participants.

Participants

Sixty-nine participants (29 males; mean age = 25.59 years, SD = 11.91 years; range = 16–62) took part in the experiment. All participants reported normal or corrected-to-normal vision. Although previous studies55 did not find an influence of handedness on TMT performance, we chose to include only right-handed participants in our sample to avoid potential confounds. They were naïve as to the purpose of the study and gave written informed consent (for minors informed consent have been obtained by a parent) in accord with the declaration of Helsinki; the experimental protocol has been approved by the Ethical Committee of Università Europea di Roma (approval number 3/2024). Data were collected in 2022.

Materials

We created the eTMT, a digital version of the TMT, and manipulated it to generate three different conditions of the test. In the eTMT the test display is no longer vertical, as in the paper-and-pencil version, but has been adapted for a horizontal screen while maintaining the same spatial relationships between the targets, adjusted to fit a horizontal layout. Experiment 1 includes three conditions of the TMT test: The Standard (ST) condition is analogous to the paper-and-pencil version (both version A and B; see Fig. 1); The No-Line (NL) condition has the pen stroke removed: as soon as the participant reaches a target, the line just drawn with the mouse disappears (see Fig. 2); The Black Discs (BD) condition removes the lines and transform the reached targets into black discs (see Fig. 3).

Fig. 1
figure 1

Example of the training phase of the Standard condition of the eTMT-A and B. In version A, participants are asked to connect the numbers in ascending order by drawing a line with the mouse. In version B, participants are requested to connect the numbers and letters in ascending and alternating order (1-A; 2-B; 3-C; etc.). In the Standard condition all the strokes drawn remain visible on the screen until the end of the task (as happens in the original paper-and-pencil version of the test).

Fig. 2
figure 2

Example of the training phase of the No-Line condition of the eTMT-A. Each time the participant reaches the next target, the line just drawn disappears.

Fig. 3
figure 3

Example of the training phase of the Black Discs condition of the eTMT-A. Each time the participant reaches the next target, the line just drawn disappears and the previous turns into a black disc.

Procedure

After receiving on-screen instructions, each participant performed the three conditions Standard, No-Line, and Black Discs (ST, NL, and BD) in the two different task versions (Parts A and B) resulting in a total of six trials per participant. All participants completed Part A first and then Part B, so the order of the two tasks was the same for everyone. The administration order of the 3 conditions was instead randomized for each participant to control for potential order effects. Before starting each condition, participants had the opportunity to undergo training in a TMT version containing only eight items (see Figs. 1, 2 and 3). A separate training trial was provided for each manipulation of the task, to ensure that participants were familiar with the specific format of the upcoming condition. Participants were asked to complete all tasks as accurately and as quickly as possible. In the TMT-A, the participants’ task was to connect, using the mouse, 25 encircled numbers distributed on the screen from 1 to 25 in ascending order (1-2-3, etc.). For the TMT-B, participants were asked to connect numbers and letters in alternating, ascending order (1-A; 2-B; 3-C, etc.). To connect one circle to the next, participants had to keep the mouse button pressed and could release it and press it again as they wished.

Apparatus

The study was carried out using a laptop computer (MacBook Pro 15’) and a mouse. The computer was running eTMT: a custom-made script in Max 8 (Cycling ’74 (https://cycling74.com/)).

Results

Firstly, a correlation analysis was conducted to determine whether age had any effect on the total time taken to complete the TMT tasks. We calculated the total time elapsed to complete the TMT tasks (i.e., the A and B parts of the Standard, No-Line, Black Discs TMT versions) and assessed whether these times correlated with the participants’ age. After verifying that age did not significantly influence the total time (r =.129, p =.294), we calculated the z-scores associated with the total time elapsed to complete all tasks for each participant. The maximum z-score was 2.41, thus all participants were included in the following analyses, and we proceeded to analyze the mean reaction times (RTs) taken to reach the targets for each participant. For each participant, we analyzed the mean reaction times (RTs) taken to reach the targets. In other words, the program calculates each individual RTs, from the moment the previous target is reached to the moment the next target is touched. Repeated measures analysis of variance (ANOVA) was used to evaluate whether the mean RT, varied as a function of TMT parts, (2 levels: “A” and “B”), the TMT conditions (3 levels: Standard, No-Line, Black Discs) and the comparison between the individual average reaction time taken to reach the first 12 targets (from 1 to 12) and the average time taken to reach the last 12 targets (from 13 to 24) (2 levels: Early and Late targets); see Figs. 1, 2 and 3. We have included this last variable to assess whether participants’ performance benefited from the reduction in display set-size (in the BD condition as well as in the ST condition) and thus corroborated the results obtained so far regarding the influence of distractors once eliminated from the display. In case of violation of the sphericity assumption (Mauchley’s sphericity test; p <.05), a Greenhouse-Geisser correction was performed. The results of the ANOVA revealed, as expected, a main effect of parts [F(1, 67) = 90.73, p <.001, η2p = 0.575], indicating in general faster RT in the TMT-A (M = 1400, SE = 34.5 ms) than in the TMT-B (M = 2222, SE = 98.6 ms) were also task-switching abilities are required. Moreover, the ANOVA revealed a main effect of conditions [F(2, 134) = 11.97, p <.001, η2p = 0.152]. Post-hoc Bonferroni comparisons showed significantly longer RT in the NL condition (M = 2038, SE = 90.8 ms) as compared to the ST (M = 1771 ms, SE = 58.4 ms, Mean difference = − 267 ms, 95% CI [− 439, − 95], t(67) = − 3.11, p =.008, d = 0.38) and the BD (M = 1624 ms, SE = 80.3, mean difference = − 413 ms, 95% CI [− 564, − 262], t(67) = − 5.48, p <.001, d = 0.66) conditions (see Fig. 4). We observed no significant main effect in the comparison between the average reaction times between early (positions 1–12) and late targets (13–24) [F(1, 67) = 0.590, p =.445, η2p = 0.009]. Analysis revealed a significant interaction between the conditions (ST, NL and BD) and parts (A and B) [F(1.82, 122.23) = 5.18, p =.009, η2p = 0.072]. Indeed, post-hoc comparison showed within the part A, significant longer reaction time for the ST condition compared to the BD condition (mean difference = 165.2, 95% CI [80.2, 250.2], t(67) = 3.89, p =.004, d = 0.47). Moreover, within Part B, post-hoc comparisons showed longer reaction times in the NL condition compared to the ST condition (mean difference = 242 ms, 95% CI [91, 393], t(67) = 3.21, p =.031, d = 0.39) and to the BD condition (mean difference = 560 ms, 95% CI [412, 708], t(67) = 7.58, p <.001, d = 0.92). Participants in part B experienced an 11.5% slowdown in the NL condition compared to the ST condition, while conversely, they sped up their reaction times by 15.4% in the BD condition compared to the ST condition. In Part A instead participants were faster by 11.8% in the BD condition compared to the ST one and were slower in the NL condition by 3.2% compared to the ST condition. Finally, analysis revealed another significant interaction between TMT parts (A and B) and Early and Late targets [F(1, 67) = 46.93, p <.001, η2p = 0.412]. Post-hoc comparisons (Bonferroni) showed a significant difference between early and late targets within Part A (mean difference = 414 ms, 95% CI [338, 490], t(67) = 10.94, p <.001, d = 1.33) and within Part B (mean difference = − 344 ms, 95% CI [− 531, − 157], t(67) = − 3.67, p =.003, d = 0.4; see Fig. 5). Therefore, these analyses indicate that participants take less time to reach the next target in the second half of the task compared to the first. However, this only occurs in Part A of the TMT because this pattern reverses in Part B of the test; in this case, reaction times in the second half of the task increase compared to the first half. This result seems to precisely reflect the presence of the switch task in part B of the test. As the task progresses, the switch task becomes more demanding, as it becomes increasingly challenging to keep track of the running count with increasing magnitudes. To more precisely characterize the response patterns in the first and second halves of the task, we conducted a more complex ANOVA that included serial reaction time (SRT) defined as the sequential time taken to reach each of the 24 targets. This ANOVA incorporated three factors: TMT parts (2 levels: “A” and “B”), TMT conditions (3 levels: Standard, No-Line, Black Discs), and SRT (target number as a factor: 24 levels, from 1 to 24; see similar approaches in20,46,47. As expected, this analysis confirmed all previous findings: specifically, a main effect of parts [F(1, 67) = 89.68, p <.001, η2p = 0.572], a main effect of conditions [F(1.82, 122.29) = 12.38, p <.001, η2p = 0.156], and a significant interaction between conditions and parts [F(1.79, 120.14) = 4.81, p =.012, η2p = 0.067]. In addition, the analysis revealed a main effect of target number [F(4.81, 322.89) = 15.08, p <.001, η2p = 0.184], as well as significant interactions between target number and part [F(5.10, 341.98) = 12.98, p <.001, η2p = 0.162], and between target number and condition [F(46, 3082) = 1.91, p <.001, η2p = 0.028]. Figure 5b illustrates the SRT and clearly shows the significant time variability required to reach each sequential item in both parts of the TMT. Notably, the “saw-tooth” pattern characterized by alternating slower and faster responses in consecutive pairs was absent in TMT-B, in contrast to the pattern reported by Thornton and Horowitz20 in a MILO variant that resembled the features of TMT-B with 12 sequential targets.

Fig. 4
figure 4

Mean reaction time of all conditions. We found significant differences in both parts of TMT but especially in eTMT-B (switch task). Error bars represent standard error. Within TMT-A, the difference between the Standard and No-line conditions was not significant; all other differences were significant.

Fig. 5
figure 5

a Mean reaction time participants took to complete Early (1–12 positions) and Late (13–24) targets of the test and the two parts of the TMT (A and B). The pattern of results shows, within part A, a reduction of reaction times in late targets compared to early ones, while on the contrary, an increase of reaction times in late targets of the test compared to early targets, within part B. Error bars represent standard error. Asterisks denote significance levels (p <.05 (*), p <.01 (**), p <.001 (***). b Mean sequential reaction times (SRT) of the participants as a function of the two parts of the TMT (A and B). Error bars represent the standard error of the mean. The pattern of results shows, within part A, a reduction of reaction times in late targets compared to early ones, while on the contrary, an increase of reaction times in late targets of the test compared to early targets, within part B. Error bars represent standard error.

Transparency and openness

We report how we determined our sample size, all manipulations, and all measures in the study. All research materials are available upon request from the corresponding author. Data were analyzed using Jamovi, version 2.3.21. This study’s design and its analysis were not pre-registered. All data have been made publicly available on the Open Science Framework- OSF and can be accessed at https://osf.io/kwjxu.

Discussion of exp. 1

The results confirm, as expected, that reaction times are slower in Part B of the TMT suggesting that the switching requirement included in TMT-B makes it more difficult than TMT-A52,56,57. The removal of the drawn line specifically affects Part B: the No-Line condition, where all the distractors remain visible, and line feedback is removed, is the most difficult one. These results confirm our hypothesis regarding the influence of distractors on this task: When distractors remain on screen, reaction times are delayed. This is a remarkable result, as it allows for deeper exploration into the type of processes and variables that influence this visual search task. Our NL manipulations highlights that in TMT-B, a task already complicated by the requirement of switching between numbers and letters, the participants slow down even more (an 11.5% slowdown in the No-line condition Part B compared to the Standard condition Part B; a 3.2% slowdown in the No-line condition Part A compared to the Standard condition Part A). We argue that in TMT-B participants could be forced to analyze the entire visual display without being able to neglect previously selected targets. This experiment shows a strong set-size effect, clearly affecting performance and creating a consistent pattern of results. This evidence is further supported by the fact that the Black Discs condition replicates the effect of the Standard one in canceling already selected targets. Our finding confirms the hypothesis that both in the original paper-and-pencil and in the digital versions, visual search is facilitated by the drawn line or by the canceling of the items (turned in black discs) which gradually reduces the field of distractors among which to detect the next target. Moreover, these results confirm that cognitive load interacts more prominently with the difficulty introduced by the absence of the drawn line. As hypothesized and consistent with the literature44, the most significant effects are observed in TMT-B. These findings also align with the literature on visual stimulus salience: at a perceptual level, the removal of the line prevents the formation of visual prominence within the field of stimuli, making it impossible to visually exclude already-processed (and blackened) items43. MILO studies have shown that making previously selected items disappear (Vanish condition) can facilitate sequential search [20; 46]. In Experiment 1 of the present study, by contrast, visited targets remained visible but were transformed into black discs, thereby variating their salience without reducing overall set-size. This manipulation differs from the MILO Vanish condition, yet our results suggest that even without full distractor removal, altering the visual status of visited items can modulate search efficiency, pointing to a graded role of display dynamics in sequential search.

The final aspect we aimed to investigate pertains to the difference in average reaction times between completing the initial and latter halves of the test. We conducted this analysis to confirm the effect of distractor set-size on task performance. Based on the results we obtained comparing the three different conditions, we expected to find a gradual reduction in reaction times within the Standard and Black Discs conditions. This expectation stemmed from our hypothesis that the presence of black discs (BD) or the removal of targets via the line (ST) would progressively diminish distractor magnitude, thereby easing the task and resulting in quicker reaction times, for instance, comparing Early targets (where most distractors are present in the visual field) with Late targets (where half the distractors are absent due to darkening or line erasure). However, we found no significant interaction between our conditions and average reaction times across the test’s two halves (Early and Late targets). These null results could be interpreted as an evidence that participants had more time to process the position of late items due to their increased exposure time. However, a significant interaction emerged between the two parts of the TMT (A and B) and Early/Late targets showing a contrasting patterns of faster performance in the second half of TMT A, and slower responses in the second half of TMT-B. This discrepancy notably reflects the presence of the switching task in TMT-B. As complexity increases, maintaining a running count of numbers and letters becomes more arduous, consequently delaying performance in Late targets of TMT-B. Alternatively, or in conjunction, this finding could be explained by considering the presence of automatic processing of the beginning part of the alphabet (A < B < C < D) and more effortful processing of later letters (H < K < I < L) (see also58. Remarkably, this interaction persists, regardless of our experimental manipulations. Based on the results of Experiment 1, it remains unclear why, despite the observed general facilitation from cancelling reached items, the gradual reduction of active distractors did not result in a progressive decrease in reaction times as the task advanced. Consequently, we conducted another experiment to assess whether other variables, such as the distance between successive targets and the degree of continuity in the direction taken to move from one target to another, could affect reaction times.

Experiment 2

The main purpose of Experiment 2 was to test why the TMT task does not become simpler over time (as observed in Experiment 1, where no significant interactions emerged between the conditions, ST/BD/NL and the stage of the task, Early/Late targets) but remain equally demanding throughout all repetitions. Additionally, Experiment 2 aims to control for variables that may influence TMT performance, which were not accounted for in previous versions of the test.

It has been demonstrated that the implicit learning of distractor configurations, when repeatedly presented, can provide a “contextual cueing” effect that facilitates target search in visual search tasks. A study conducted by Manginelli and Pollmann59 investigated how search behavior adjusts when the target location shifts from a constant position to a new location. The results showed that altering implicitly learned spatial relations between target and distractor configurations eliminated the contextual cueing advantage.

Thus, to control for this possible effect in the TMT task, we introduced a new manipulation: once the next target is reached, not only does the drawn line disappear (a condition that, as we saw in Experiment 1, increased reaction times in the NL condition), but all the items also change their position on the screen. More specifically, while the position of the circles on the screen remains constant, the items are shuffled among them each time a target is reached. In this way, the potential facilitation provided by memorizing item positions is eliminated, as targets switch positions pseudo-randomly with each new target.

Previous studies have indicated that the distance between targets and the spatial arrangement of distractors correlates with the difficulty of visual search. Specifically, Gaudino and collaborators56 identified variables related to spatial positioning that influence performance in TMT-B. They showed that part B differs from part A in terms of average distances (part B has 2.4 cm more between each target) and in the relationships between one target and another (part B involves more visual interference as there are more distractors between targets compared to part A). Moreover, in a study by Linari and collaborators49, a new digitized version of the TMT was proposed, in which they systematically controlled the accuracy and speed of task execution and added a measure for tracking eye movements. In this study, in addition to replicating the results of the original version, they were able to provide indices that more specifically reflect measures of executive control. Furthermore, they found differences between Parts A and B, specifically in the number of fixations, which was greater in Part B. The authors of this study do not exclude the possibility that some differences may have emerged due to the influence of the different spatial organization of the stimuli in the test. Regarding proximity, Salthouse and collaborators52 demonstrated that it is possible to reduce irrelevant influences of visual search and hand movements by using a version of the test in which the targets are adjacent. On the other hand, Reddy and VanRullen60 investigated whether items being close together make the task harder due to visual “clutter” or, conversely, increasing the spacing between objects results in better search performance. Their results showed that increasing the distance between objects improves visual search performance. Regarding spatial continuity between subsequent targets, Fischer and Whitney61 demonstrated that after seeing a particular pattern, people’s judgments about the orientation of similar patterns that follow are influenced by the orientation of the initial pattern. They suggested that this serial dependence arises from a spatiotemporal integration window, termed the continuity field, where stimuli observed a few seconds ago interact with the perception of current visual stimuli. This concept has been supported by additional research, which found that judgments related to numerosity62, eye gaze63, shape64, motion coherence65, facial identity66, gaze direction63, and emotional expressions67 also exhibit serial dependence based on past perceptual experiences. Moreover, Fritsche and colleagues68,69 as well as Rafiei and collaborators70 demonstrated the existence of an attractive or repulsive bias concerning the proximity between an item and another item processed earlier. Their studies indicate that an attractive bias in perceived orientation occurs when preceding items share similar orientations, while a repulsive bias occurs when they exhibit dissimilar orientations. Conversely, it is possible to consider the “Inhibition of Return” (IOR) effect that suggests that items that have been attended to are afterwards inhibited in a way that enables attention to shift away, creating a tendency to avoid selecting that specific item again, at least for a certain period. Posner and Cohen71 were the pioneers in presenting proof for a mechanism wherein, after attention is directed to a location and subsequently withdrawn, there is a delay in response time (RT) when a target emerges at the previously focused spot. They coined the term “Inhibition of Return” to convey the concept that attention may be restrained from returning to locations that have already been examined. The interaction with the IOR effect with visual search is well established: utilizing a set of visual search tasks modeled after Treisman and Gelade24, Klein72 evaluated reaction time to a dot-probe displayed immediately following a visual search trial. When the search task was challenging (requiring effort to locate the target), but not when it was easy (with the target easily standing out), observers were slower to respond to the dot-probe if it appeared at the same location as a distractor in the search array. Klein72 and others employing this task, such as Müller and Mühlenen73 and Takeda and Yagi74, attributed this pattern to IOR under the assumption that attention assesses distractors to figure out if they are the target in situations where the search is demanding. Boot and collaborators36 also discovered a similar IOR-like pattern in saccadic responses to sequentially presented stimuli: participants were less likely and slower to fixate on the target when it appeared at a location previously fixated. This effect could also influence a task like the TMT, and it is possible to imagine that this phenomenon is related to direction as well as position, as other studies have discussed oculomotor suppression related to movement and direction (e.g75.,.

Considering all the above-mentioned studies, in Experiment 2, we decided to introduce a manipulation in the eTMT where we control the distance between successive targets and their spatial arrangement among distractors. Following Salthouse and colleagues52, it is possible to hypothesize that the task could be simpler, with reduced reaction times, when the next target is adjacent to the target just reached and, conversely, harder, with longer reaction times, when the next target is farther away. However, following Reddy and VanRullen60, it is plausible to hypothesize the opposite: when items are adjacent the task is harder due to visual “clutter”, and increasing the spacing between objects improves search performance. Indeed, in this new condition, we systematically manipulated target positions to ensure that half of the targets were near, and half were distant from the previous target. Moreover, following studies on serial dependence61 and attractive and repulsive bias68,69,70 it is possible to hypothesize that, in the context of a serial visual search task, a condition of spatial continuity or discontinuity between successive targets can affect performance. Specifically, if the line we have drawn to reach the last target maintains continuity with the direction of the subsequent target (e.g. within a 90° variation), it could be easier to reach the latter with shorter reaction times compared to when spatial discontinuity in the orientation exists between the two targets (e.g. more than 90°). Conversely, based on studies considering the IOR phenomenon36,72,73,74, it is possible to speculate that participants could be less likely and slower to fixate on the subsequent target when it is in the same direction as the just-reached one. In other words, it is possible to hypothesize that the IOR effect could extend to the inhibition of gaze direction.

Finally, following the results of Experiment 1 regarding the better performance obtained in the black discs condition, we aim to further simplify the task by creating an additional condition in which, once the target has been reached, it completely disappears from the screen. It is possible to hypothesize that although there is a benefit in Experiment 1 from turning the targets into black discs, they may still interfere with visual search, as they are all still visible. With this additional condition, we can verify whether making an item disappear completely can further improve visual search performance, confirming the fact that in TMT visual search can be influenced by the number of distractors. For example, a similar manipulation was performed in a previous digital implementation of the Multi-Item Localization (MILO) task46, a task designed to mimic the A and B variants of the TMT [20, 47]. In the digital version of the MILO task, targets either vanished or remained visible after being touched, revealing a greater facilitation effect for vanishing targets, thus an improved visual search performance, specifically in the more complex switching condition, which resembled the demands of TMT-B.

In summary, in Experiment 2, we administered two new conditions: a hypothesized simple one called Disappearing Discs where the just reached target disappears, and a complex one we called MeDiSp (controlling for Memory, Distractors, and Spatial positioning). In the MeDiSp condition it is impossible for participants to memorize the positions of the items (see Manginelli & Pollmann59, because each time the participant must search for the next target in a new field of distractors in which the latter are always effective and do not have salient features that can be easily eluded (see Proulx & Egeth43. Additionally, in the MeDiSp condition, we systematically manipulated the spatial arrangement of the test items following precise criteria crossing the two variables Distance (near/far) and Continuity (continuous/discontinuous). Each time the participant reaches a target, all the other items change their position on the display in such a way that the next target can be near or far compared to the just reached target. Moreover, each time the participant reaches a target, all the other items change their position on the display so that the next target will require a continuation of movement in the same direction (Continuous) or in a different one (Discontinuous).

Experiment 2 is thus designed to confirm and expand our understanding of how the display set-size affects TMT performance (Exp. 1) and to explore the influence of spatial positioning of targets, while controlling for implicit learning and discrepancies between Parts A and B.

Method

Power analysis. This study was powered to detect moderate-to-large differences in test performance between test variants. The power to detect differences between testing variants was examined using the program Gpower 354. This indicated that detecting an effect size of 0.2, at 80% power, would require a sample of minimum 68 participants.

Participants

87 participants (53 males; mean age = 25.47 years, SD = 11.9 years; range = 17–62) took part in the experiment. Only right-handed participants were included in the study. All of them reported normal or corrected-to-normal vision, and none of the participants suffered from colour blindness. The participants did not take part in Experiment 1, were naïve to the purpose of the study, and gave written informed consent (for minors, informed consent have been obtained by a parent) in accordance with the Declaration of Helsinki. The experimental protocol was approved by the Ethical Committee of Università Europea di Roma (approval number 3/2024). Data were collected in 2022.

Material

We included 3 different versions of the eTMT in Experiment 2: 1. Standard condition (ST), same as the paper-and-pencil condition (both Parts A and B), identical to the Standard version used in Exp. 1; 2. Disappearing-Disc (DD) condition, in which targets disappear as soon as the following one is reached. For example, once the participant has arrived at the circle with the number 2 inside, the circle with the number 1 disappears completely from the display, leaving an empty space (see Fig. 6); 3. Memory, Distractors, and Spatial positioning condition (MeDiSp), in which the line connecting the previous and the next target disappears as soon as the participant reaches the next target. Additionally, all item positions are reshuffled according to the manipulation of the variables Distance and Continuity (see Fig. 7). Specifically, half of the subsequent items were “near” (less than 6° 12’ of visual angle apart), and half were “far” (more than 9° 31’ of visual angle apart). The “continuity” or “discontinuity” of the next target was determined by the direction of the mouse path used to reach it, creating specific spatial relationships between the preceding and the following target, which we called cases. “Continuous” cases are those where the following target falls within a 45°-line segment from the previous and current targets; “Discontinuous” cases are those where the following target is located in a line segment of more than 45° from the preceding and current targets (see Figs. 7 and 8). The targets were systematically placed to include an equivalent number of near-continuous, far-continuous, near-discontinuous, and far-discontinuous occurrences, allowing us to create a balanced version of the TMT for these variables. Finally, in the MeDiSp condition, the two parts of TMT (A and B) feature the same identical positioning of the targets: the pattern of distances and positions used in creating Task A is identical to that used in Task B but flipped horizontally and vertically to avoid the possibility of recognizing it.

Fig. 6
figure 6

Example of the training phase of the Disappearing Discs condition in eTMT-A. Every time the participant reaches the next target the drawn line and the previous target disappear.

Fig. 7
figure 7

Example of the training phase of the MeDiSp condition in eTMT-A. Every time the participant reaches the next target the drawn line disappears, and all the items’ positions change with a specific criterion: the next target could be near and continuous; near and discontinuous; far and continuous; far and discontinuous.

Fig. 8
figure 8

Diagram of the spatial relationships between the different positions of the items in the MeDiSp condition.

This diagram is not what the participants see (since the numbers varied in each trial and no lines connecting the discs are visible), but it represents the structure we used to design the condition: namely the position in which each target will appear when the previous target is reached. The angle on the disc indicates the angle within which the next target will be considered continuous (e.g. varying of less than 45° from the previous direction). When the gray angle does not cover the position of the next target, the condition will be Discontinuous. Conversely, when the position of the next target falls within the angle of the previous target, the condition will be Continuous (e.g. varying of more than 45° from the previous direction). Labels indicate how each connection is categorized: NC – Near Continuous; FC – Far Continuous; ND – Near Discontinuous; FD – Far Discontinuous. For example, between target 2 and 3, the connection is seen as continuous because the angle formed by the line connecting 2 and 3 is less than 45°. The link between 3 and 4 is considered discontinuous because the angle formed by the line connecting 3 and 4 exceeds 45° when compared to the trajectory needed to reach 3. This pattern has been flipped horizontally and vertically for part B to maintain identical spatial relationships without making them recognizable. The first segment/trial (from “1” to “2” in part A and “1” to “A” in part B) is not included in the analyses.

Procedure

After receiving on-screen instructions that required participants to connect the subsequent targets as quickly as possible, each participant performed the three conditions (ST, DD, and MeDiSp) in the two different task parts (A and B). The administration order of the conditions was randomized for each participant, but the presentation order of Part A and Part B was kept constant for all participants. In the eTMT-A, the participants’ task was to connect, using the mouse, 25 encircled numbers distributed on the screen from 1 to 25 in ascending order (1-2-3, etc.). For the eTMT-B, participants were asked to connect numbers and letters in alternating, ascending order (1-A; 2-B; 3-C, etc.).

Results

As in Experiment 1, a correlation analysis was conducted to determine whether age had any effect on the total time taken to complete the TMT tasks. We calculated the total time elapsed to complete the TMT tasks, i.e., the A and B parts of the Standard (SD), Disappearing-Discs (DD), and MeDiSp TMT versions and assessed whether these times correlated with the participants’ age. Again, we verified that age did not significantly influence the total time (r =.195, p =.070). Moreover, as part of the analysis, we calculated the z-scores associated with the total time elapsed to complete all tasks for each participant. This revealed that only one participant had a z-score exceeding 3 standard deviations with a z-score of 3.81. After this sanity check, we computed the mean RT elapsed to reach the targets for each of the six tasks, i.e., the three conditions of the TMT, implemented with the TMT parts “A” and “B.” Moreover, we calculated the mean RT for each of the four conditions present in the MeDiSp version (i.e., Near-Continuous, Near-Discontinuous; Far-Continuous, Far-Discontinuous), separately for TMT parts “A” and “B.” Two repeated ANOVAs were performed. The first ANOVA evaluated whether the mean RTs varied as a function of the two parts of the TMT (2 levels: “A” and “B”), and the TMT conditions (3 levels: ST, DD, MeDiSp). The second ANOVA analyzed the additional conditions within the MeDiSp version. Specifically, we tested whether RT performance varied as a function of the two parts of the TMT (2 levels: “A” and “B”), the distance of the next target (2 levels: Near and Far), and the continuity of the next target (2 levels: Continuous and Discontinuous). For both ANOVAs, in case of violation of the sphericity assumption (Mauchley’s sphericity test; p <.05), a Greenhouse-Geisser correction was performed. The results of the first ANOVA revealed a main effect of the TMT part [F(1, 86) = 96.31, p <.001, η2p = 0.528], indicating faster RT in the simpler TMT-A (M = 1791, SE = 73 ms) than in TMT-B (M = 2478, SE = 73 ms). Moreover, the ANOVA revealed a main effect of the TMT conditions [F(2, 172) = 347.75, p <.001, η2p = 0.802]. Post-hoc comparisons (Bonferroni corrected) showed significantly longer RT in the MeDiSp version (M = 3318, SE = 78 ms) compared to the ST version (M = 1754, mean difference = 1564 ms, 95% CI [1352, 1776], t(86) = 18.03, p <.001, d = 1.81), which in turn revealed significantly longer RTs than the DD version (M = 1333, SE = 78 ms, mean difference = 421 ms, 95% CI [232, 610], t(86) = 5.44, p <.001, d = 0.49). We also found a significant interaction between TMT parts and TMT conditions [F(1.7, 146.9) = 5.46, p =.008, η2p = 0.060]. Post-hoc comparisons revealed that the difference between TMT-B and TMT-A was significantly higher in the MeDiSp version (mean difference (Delta) = 872 ms, 95% CI [580, 1164], t(86) = 8.85, p <.001, d = 1.009), compared to ST (mean difference (Delta) = 706 ms, 95% CI [414, 998], t(86) = 7.167, p <.001, d = 0.817) and DD (mean difference (Delta) = 475 ms, 95% CI [183, 768], t(86) = 4.828, p <.001, d = 0.550) conditions. This result indicates that the more difficult condition (MeDiSp) actually has more impact on part B of the TMT, in which there is also the load resulting from the switch task. It seems that performance and the difference between the two parts of the TMT are modulated by the newly introduced manipulations: the impact of switch cost in TMT-B depends on the different conditions in which it occurs. The results indicate that our manipulations maximize the differences between parts A and B of the TMT (see Fig. 9). Furthermore, post-hoc comparisons detected significant interactions within the two different parts of the TMT, in both part A and B: The ST condition is significantly different from the MeDiSp condition (part A: mean difference = −1481 ms, 95% CI [−1775, −1186], t(86) = −14.871, p <.001, d = −1.714; part B: mean difference = −1647 ms, 95% CI [−1942, −1353], t(86) = −16.539, p <.001, d = −1.906) and from the DD condition (part A: mean difference = 306 ms, 95% CI [11, 600], t(86) = 3.069, p =.035, d = 0.354; part B: mean difference = 536 ms, 95% CI [241, 831], t(86) = 5.382, p <.001, d = 0.620). For completeness, we replicated the analyses after excluding the single potential outlier with a z-score of total RT equal to 3.81, and the results remained consistent. Specifically, the main effects of part [F(1, 85) = 93.13, p <.001, η2p = 0.523], the TMT condition [F(2, 170) = 354.04, p <.001, η2p = 0.806], and the interaction between TMT part and condition [F(1.7, 145.47) = 6.09, p =.005, η2p = 0.067] were confirmed.

Fig. 9
figure 9

Mean reaction time of the interaction between Condition (3 levels) TMT parts (A and B) and Delta values (B-A).

Post-hoc comparisons revealed that the difference between TMT-B and TMT-A was significantly higher in the MeDiSp version (Delta = 872 ms, p <.001) of the TMT, as compared to the ST (Delta = 706 ms, p <.001) and to DD (Delta = 475 ms, p <.001) conditions. Moreover, in both part A and B the ST condition is significantly different from the MeDiSp condition (both ps < 0.001) and from the DD condition (p =.035, and p <.001, respectively for part A and B). Error bars represent standard error.

The results of the second ANOVA, focused on the MeDiSp version, confirmed the main effect of the TMT part [F(1, 86) = 73.18, p <.001, η2p = 0.460]: that is, faster RT in the simpler eTMT-A (M = 2882, SE = 67 ms) than in the eTMT-B (M = 3754, SE = 130 ms). Moreover, it revealed a main effect of Distance [F(1, 86) = 342.07, p <.001, η2p = 0.799] and Continuity [F(1, 86) = 6.4, p =.013, η2p = 0.069]: that is, faster RT for the close targets (M = 2419, SE = 96 ms) than for the distant targets (M = 4536, SE = 124 ms), and faster RT for the discontinuous targets (M = 3372, SE = 98 ms) than for the continuous targets (M = 3583, SE = 98 ms). However, this surprising result was partially clarified by the significant interaction between Distance and Continuity [F(1, 86) = 13.17, p <.001, η2p = 0.133]. Indeed, the post-hoc analyses revealed shorter RT for the discontinuous targets in the Near condition (M = 2167, SE = 115 ms) compared to the continuous targets in the Near condition (M = 2672, SE = 106 ms; mean difference = 505 ms, 95% CI [290, 720], t(86) = 4.68, p <.001, d = 0.50), while there was no difference between discontinuous (M = 4578, SE = 147 ms) and continuous targets (M = 4495, SE = 129 ms) in the Far condition (mean difference = 83 ms, 95% CI [−164, 330], t(86) = 0.67, p = 1.000, d = 0.07; see Fig. 10, and Table 1). Again, we replicated the analyses after excluding the single outlier confirming the previous results: i.e., the main effects of TMT part [F(1, 85) = 70.61, p <.001, η2p = 0.454], continuity [F(1, 85) = 6.22, p =.015, η2p = 0.068], distance [F(1, 85) = 337.22, p <.001, η2p = 0.799], and the interaction between continuity and distance [F(1, 85) = 12.88, p <.001, η2p = 0.132].

Table 1 Post-hoc analysis of the interaction between distance and continuity conditions. All values are expressed in milliseconds. Results with p <.001 are highlighted in bold.
Fig. 10
figure 10

Mean reaction time of the interaction between the variable Continuity (2 levels, Continuous and Discontinuous) and Distance (2 levels, Near and Far) in the MeDiSp version of eTMT. All comparisons are significant except for the difference between Far Continuous and Far Discontinuous. Error bars represent standard error.

Transparency and openness

We report how we determined our sample size, all manipulations and all measures in the study. All research materials are available upon request at the corresponding author. Data were analyzed using Jamovi, version 2.3.21. This study’s design and its analysis were not pre-registered. All data have been made publicly available at the Open Science Framework- OSF and can be accessed at https://osf.io/4mfzx.

Discussion of exp. 2

The results of this second experiment corroborate that Part B of the TMT is more challenging than Part A, confirming the results of Experiment 1 and previous studies. As emphasized in a study by Gaudino and collaborators56, several factors influence performance in part B of the TMT that differ from part A, such as different spatial relations between targets (in the version used in their study) and increased visual interference in part B. Their interpretation of these data suggests caution when attempting to pinpoint performance differences between the two parts, as it indicates that part B may not necessarily be a precise measure of higher cognitive processes. However, it is possible to interpret the difference between parts A and B in terms of cognitive load: in part B, task switching represents a significant additional cognitive load; therefore, in our study, participants might experience greater facilitation (in the DD condition) and greater interference (in the MeDiSp condition). It appears that in the MeDiSp condition, and specifically in part B, both interference and facilitation are maximized and amplified (see also Burnham44. The crucial result is the significant difference between our three conditions. The most difficult condition of all, as expected, is MeDiSp, in which every time a target is reached, the entire spatial configuration renews. It is the most difficult because it excludes possible memory influences59, and the absence of drawn lines makes it impossible to neglect already reached items: every trial takes place in a new field of distractors, which does not feature any sign that can easily exclude any of them from the search. In this case, the display set-size effect has the greatest impact of all the conditions, since every trial features the same number of distractors without any facilitation.

On the other hand, the DD condition appears to be the simplest of all. The comparison between the DD (Disappearing Discs) and ST (Standard) conditions in our study provides valuable insights into the impact of display set-size. In the ST condition, drawn lines between targets remain visible and, as we hypothesized, help to neglect previously visited locations. While this reduces search times by preventing participants from revisiting targets, the presence of these lines can still lead to some interference, as nullified targets are not completely removed from view. In contrast, the DD condition removes targets entirely once they are identified, effectively clearing the search space and minimizing visual clutter. This allows participants to focus solely on the remaining targets without distraction from previously located targets. The disappearance of targets in the DD condition highlights how reducing the display set-size can markedly enhance visual search efficiency. This result aligns with Treisman and Gelade’s research24 on the effect of distractor numbers on visual search times, emphasizing the practical implications of manipulating display set-size to improve task performance.

In examining several factors influencing visual search in the TMT framework, our investigation included variables such as target spacing, directional paths, and their integration with memory processes. In this regard, a core manipulation of Experiment 2 was the introduction of the MeDiSp condition, with which we aimed to control for distance and spatial arrangement between consecutive targets. Our results confirm previous literature about distance: when a successive target is far from the current one, it takes longer to be found than when it is closer52.

These aspects of Experiment 2 also invite a direct comparison with the Multiple Item Localization (MILO) task20, which shows that spatial dislocation disrupts sequential search46. In line with those results, we found that reshuffling imposed a strong cost on performance. At the same time, when previously selected items disappeared (akin to the MILO Vanish condition), performance was facilitated. Importantly, these findings extend the MILO evidence by showing that comparable effects emerge in a TMT-like framework, underscoring also the clinical relevance of such display dynamics.

Regarding the Continuous and Discontinuous conditions, our results show that when two subsequent targets are far apart, the direction in which the next target is placed does not seem to be influential: it makes no difference whether it is placed in the same general direction (within a variation of 45° in each direction) as the previous segment or not. However, when they are close, we found that participants are faster when the successive target is in a discontinuous direction. In other words, results show that participants appear to look first in a different direction than the one they just followed, as if they (implicitly) assume that the next target should be in a different direction. After reaching a target, they appear to search for the next one nearby, but not in the same direction from which they came (see Fig. 11). When participants are not able to find the next target nearby, they appear to broaden their search field in all directions (see Fig. 11).

Fig. 11
figure 11

This diagram shows how the visual search strategy may proceed, according to our results. After finding a target, the search for the next one will start within the area labelled “First” (e.g. near-discontinuous), followed by the area labelled “Second” (e.g. near-continuous), and finally in the far areas labelled “Third” (e.g. Far both continuous and discontinuous). Please notice that this image is not to scale, and the areas are shown for illustrative purposes only.

It appears that participants tend to shift their attention away from the direction or area they have just explored. This kind of pattern is in line with earlier findings by Hilchey and colleagues76, who observed, through eye-tracking, that eye movements tend to avoid returning to the area of the previous target (see also Fischer & Whitney61, for contrasting results). This tendency may relate to the well-known phenomenon of Inhibition of Return (IOR), where items that have already been attended to are inhibited, prompting attention to move elsewhere. Although IOR is typically described in terms of spatial locations, it is plausible that a similar mechanism could apply to the direction of gaze: once someone has scanned in a particular direction, they may be less likely to continue along that path. This idea is also supported by evidence from other studies, such as75. Indeed, this possible tendency to avoid the previously followed direction may reflect a directional component of inhibition of return. While IOR is typically described as a mechanism that inhibits returning attention to a previously attended location, evidence suggests that its spatial distribution is not strictly localized. For example, Bennett and Pratt77 found that the inhibitory effect is strongest at moderate distances from the previously attended target location, and weaker both at very short and very long distances. This suggests that IOR operates along spatial gradients rather than being limited to a specific point. Although their study did not directly address directionality, such a distributed pattern of inhibition may also influence the direction of attentional shifts. In our task, this could help explain why participants were slower to identify a nearby target placed in the same direction as the previous one, possibly because attention is temporarily biased away from continuing along the same search path.

More recent work has gone a step further, showing that IOR may not apply equally across all directions. Soballa et al.78, for example, found that downward shifts of attention are less affected by IOR than upward or horizontal ones, suggesting the presence of directional asymmetries in how attention is inhibited. In a similar manner, Britt & Sun79 reported that attention shifts downward tend to be faster, potentially due to routine visuomotor behaviors or ecological factors related to interacting more frequently with the lower visual field.

Although our study did not manipulate vertical or lateral directions explicitly, these findings still offer a useful perspective for interpreting our results. In particular, the tendency we observed to move away from the previously followed direction especially when targets were spatially close might not only reflect a general IOR effect, but one that interacts with directional preferences. It is possible that in the TMT display, certain directions such as downward or toward the center of the screen are more efficient or more likely to be selected, and that these biases could influence how participants respond to different spatial configurations. This raises interesting possibilities for future research. Directional biases in IOR could play a more important role in visual search than previously assumed, especially in tasks that involve frequent reorientation within a constrained space like the TMT. Eye-tracking could be particularly informative in exploring this, as it would allow to directly observe whether certain directions are more or less likely to be revisited, and how this affects performance over time. Investigating these patterns further could offer a clearer view of how attention is allocated in spatially dynamic tasks.

Altogether, these considerations highlight the importance of attentional dynamics not just in terms of avoiding previously visited locations, but also in how directionality itself might shape participants’ visual search. This adds nuance to our interpretation of search behavior in tasks like the TMT, where spatial structure and attention are tightly interlaced.

In summary, the findings suggest that participants in the TMT tend to start the search for the next target near the one they just reached, but not in the same direction. This suggests a visual search strategy that focuses initially on nearby targets but not along a continuous path with the direction they just came from. It also suggests the possibility of a radial search pattern, exploring directions different from those just traversed (see also36. This search pattern may be due to the confined space represented by the screen: when reaching the edge of the field of search, it may be obvious to look for targets closer to the center of the field, where the likelihood of finding them is higher. This hypothesis about the specific search pattern could be tested by research focusing on tracking the participants’ eye movements while they perform the MeDiSp condition (e.g. checking if the tendency to search in a discontinuous way is due to the vicinity of the edge of the field).

General discussion

In this study, we examined how visual search processes contribute to performance in a digital adaptation of the Trail Making Test (eTMT). Across two experiments, we found that manipulations such as altering the visual status of visited targets (Experiment 1) and implementing either their disappearance or spatial reshuffling (Experiment 2) significantly influenced search efficiency, highlighting the role of display dynamics in sequential linking tasks. Our results can be interpreted in the context of related paradigms such as hybrid foraging19 and the Multiple Item Localization (MILO) task [20; 46]. These paradigms share important features with the TMT, including sequential selection and the need to cope with dynamic changes in the display. Importantly, the MILO task is available in digital form (https://maltacogsci.org/MILO/), and studies using it have demonstrated that distractor removal facilitates sequential search, whereas reshuffling disrupts it20,46. Building on this literature, our contribution lies in testing these manipulations within a TMT-like framework that preserves the clinical focus of the task. This approach allows us to examine how factors such as salience, distractor disappearance, and spatial instability modulate search efficiency under conditions directly comparable to the TMT. Taken together, these findings suggest that the eTMT can provide a useful framework for studying how visual guidance and search strategies adapt to both sequential constraints and changing display conditions.

The conditions tested in Experiment 1 shed light on the role of the drawn line in helping participants ‘cancel out’ already reached targets. In the No Line condition where the trajectory disappears, participants were placed in a context where the visual field remained cluttered with distractors throughout the task. As expected, this made the search process more difficult. By contrast, both the Black Discs and Standard conditions allowed to reduce the number of visible targets during the task either by turning selected items black or by leaving a visible trace of the path taken, and this appeared to support performance.

These results highlight the importance of distractor management during visual search. When distractors remain constantly present and cannot be visually excluded, reaction times increase. This pattern was especially evident in Part B of the task, which is already more demanding due to the requirement to alternate between numbers and letters44.

Altogether, the findings suggest that in both the original paper-and-pencil version and digital versions of the TMT, the persistence of the drawn line serves as a visual aid that simplifies the search by progressively narrowing down the field of potential targets.

Another aspect that may be relevant, though not directly addressed in our design, concerns spatial attention biases, specifically the phenomenon of pseudoneglect. A substantial body of research, particularly from line bisection and landmark tasks, has shown that neurologically healthy individuals tend to exhibit a leftward bias80,81,82. This effect, commonly attributed to right hemisphere dominance for spatial attention, reflects a natural asymmetry in how visual space is explored and processed.

In the context of the TMT, this raises an interesting possibility. While our study did not specifically control for the position of the first target or the direction of search initiation, one might expect that participants, especially in conditions where no visual guide is present (e.g., the No-Line), would show a tendency to begin their search from the left side of the display. If this were the case, trials starting on the left could potentially result in faster response times, not because of task-specific strategies, but due to this underlying attentional asymmetry. Recent neurophysiological studies provide further support for this idea. For instance, Benwell et al.83 demonstrated that leftward biases in line bisection are associated with greater activation in right temporo-parietal regions, reinforcing the idea that pseudoneglect has a neural basis that could also affect performance in visual search contexts.

While the role of spatial biases may influence how participants initiate the task, we also considered whether performance changed as the task progressed. To further investigate this idea, we examined whether there were systematic differences in performance between the early and late parts of each trial comparing reaction times for the first twelve targets with those for the last twelve. If reducing the number of active items truly facilitates search, we would expect to see faster responses in the latter part of the task, where fewer targets remain on screen. However, we found no significant interaction in the comparison between our three conditions (Standard, Black Discs, and No-Line) and the average reaction times between early and late targets. The analysis did reveal, though, a significant interaction between the two parts of the TMT (A and B) and the average reaction times between early and late targets. Interestingly, contrasting patterns emerged: participants demonstrated faster performance in late targets of TMT A, but slower responses were observed in late targets of TMT B. This discrepancy notably stems from the presence of the switching task in TMT B. We interpreted this result as the effect of an increase in task complexity: maintaining a running count of numbers and letters becomes gradually more challenging, leading to slower performances. Moreover, as shown by Klahr and collaborators58, it is possible to hypothesize that it is easier to process stimuli near the beginning of the alphabetical order rather than stimuli near the end of the alphabetical order. Lovelace and Spence84 indeed found an irregular increase in reaction time as a function of alphabetical position. Specifically, they observed an increase in reaction times from the first portion of the alphabet compared to the last portion of the alphabet (RTs for the first six letters of the alphabet averaged 890 ms, while for the last six letters, they averaged 1.180 ms).

We thus questioned why, despite the significant beneficial impact of “cancelling” reached items (either by turning them black or passing over them with the line), compared to the condition where no items can be “cancelled” (the No-Line condition), the gradual reduction of active distractors did not result in a gradual reduction in reaction times as the task progressed over time. Therefore, we conducted an additional experiment to assess whether other variables, explored in literature24,36,59,62,70,72 but not specifically applied to the TMT such as the distance between successive targets and the degree of continuity in the direction taken to move from one target to another could affect reaction times.

In Experiment 2 we introduced two additional conditions alongside the Standard one: the MeDiSp condition, controlling for memory, number of distractors, and spatial positions, introduced systematic variations in distances and continuity between targets, while the Disappearing Discs condition was designed to check if the complete disappearance of reached targets would enhance performance even more.

The results of Experiment 2 showed a critical increase in reaction times in the MeDiSp version and a significant decrease in the Disappearing Discs version. Moreover, as in part B participants are burdened with a task that becomes gradually more difficult, they benefit more from conducting a serial visual search in a field of distractors that gradually empties (DD), and they may experience greater interference from a field of distractors that remains the same, offering no mnemonic foothold or spatial relationships that allow them to neglect already analyzed targets (MeDiSp).

If in Experiment 1 the Black Discs (BD) condition decreased the impact of the distractors on performance, resulting in shorter reaction times, in Experiment 2 the effect turns out to be decisive, making the Disappearing Discs (DD) the simplest condition of all. These results indicate that the only way to ensure that distractors and display set-size effects no longer affect the task is to make items disappear completely. From Experiment 2 it is possible to appreciate that even if weakened—as demonstrated in Experiment 1—the distractors continue to slow down visual search to some degree, acting as exogenous stimuli attracting attention, as also demonstrated by studies on the salient characteristics of stimuli in visual search37. Also, Kent and collaborators85 demonstrated a processing speed advantage for displays with no distractors compared to displays containing distractors.

A detrimental effect of distractors is not always found in the literature. For example, Carrasco and McElree’s86 study suggests that the presence of distractors does not slow down reaction time, indicating that participants were capable of effectively filtering out irrelevant distractors. Our findings help integrate these contrasting results. In fact, the two conditions of Black Discs (Experiment 1) and Disappearing Discs (Experiment 2) indicate that differentiated performance can be achieved by modulating the degree of distractor interference (present but “weakened” in the Black Discs, absent in the Disappearing Discs). Indeed, the results of the Standard and Black Discs conditions in Experiment 1 confirm that the presence of distractors does not decrease reaction time: we obtained better reaction times compared to the condition in which the drawn lines between subsequent targets were eliminated (No-Line). This finding aligns with Carrasco and McElree’s86 study.

On the other hand, the results of the Disappearing Discs condition in Experiment 2 also confirm the findings by Kent and colleagues85, as this condition, in which distractors (previously analyzed targets) are removed, appears to be the simplest of all. Thus, we confirm the effect of display set-size, showing its full impact in Experiment 2. Results from Experiment 2 show that the number of distractors present in a visual serial search task, as in the TMT, can induce different levels of difficulty in completing the test. These effects are reflected in the relationship between the three different conditions and the two parts of the TMT (A and B).

In Experiment 2, we also discovered an intriguing pattern in the visual search strategy used to locate the next target. According to our results, after identifying a target, participants began their search for the next target by starting with the nearest item, but in a different direction from the one they had just followed. They then proceeded to search in nearby continuous directions and, finally, expanded their search to farther areas. Even though this strategy is quite interesting, our data are not sufficient to confirm it; however, it opens the field to speculation and new studies. For example, this spatial pattern of visual search could support the speculation about IOR, applied not only to the spatial position but also to the direction of movement. Indeed, in our case, participants searched for the next target primarily near where they currently were, even if in a discontinuous direction different from the one they were previously proceeding in.

This speculation could be confirmed by a study using eye-tracking methodology and could bring new insights into this phenomenon. Furthermore, it would be interesting to verify whether a similar spatial pattern of visual search applies to other visual search tasks not strictly related to the TMT. Conducting such studies could help determine if this pattern is a general characteristic of visual search behavior or if it is specific to certain tasks. Additionally, comparing these patterns across different tasks could provide a deeper understanding of the underlying mechanisms of visual attention and search strategies.

This broader investigation could potentially lead to improved methodologies in cognitive and behavioral research, contributing to the development of more effective tools for assessing and training visual search skills in various contexts.

Overall, the performance difference between the two parts of the TMT is accurately modulated by the manipulations introduced: a different impact of the switch cost task present in part B emerges depending on the different conditions in which the distractors are either “cancelled” by the line (Standard condition), no longer present (Disappearing Discs condition, and also the Black Discs condition of Experiment 1), or always active and relocated (MeDiSp condition). The results, therefore, indicate that our manipulations and their interaction with the two parts of the TMT assess different degrees of performance, while also elucidating the effects of task switching in Part B.

Experiment 2 results further validate and fill the gap identified in previous research52 regarding the impact of target distance on performance: greater distance between targets correlates with longer search times. Moreover, the investigation into the Continuous and Discontinuous conditions reveals that when targets are distant, the direction of subsequent target placement does not significantly influence search behavior. These results have been interpreted here as indicating a specific tendency in visual search (see Fig. 11). This tendency regarding the specific search pattern could be tested in the future by eye-tracking research, using the MeDiSp condition.

Based on the results obtained, it is possible to hypothesize a change in the standard administration of the TMT. This change could be based on a direct comparison between the versions that have proven to be the least and the most challenging in this study. The MeDiSp version of the eTMT is well-balanced in terms of distance and continuity, ensuring that visual search remains consistently challenging from start to finish (e.g., a consistent search among 25 items, repeated across 24 identical trials). This version could be administered along with the simpler Disappearing Discs version, providing insights to identify specific issues in the patient at hand.

More specifically, we could expect the Disappearing Discs version to show a difference between parts A and B—if it does not, the problems could be severe, possibly even motor-related. The more challenging version (MeDiSp), however, allows for the assessment of more nuanced aspects, such as the quality of visual search and an “expected” exploration pattern, which, if not met, could be symptomatic of some underlying cognitive deficit (executive control, task-switching ability, serial processing skills, etc.). Administering these two versions of the eTMT could offer additional data in the diagnostic field, helping to understand the impact of increasing difficulty on different injuries or pathologies, thus adding data that can be compared.

Overall, a digitally redesigned version of the TMT has the potential to reform cognitive assessment practices by providing clinicians with detailed, reliable data on individual patient performance, trial by trial, ultimately leading to more effective treatment planning and intervention strategies. Further research could focus on this possibility by testing the actual increased diagnostic potential of such measurements.