Introduction

Contemporary challenges like the pandemic, climate change, and rapid societal shifts result in significant environmental changes, often fostering problematic habits and maladaptive behaviors1. In this context, the detailed comprehension of the neurocognitive dynamics underlying habit formation and the way we modify them has become a research question of the utmost importance. Traditionally, habits have been conceptualized as automatic links between stimuli and responses2 indifferent to the outcome value of the response, which distinguishes them from goal-directed behaviors in non-human animal studies3. In the context of humans, habits take on a more intricate nature, defined by a set of behavioral attributes4,5,6. They are acquired gradually through associative learning processes over extended periods of practice, frequently without conscious awareness7. Once established, these habits can be executed with minimal thought or attention, essentially operating automatically8. Numerous human studies have focused on the environmental contingencies fostering successful habit updating or rewiring9,10,11,12. These studies commonly emphasize the modification of context cues as a primary strategy for rewiring habitual responses. Building upon the premise that habit change generally involves the strategic adjustment of environmental contingencies, we turn our attention to the neuropsychological processes that shape habit formation and modification. In this article, we focus on uncovering the internal cognitive mechanisms that govern habit formation and modification. To achieve this, we examine how interindividual differences in cognitive profiles influence habit change through the lens of a probabilistic learning task.

Statistical learning (SL) stands out as a fundamental element of the intricate process of habit formation13,14,15. SL refers to the cognitive function that allows the implicit extraction of probabilistic regularities from the environment, even in the absence of explicit intention, feedback, or rewards16,17,18. This cognitive mechanism plays a pivotal role in predictive processing and contributes significantly to the acquisition of diverse skills, including language19,20,21motor22,23, musical24,25,26, and social abilities27,28,29. In the context of habit formation, SL fortifies the probabilistic connections between environmental cues and the invoked associated responses that constitute habitual behaviors30,31, and facilitates the development of predictions of upcoming events, leading to shorter reaction times for expected outcomes26.

For instance, consider the habitual actions of a French driver. Through SL, this driver unconsciously forms associations between the likely locations of relevant stimuli at intersections and the corresponding direction to turn their gaze. Consequently, it becomes second nature for them to instinctively look left at a crossing. Conversely, a British driver, also influenced by learned probabilistic associations, automatically looks right at a crossing32. A potential challenge arises when a French driver, accustomed to the habitual association of relevant stimuli emerging from the left at intersections, travels to England, where relevant stimuli are more likely to appear from the right. To adapt effectively, the French driver must update or rewire the previously acquired probabilistic representations, which might prove difficult33. Elucidating the neurocognitive processes involved in adapting to new or altered environments is crucial to understanding how probabilistic representations are updated.

Updating implicit probabilistic representations poses several challenges. Firstly, because these representations are largely unconscious, individuals may lack explicit awareness of what needs to be modified34, which can hinder rewiring. Secondly, SL has been characterized as a resilient cognitive mechanism, leading to representations that are resistant to forgetting35. While this robustness is beneficial for maintaining learned associations, it can be a barrier to modifying existing representations. Thirdly, this robustness may cause proactive interference36, where existing strong associations interfere with the encoding of novel probabilistic relationships, creating a cognitive obstacle to effectively adapt to environmental changes.

Previous research on rewiring implicit probabilistic information investigated whether providing explicit instructions could enhance the effectiveness of the rewiring process14. While the results indicated that explicit instructions did improve rewiring, they also uncovered that the updating process could occur implicitly as well. Participants were able to adapt and rewire their implicit probabilistic representations without explicit guidance, underscoring the flexibility and adaptability of the underlying cognitive processes.

The examination of rewiring implicit probabilistic representations has also been approached through the lens of dual-process neuro-computational models of learning37. These models differentiate between model-free and model-based algorithms, with the former relying solely on recent outcomes and the latter building and utilizing a model of the environment to guide choices more flexibly, but at a higher computational cost38,39. The brain is postulated to alternate between these two learning systems, both within the completion of individual tasks40,41,42 and over the lifespan43,44,45. In their investigation, Kurdi and colleagues employed a stimulus reevaluation protocol based on the Implicit Association Test45 to examine the potential for updating implicit and explicit representations via model-based and model-free learning mechanisms. Their findings showed that while the updating of explicit probabilistic representations was influenced by both model-based and model-free processes, implicit probabilistic representations were updated exclusively through model-free processes.

Cognitive control functions, also known as executive functions (EF), may be a crucial tool for effectively rewiring probabilistic representations. EF is particularly important in non-habitual situations and includes cognitive functions such as attentional control, cognitive flexibility, cognitive inhibition, and working memory updating46,47,48. These functions are indispensable for model-based processes49 and for the flexible regulation of behavior46,50, making them pivotal in habit change51,52. For instance, a study demonstrated that cognitive remediation therapy aimed at enhancing EF could effectively disrupt bad habits associated with obesity52. Investigations into the neural underpinnings of EF consistently underscore the significant involvement of the prefrontal cortex (PFC)53,54,55,56,57. This reliance on the PFC is further evident at an interindividual level, where superior EF is associated with greater prefrontal activity58, and increased functional connectivity between the PFC and other brain regions59. Our focus here centers on examining whether and how interindividual differences in EF may influence the rewiring of implicit probabilistic representations.

While the association between EF and habit change may suggest a positive correlation, the relationship between EF and the cognitive processes underlying habit formation is more nuanced. Greater PFC activity has been consistently shown to hinder the acquisition of implicit probabilistic information. Disruption of PFC activity through interventions such as transcranial magnetic stimulation60, hypnosis61, cognitive fatigue induction62, or dual-tasking63,64 has resulted in improved acquisition of probabilistic representations. Neuroimaging studies have revealed that decreased PFC activity65 and decreased PFC functional connectivity66is also associated with enhanced SL. Similarly, at a cognitive level, studies have shown a negative relationship between EF and SL. For example, one study demonstrated that alcohol-dependent patients with weaker EF exhibited enhanced SL capacity67. Similar results were a study that offered an internal replication with healthy participants68. These studies suggest that while initial SL may rely on model-free learning algorithms, EF may facilitate model-based learning algorithms and align with the competition hypothesis between model-based and model-free learning. According to this hypothesis, these systems alternate in the brain during task completion, with greater reliance on one reducing reliance on the other39,42,69. The PFC plays a crucial role in mediating between model-based and model-free systems, with the latter serving as the default but able to be overridden by cognitive control mechanisms implemented by the PFC41,70. However, it is important to note that these studies mainly focus on the relationship between cognitive control and SL during initial knowledge acquisition. The relationship between cognitive control and the updating of probabilistic knowledge remains unclear.

To our knowledge, only one study has investigated the influence of EF on the rewiring of probabilistic representations to date15. This study specifically examined the impact of cognitive control, as manifested by active inhibition, on rewiring. The findings revealed that participants, while actively attempting to inhibit unwanted behaviors, inadvertently impeded their ability to acquire new probabilistic knowledge. Simultaneously, they reinforced previously acquired knowledge, suggesting that cognitive control may hinder the rewiring process. However, this study did not examine how individual differences in EF capacities affect rewiring. Moreover, cognitive control functions extend beyond inhibitory control. Therefore, there remains a critical need for further investigation into how various EF interact with the rewiring of implicit probabilistic representations. Understanding these interactions is crucial for gaining insight into the neuropsychological dynamics underlying habit change.

In this study, our objective was to investigate the relationship between EF and the updating of implicit probabilistic knowledge. We aimed to characterize these cognitive processes at the interindividual level using a four-session experimental protocol comprised of empirically validated tasks and measures of EF and SL (see Methods section). In the first session, participants underwent a learning phase in which they acquired implicit probabilistic representations in a visuo-motor probabilistic sequence learning task known as the Alternating Serial Reaction Time (ASRT) task. In the second session, participants completed a rewiring phase, where structural changes to the ASRT task sequence required them to update or rewire their initially acquired probabilistic representations. On the third day, during the retrieval phase, participants were tested on both their initial and rewired knowledge. The fourth session was dedicated to assessing participants’ EF capacities using a battery of validated and reliable neuropsychological tasks targeting various aspects of cognitive control. Drawing upon the framework of competitive neurocognitive systems and existing literature, we hypothesized a negative correlation between EF capacities and the rewiring of implicit probabilistic representations.

Methods

Participants

Seventy-nine healthy young adults were recruited through online advertisement with eligibility based on the following criteria: participants had to be right-handed, aged under 35 years, and have no or limited musical training (less than 10 years of practice). Participants declared having no active neurological or psychiatric conditions, and not taking any psychoactive medication. Of the 79 individuals initially recruited, three did not complete the four-session experiment and were excluded from the analysis. To accurately assess the relationship between EF and rewiring performance, participants had to demonstrate acquisition of implicit probabilistic representations that could be rewired (see Statistical analysis section for the selection criteria). Of the 76 remaining participants, 17 did not exhibit SL during the Learning Phase. This proportion of non-learners is consistent with previous studies examining inter-individual differences in the ASRT task67,68,71,72, addressing concerns about task sensitivity in inter-individual SL studies73. Thus, the data from the remaining 59 participants (32 females, M age = 22.51 years; SD age = 2.92 years; M education = 15.34 years; SD education = 1.64 years) were included in the analyses. Exploratory analyses confirmed that within our sample of young adults, there were no significant correlations between age and our primary learning or EF measures, nor were there significant differences between genders. Thus, age and gender were not included as covariates in subsequent analyses. Further details on participant characteristics can be found in Supplementary Table 1. The relevant institutional review board (i.e., the “Comité de Protection des Personnes, CPP Est I” ID: RCB 2019-A02510-57) gave ethical approval for the study. The authors confirm that all research was performed in accordance with relevant guidelines and in accordance with the Declaration of Helsinki. All participants provided signed informed consent agreements and received a 200 euros financial compensation for their participation in the four sessions of the experiment.

Tasks

Measure of SL and rewiring: the alternating serial reaction time (ASRT) task

To evaluate the acquisition and rewiring of implicit probabilistic information, we utilized a visuomotor SL task that has been empirically validated71and previously established as reliable72, known as the Alternating Serial Reaction Time (ASRT) task74. Employing a modified version of the ASRT task (Fig. 1A), our study adhered to a structured three-session paradigm14, designed to investigate the phases of acquisition, rewiring, and consolidation of implicit probabilistic information.

Session 1: learning phase

During the Learning Phase, participants performed Sequence A of the ASRT task. In each trial, a yellow arrow appeared at the center of the screen for 200 ms, pointing in one of four possible directions (left, up, down, or right). This was followed by a fixation cross displayed for 500 ms. Participants were instructed to respond as quickly as possible by pressing the button corresponding to the arrow’s direction on a Cedrus RB-530 response box. Finger placement was controlled as follows: the left index finger for the up button, the right thumb for the down button, the right index finger for the right button, and the left thumb for the left button. Correct responses were followed by another 750 ms display of the fixation cross, while no response or incorrect responses were met with a 500 ms display of an exclamation mark or an “X,” respectively, followed by a 250 ms fixation cross. Each block consisted of 85 stimuli, with the initial five presented randomly for practice, followed by 10 repetitions of the eight-element alternating sequence. The Learning Phase comprised 25 blocks of the ASRT task.

Unbeknownst to participants, the stimuli followed a structured sequence alternating between predefined pattern elements (P) and random elements (R) (e.g., 2-R-4-R-3-R-1-R, where the numbers represent the direction of the yellow arrows: 1 for left, 2 for up, 3 for down, 4 for right, and “R” indicating a randomly chosen direction out of the four possibilities). This structure resulted in certain triplets (runs of three trials) having a higher probability of occurrence than others (Fig. 1B). For instance, in the 2-R-4-R-3-R-1-R sequence, triplets like (2-3-4) or (3-4-1) were more likely to occur than triplets like (2-3-2) or (4-3-1) since the former triplets could appear in both P-R-P or in R-P-R structures whereas the latter triplets could only appear in R-P-R structures Ü(Fig. 1C). Consequently, the former triplets were five times more likely to occur than the latter, hence they were categorized as high-probability (H) and low-probability (L) triplets, respectively. Previous studies using the ASRT task consistently demonstrate that, with practice, participants respond more quickly to the last element of high-probability triplets compared to low-probability triplets, indicating SL35,44,75. Importantly, participants typically do not develop explicit knowledge of the sequence structure despite this learning effect75,76,77. Thus, learning in the task is assessed by calculating the reaction time difference between the last elements of high-probability and low-probability triplets, and this learning measure reflects implicit SL.

Fig. 1
figure 1

ASRT task structure. (A) ASRT task design. The presentation stimuli (yellow arrows, indicating one of the four cardinal directions), followed an eight-element sequence alternating between pattern (P) and random (R) stimuli. This sequence was presented 10 times per block. Here we illustrate an example of a 2-R-4-R-3-R-1-R sequence. (B) Triplet composition. Each stimuli position can be coded with a number. Here we have 1 = left, 2 = up, 3 = down, 4 = right. From the alternating structure of the sequence between P and R stimuli, certain triplets (sequences of three consecutive stimuli) have a higher probability of occurrence than others. For example, in the 2-R-4-R-3-R-1-R sequence, the triplet 3-2-1 has a higher probability of occurrence since it can be formed in P-R-P and R-P-R structures. The triplet 3-4-2 has a lower probability of occurrence since it can only appear in R-P-R structures. High-probability triplets could end in either a pattern or random element, while low-probability triplets always concluded with a random element. High- and low-probability triplets are denoted in green and yellow, respectively. (C) Triplet probability. High-probability triplets account for 62.5% of all trials and low-probability triplets account for the rest. Notably, for each high-probability triplet (e.g., 3-4-1), there are low-probability triplets with three different last elements (e.g., 3-4-2, 3-4-3 and 3-4-4). This structure overall results in each high-probability triplet being five times more likely to occur than the low-probability triplets.

Session 2: rewiring phase

During the Rewiring Phase, participants completed 25 blocks of Sequence B of the ASRT task, which involved structural modifications to the original sequence of the Learning Phase (Fig. 2A). Specifically, the last two pattern elements (P) of the original eight-element sequence were inverted. For example, if Sequence A was structured as 2-R-4-R-3-R-1-R, Sequence B became 2-R-4-R-1-R-3-R. This change in sequence structure affected the occurrence probability of certain triplets (Fig. 2B). Specifically, 75% of triplets classified as high-probability during the Learning Phase became low-probability in the Rewiring Phase (HL, with the first letter denoting the triplet probability in Sequence A and the second letter referring to the probability in Sequence B). Conversely, new high-probability triplets (LH) emerged during the Rewiring Phase, replacing those that became low-probability. Meanwhile, the probability of other triplets remained constant, either maintaining a low probability (LL) or a high probability (HH) in both phases.

The LH triplets were crucial for assessing the acquisition of new knowledge during the Rewiring Phase. As these triplets transitioned from low- to high-probability in the Rewiring Phase, any associated knowledge could be considered acquired during this session. The LL triplets, on the other hand, served as a baseline for controlling general practice effects. Thus, the reaction time difference between LH and LL triplets was used as a key measure of participants’ rewiring ability in the second session15.

Fig. 2
figure 2

Experimental design and sequence changes. (A) Experimental design. The experiment comprised four sessions: the first three were held 24 h apart, and the fourth was independent. Sessions one to three included Learning, Rewiring, and Retrieval phases. In the Learning Phase, participants completed 25 blocks of Sequence A of the Alternating Serial Reaction Time (ASRT) task to acquire initial probabilistic representations. During the Rewiring Phase, structural changes introduced Sequence B, and participants updated their representations over 25 ASRT blocks. The Retrieval Phase involved alternating between Sequences A and B every five blocks for 30 blocks, probing knowledge of both sequences. The starting order of the sequences in the Retrieval Phase was counterbalanced across participants (half started with Sequence A and the other half started with Sequence B). In the figure, blue, red, and half-colored rectangles represent a bin (five blocks) of the ASRT task. The fourth session included EF assessments with five tasks: Go/No-Go, Berg Card Sorting Test (BCST), Counting Span, Attention Network Test (ANT), and three Verbal Fluency tasks (action, semantic, and lexical fluency). (B) Example of structural changes in the ASRT task. The figure shows the stimulus order for Sequences A and B, coded as numbers, with predetermined stimuli in color alternating with random stimuli (R in black). The alternating structure resulted in high- and low-probability triplets. The Rewiring Phase modified Sequence A into Sequence B, altering triplet probabilities. Specifically, 75% of initially high-probability triplets became low-probability (HL trials, gray squares), replaced by new high-probability triplets that were initially low-probability in Sequence A (LH trials, gray ovals). Other triplets maintained their original probability, being either low-probability (LL trials, white ovals) or high-probability (HH trials, white squares) in both phases.

Session 3: retrieval phase

During the Retrieval Phase, participants completed 30 blocks of the ASRT task. Unbeknownst to them, they were tested on both Sequence A and Sequence B. To ensure balanced testing, half of the participants started with Sequence A followed by Sequence B, while the other half started with Sequence B followed by Sequence A. Participants completed five blocks of one sequence before switching to the other sequence, and this alternation continued until all participants completed 15 blocks of each sequence (i.e., 30 blocks in total). Knowledge of Sequence A was assessed by computing the difference between LL and HL triplets, whereas knowledge of Sequence B was evaluated by comparing LL and LH triplets15.

Measures of executive functions (Session 4)

The assessment of EF was guided by the Unity and Diversity model of EF, which encompasses inhibition, updating, and shifting processes47,48. Additionally, we examined executive control of attention, working memory, and verbal fluency, which are also prefrontal-dependent functions59,78,79,80,81,82. To operationalize these constructs, we employed a battery of well-established and reliable cognitive tests, which included the Attentional Network Test, the Berg Card Sorting Test, the Counting Span Task, the Go/No-Go Task, and three verbal fluency tasks. These tests were selected based on their well-documented validity and reliability in previous research17,83,84,85,86,87, ensuring robust measurement of the targeted cognitive functions.

Attentional network test (ANT)

The ANT assessed three distinct attention networks: the alerting, orienting, and executive networks88. Participants had to identify whether a central arrow, embedded in an array of five arrows, pointed left or right. The array could appear above or below a fixation cross and might be preceded by a spatial cue indicating the subsequent location. Additionally, in half of trials, the central arrow was congruent with the other arrows (pointing in the same direction), and in the other half it could be incongruent with the other arrows (pointing in the opposite direction).

To compute network scores, we adhered to the standard methodology88. The alerting component was derived by subtracting the mean reaction time (RT) of the central cue conditions from the mean RT of the no-cue conditions for each participant. The orienting component was computed by subtracting the mean RT of the spatial cue conditions from the mean RT of the central cue conditions. The executive component of attention was determined by subtracting the mean RT of all congruent conditions from the mean RT of all incongruent conditions. In this task, higher scores across the alerting, orienting, and executive domains indicated enhanced performance in the respective aspects of attention. In simpler terms, elevated scores suggest better attentional abilities in these three facets of attentional processing.

Berg card sorting task (BCST)

We assessed set shifting or cognitive flexibility using the computerized version of the Berg Card Sorting Test89 with 64 cards (BCST.64), available in the Psychology Experiment Building Language (PEBL) software90. In this test, a set of four cards with varying characteristics—color, shape, and number of items—was displayed at the top of the screen. Participants were instructed to match new cards to those at the top based on one of the three characteristics, without knowing the current rule but received feedback after each attempt. The rule could change during the task. Cognitive flexibility was measured through perseverative errors, which indicated failure to adapt quickly after a rule change. In essence, these errors provide insights into the participant’s ability to adjust their cognitive approach when the task requirements change.

Counting span (CSPAN) task

To assess updating or working memory capacity, we used the CSPAN task91. In this task, various shapes, including blue circles, blue squares, and yellow circles, were presented on a computer screen. Participants were instructed to audibly count and remember the number of blue circles (targets) among other shapes (distractors) in a series of images. Each image consisted of three to nine blue circles, one to nine blue squares, and one to five yellow circles. After each trial, participants verbally reported the total number of targets in the image. If the count was accurate, the experimenter proceeded to the next trial. A recall cue at the end of a set prompted participants to recall the number of targets for each image in their order of presentation. If the recall was correct, a new set with an additional image commenced, up to maximum six images in a set. If a participant made an error in recall, the task was halted, and a new run, starting with a set of two images, commenced. Each participant completed three runs of the task. Memory span was computed as the mean of the highest set sizes that the participant correctly recalled in the three runs. This measure provides an insight into the participant’s ability to efficiently update and maintain information in working memory.

Go/No-Go (GNG) task

We measured cognitive inhibition with the computerized version of the GNG task available in the PEBL software90. In the GNG task, participants were instructed to respond to certain stimuli (“go” stimuli) by clicking on a button as fast as possible and to refrain from clicking on other stimuli (“no-go” stimuli). In this version, participants were presented with a 2 × 2 array with four blue stars (one in the center of each square of the array). Every 1500 ms, a stimulus (the letter P or R) would appear for 500 ms in the place of one of the blue stars. For the first half of the task, the letter P would be the “go” stimulus and the letter R would be the “no-go”. This rule would be then inverted in the second half of the task. Participants completed 320 trials. The ratio between “go” and “no-go” trials was 80:20, respectively. Cognitive inhibition capacity in the GNG task was measured by the d’:

$$d\prime ={\text{ }}Z\left( {hit{\text{ }}rate} \right){\text{ }} - {\text{ }}Z\left( {false{\text{ }}alarm{\text{ }}rate} \right),$$

with higher d’ scores indicating better cognitive inhibition.

Verbal fluency tasks

Verbal fluency was assessed through three subtasks, examining the lexical, semantic, and action components of verbal fluency. In the lexical fluency subtask, participants were tasked with generating as many words as possible beginning with the letter “P,” a common practice in the French version of phonemic fluency92. For the semantic fluency subtask, participants were instructed to name animals, and for the action fluency subtask, they had to articulate isolated verbs describing actions feasible for a person to perform.

In each subtask, participants aimed to produce as many words as possible within one minute while adhering to specific guidelines: avoiding word repetitions, words with the same etymological root, and proper nouns. Any violation of these rules constituted an error. The score for each verbal fluency subtask was calculated by subtracting the number of errors from the total words generated within the one-minute timeframe. A higher score in each verbal fluency component denoted greater verbal fluency capacity, reflecting the participant’s ability to generate words efficiently while adhering to specified constraints.

Experimental design

Participants took part in a four-session experiment (Fig. 2A). The first three sessions were dedicated to assessing the acquisition, rewiring, and retrieval of implicit probabilistic information, each separated by a 24-hour interval. In the initial session (Learning Phase), participants’ SL abilities were evaluated using the ASRT task. Here, participants acquired associations related to Sequence A. In the second session (Rewiring Phase), a structural change was introduced to the task by incorporating Sequence B (see Tasks section for details). This modification allowed us to measure participants’ rewiring abilities. In the third session (Retrieval Phase), participants were tested on both sequences A and B in a counterbalanced order, providing insight into the consolidation of initial and rewired knowledge. Half of the participants started the Retrieval Phase with Sequence A and the other half with Sequence B. The fourth session was conducted independently from the first three sessions and was planned according to each participant’s availability. In this session, prefrontal-dependent cognitive functions were assessed with a battery of well-established and reliable neuropsychological tasks: the ANT, the BCST, the CSPAN task, the GNG task and three variants of the Verbal Fluency tasks (action, lexical and semantic fluency).

Statistical analysis

Evaluating someone’s ability to rewire their knowledge during the Rewiring Phase requires that the initial knowledge is acquired during the Learning Phase. Therefore, participants who did not exhibit initial SL during the Learning Phase (i.e., L-H < 0) were excluded. This refinement in participant selection aimed to enhance the relevance and meaningfulness of all subsequent analyses focusing on the relationship between rewiring and EF. Seventeen participants did not demonstrate SL during the Learning Phase. Consequently, our analyses were performed using the data of the remaining 59 participants.

Learning, rewiring and retrieval trajectories

For the Learning, Rewiring, and Retrieval Phases, the first five trials (comprising five warm-up random trials) of each block were excluded from the analysis, along with all incorrect responses throughout all trials. Additionally, trials featuring trills (e.g., 2-1-2) or repetitions (e.g., 2-2-2) were omitted from the analysis because participants tend to respond more rapidly to these types of triplets due to pre-existing tendencies94. To facilitate the analysis and improve the signal-to-noise ratio, the blocks of the ASRT task were organized into units of five blocks, referred to as bins.

In the first session (Learning Phase), learning performance on Sequence A was assessed by calculating the median RTs within each bin for correct responses in high-probability triplets specific to Sequence A (HL) and low-probability triplets common to all sessions (LL), which served as a baseline to control for general practice effects. Subsequently, learning scores were computed for each bin by subtracting the median RTs of HL triplets from the median RTs of LL triplets. A higher difference score between HL and LL triplets indicated better initial learning of probabilities. The learning scores for each bin were then averaged to generate a single score reflecting statistical learning (SL) for each participant in the Learning Phase.

In the second session (Rewiring Phase), rewiring performance on Sequence B was evaluated by calculating the median RTs for correct responses in high-probability triplets specific to Sequence B (LH) and LL triplets. Data processing followed a similar approach to the first session, with the unique distinction being that rewiring scores resulted from the difference between LH and LL triplets. A higher difference score between LH and LL triplets indicated better rewiring performance.

During the third session (Retrieval Phase), participants alternated between bins of sequences A and B in a counterbalanced order. Data for assessing the retrieval of Sequence A was processed similarly to the Learning Phase, while data for Sequence B retrieval followed the same procedure as in the Rewiring Phase.

To evaluate SL in the Learning and Rewiring Phases separately, we conducted a repeated measures analyses of variance (RM ANOVAs) with two factors for each phase: bin (1–5) and triplet (HL vs. LL for the Learning Phase; LH vs. LL for the Rewiring Phase). To compare the retrieval of Sequences A and B in the third session, we conducted an RM ANOVA with two factors: bin (1–3; corresponding to the three bins presented for each sequence in the Retriaval Phase), and sequence (A vs. B). In all ANOVAs, Greenhouse-Geisser epsilon correction was used when Mauchly’s test of sphericity indicated that sphericity could not be assumed. Original df values and p values are reported together with partial eta-squared (ηp2) as the measure of effect size.

To complement these analyses, we conducted Bayesian RM ANOVAs using JASP93. Bayesian analysis allows us to provide evidence in favor of the null hypothesis, rather than failing to reject it. The inclusion Bayes factor quantifies the change from prior inclusion odds to posterior inclusion odds and can be interpreted as evidence in the data for including a predictor or factor. More generally, Bayes factors quantify the relative weight of evidence provided by the data for two theories, the null and the alternative hypotheses, H0 and H195. According to a commonly used classification96, BF values between 1 and 3 indicate anecdotal evidence, values between 3 and 10 indicate substantial evidence, and values above 10 indicate strong evidence for H1. Conversely, values between 1/3 and 1 indicate anecdotal evidence, values between 1/10 and 1/3 indicate substantial evidence, and values below 1/10 indicate strong evidence for H0. Values near 1 indicate that the data do not favor either hypothesis.

Correlations

For our correlation and multiple regression analyses, we derived single indices of learning, rewiring, and retrieval of sequences A and B by averaging scores across all bins for each participant. Prior research has indicated that while the reliability of individual bins is typically low, averaging across at least five bins results in acceptable levels of reliability72. Given the importance of using reliable measures for assessing inter-individual differences, our analysis of the relationship between rewiring and EF utilized these average scores. Our reported Pearson’s correlations between learning, rewiring, retrieval and EF scores include Benjamin-Hochberg adjusted p-values to account for multiple comparisons. Additionally, we calculated Bayes factors using JASP93 with default priors (stretched beta distribution with a width of 1) to further validate our findings.

Multiple regression analysis

To assess the relationship between EF and the different phases of the experiment (i.e., learning, rewiring, and retrieval), we conducted multiple linear regression analyses. The predictor variables included the scores of the executive component of the ANT, BCST, CSPAN task, GNG task, and verbal fluency tasks. Note that the alerting and orienting components of the ANT were not included in the models as these do not reflect EF88. The regression analyses were applied systematically to the outcome variables for each experimental phase.

For the Learning Phase, the EF tests were used as predictor variables to assess their contribution to the variance in the initial learning performance on (Sequence A). For the Rewiring Phase, we explored the association between EF test scores and rewiring performance (on Sequence B). For the Retrieval Phase, two separate multiple regression analyses were conducted to assess the relationship between EF and the retrieval of Sequence A and Sequence B. Multicollinearity among predictor variables was assessed using variance inflation factor (VIF) in each model. All VIF values were below the commonly accepted threshold of 5, with the highest value being 1.804 for action fluency, indicating that multicollinearity is not a concern in our models.

Results

Learning, rewiring and retrieval trajectories

Learning phase

To evaluate SL in the learning phase, we conducted a repeated measures analysis of variance (RM ANOVA) with two factors: bin (1–5) and triplet (HL vs. LL). The main effect of bin was significant (F(4, 232) = 13.056, p <.001, ηp2 = 0.184, BFinclusion = 3.527e + 9), indicating general skill learning, i.e., participants became faster overall as the task progressed, irrespective of triplets (bin 1 significantly slower than all further bins all p-values < 0.001, all other pairwise comparisons p >.248). The main effect of triplet was significant (F(1, 58) = 12.395, p <.001, ηp2 = 0.176, BFinclusion = 14816.224), with average responses to HL triplets being faster than responses to LL triplets (MHL = 358 ms, 95% CI = [352, 364]; MLL = 362 ms, 95% CI = [356, 368]; d = 0.157), revealing successful SL during the Learning Phase (Fig. 3A). The bin by triplet interaction was non-significant (F(4, 232) = 1.644, p =.164, ηp2 = 0.028, BFinclusion = 0.121) indicating that SL did not change substantially across the bins.

Fig. 3
figure 3

Acquisition and retrieval of implicit knowledge during the different phases of the experiment. (A) Leaning Phase. Acquisition of the initial sequence during the Learning Phase. The learning scores for each bin denote the difference of RTs between HL and LL triplets. Participants acquired the probabilistic regularities of Sequence A, responding faster to HL triplets compared to LL triplets. (B) Rewiring Phase. Acquisition of a new sequence during the Rewiring Phase. The rewiring scores for each bin denote the difference of RTs between LL and LH triplets. Participants acquired the probabilistic regularities of Sequence B, responding faster to LH triplets compared to LL triplets. (C) Retrieval Phase. Retrieval of both sequences during the Retrieval Phase. The retrieval scores denote the differences between HL and LL triplets for Sequence A (in blue), and LH and LL triplets for Sequence B (in red). Sequence A and Sequence B knowledge was retrieved equivalently and did not vary across the bins. Error bars in all figures denote the standard error of the mean.

Rewiring phase

To evaluate SL in the Rewiring Phase, we conducted an RM ANOVA with two factors: bin (1–5) and triplet (LH vs. LL). The main effect of bin was non-significant (F(4, 232) = 0.513, p =.726, ηp2 = 0.009, BFinclusion = 0.006), indicating that, irrespective of triplets, the RTs did not progress in the second session. Importantly, the main effect of triplet was significant (F(1, 58) = 5.596, p =.021, ηp2 = 0.088, BFinclusion = 12.602), with average responses to LH triplets being faster than responses to LL triplets (MLH = 347 ms, 95% CI = [341, 353]; MLL = 349 ms, 95% CI = [344, 355]; d = 0.098), indicating updating of the initial knowledge during the second session (Fig. 3B). The bin by triplet interaction was non-significant (F(4, 232) = 1.797, p =.130, ηp2 = 0.030, BFinclusion = 0.001), indicating that rewiring performance did not vary substantially across the bins.

Retrieval phase

To evaluate the retrieval of sequences A and B in the third session, we conducted an RM ANOVA, this time on the learning scores, with two factors: bin (1–3) and sequence (A vs. B). Learning scores for both sequences were significantly different from 0 in almost all bins (ps < .043), except the Sequence A learning score for bin 3, which did not significantly differ from 0 (p =.190). The main effect of bin was non-significant (F(2, 116) = 2.081, p =.129, ηp2 = 0.035, BFinclusion = 0.168) indicating that learning scores remained relatively constant during the retrieval phase. The main effect of sequence was non-significant (F(1, 58) = 3.220, p =.078, ηp2 = 0.053, BFinclusion = 0.421) indicating that both sequences were retrieved equally during the third session (Fig. 3C), although there is a trend towards smaller Sequence A learning scores (MSeqA = 3.23 ms, 95% CI = [1.02, 5.45]; MSeqB = 4.89 ms, 95% CI = [2.63, 7.15]; d = 0.152). The bin by sequence interaction was non-significant (F(2, 116) = 0.300, p =.742, ηp2 = 0.005, BFinclusion = 0.022) indicating that the retrieval of sequences A and B did not vary across the bins.

Correlations

Bivariate Pearson’s correlations with Benjamin-Hochberg adjusted p-values between all variables are presented in Fig. 4. Overall, most EF measures showed weak or non-significant correlations with learning, rewiring, and retrieval scores for sequences A and B. The strongest correlations were found between the rewiring and GNG scores (Pearson’s r =.439, 95% CI = [0.206, 0.625], p = .003, BF10 = 59.484) and between the rewiring and Semantic fluency scores (Pearson’s r = −.378, 95% CI = [−0.578, − 0.135], p = .015, BF10 = 11.450).

Fig. 4
figure 4

Correlations Between Learning, Rewiring, Retrieval Scores, and EF Measures. Negative and positive correlations are visually represented by red and blue backgrounds, respectively. A significant positive correlation was found between cognitive inhibition (measured by GNG) and rewiring performance, while a significant negative correlation was found between semantic fluency and rewiring. Benjamin-Hochberg corrected p-values are included in the matrix. The Learning score reflects knowledge acquired in Session 1, while the Rewiring score reflects the updating of knowledge (i.e., rewiring performance) in Session 2. SeqA score indicates the retrieval of initial knowledge in Session 3, and SeqB score reflects the retrieval of updated knowledge in Session 3. ANT alerting, orienting, and executive represent the three aspects of the Attentional Network Test. BCST pers score denotes cognitive flexibility performance in the Berg Card Sorting Test. CSPAN indicates working memory performance in the Counting Span task. Go/No-Go reflects inhibition performance in the Go/No-Go task. Lastly, Act, Sem, and Lex fluency scores represent performances in the Action Fluency, Semantic Fluency, and Lexical Fluency tasks, respectively.

Multiple regression analysis

Rewiring phase

To test whether EF scores predict the rewiring of implicit probabilistic information, a multiple regression analysis was conducted with the rewiring score as the dependent variable. Predictor variables included the executive component of ANT, BCST, CSPAN, GNG, and verbal fluency tasks (action, semantic, and lexical). Together, these predictors explained 22.6% of the variance in rewiring performance (Adjusted R2 = 0.226, F(7, 51) = 3.418, p = .005). Notably, the GNG score emerged as a strong predictor of rewiring (B = 5.881, 95% CI = [1.458, 10.304], p =.010, β = 0.341), suggesting that better inhibition, as measured by GNG performance, was associated with more effective rewiring of implicit probabilistic representations (Fig. 5A). In contrast, the semantic fluency score negatively predicted rewiring performance (B = −0.455, 95% CI = [−0.838, −0.073], p =.021, β = − 0.325), indicating that better semantic fluency performance was inversely related to rewiring (Fig. 5B). Other EF scores, including ANT executive, BCST, CSPAN, lexical fluency, and action fluency, did not significantly contribute to the model (Supplementary Table 2).

Fig. 5
figure 5

Relationship between the Rewiring score and the GNG d-prime and Semantic fluency scores. Each data point represents a participant. The black line represents the best-fit regression line, and the gray shading indicates the 95% confidence interval.

Learning and retrieval phases

In line with the correlation results reported above, multiple regression analyses for learning in Session 1 and retrieval of Sequences A and B in Session 3 revealed no significant predictors. Details of these multiple regression results are presented in Supplementary Tables 3, 4, and 5, respectively.

Discussion

When the environment changes, our old habits and predictions no longer function optimally. Our study aimed to explore how individual differences in EF affect the updating of our predictive models, leading to changes in our habits. Our findings revealed distinct correlations between EF and the rewiring of implicit probabilistic representations. Specifically, we observed a positive relationship between inhibition, measured by the Go/No-Go task, and rewiring. We also observed a negative relationship between semantic fluency and rewiring. These correlations were further confirmed by multiple regression analysis, which showed that while Go/No-Go scores positively predicted rewiring performance, semantic fluency scores negatively predicted it. This suggests that individuals with stronger inhibitory control and/or lower semantic fluency tend to exhibit a better capacity to update their probabilistic representations during rewiring.

The positive relationship between rewiring and the Go/No-Go task suggests that the ability to inhibit irrelevant or outdated information and behaviors may facilitate the adaptation to new probabilistic contingencies in the presence of pre-existing knowledge. This positive correlation resonates with existing literature on EF and habit change, where inhibitory control has been shown to play a crucial role in disrupting and modifying unhealthy habits and ingrained behaviors97,98,99,100 Cognitive inhibition may facilitate habit change by shielding the individual from maladaptive memory representations101 and suppressing automatic motor responses tied to old habits102. This may be particularly relevant in our task, where successful rewiring requires suppressing automatic motor responses tied to the old sequence. It is worth noting that we focused on response inhibition (Go/No-Go task) rather than interference control (e.g., Stroop task). While response inhibition appears critical for overriding learned motor actions, overcoming proactive interference from prior cognitive representations is also a key challenge in rewiring. Future research employing tasks that measure both facets of inhibition could help disentangle their respective contributions to the updating of implicit knowledge.

Intriguingly, this finding contrasts with previous research which reported that inhibitory control was detrimental to the rewiring of probabilistic representations15. Notably, efforts to understand the relationship between SL and EF have led to various experimental approaches, generally categorized into four main types. The first involves behavioral studies adopting an interindividual differences approach, measuring SL and EF with separate tasks and investigating the correlations between their performance indices. These studies have found both positive and negative associations, suggesting potential cooperation, competition, and instances of independence65,67,68. The second category includes interventional studies that deplete, disrupt, or suppress EF to observe how this affects SL, generally finding a competitive relationship, with weakened EF leading to better SL performance61,62,63. The third category focuses on brain network studies, examining the relationship between SL and the disengagement of brain networks supporting EF, and finding that such disengagement improves SL60,65,66. The fourth category employs combined paradigms that integrate features of both SL and EF in a single experimental design, providing direct insight into their real-time interactions. Studies following this approach have shown evidence for both competition and independence between SL and EF103,104.

While most of these studies focus on the initial acquisition of probabilistic knowledge, their methodological approaches can also be extended to study the relationship between rewiring and EF. Our study follows the first approach. In another study, participants actively inhibited motor responses associated with the knowledge acquired in the Learning Phase, whereas in our study, we measured interindividual differences in inhibitory control capacity15. This methodological difference may explain the discrepancy in results. It is possible that, in Horvath et al., individuals’ cognitive inhibitory resources were allocated to refraining from specific motor responses rather than inhibiting implicit probabilistic representations acquired in the Learning Phase. These results can be interpreted within a dual-task framework, which suggests that if a secondary task impairs performance, the cognitive function engaged in the secondary task is crucial for achieving the good performance in the primary task105. Thus the weaker rewiring scores observed in the previous study for inhibited (no-go) trials compared to go trials may indicate that inhibitory capacity is essential for the rewiring process itself15.

Conversely to the positive relationship between rewiring and inhibition observed in our study, we found a negative correlation between rewiring and semantic fluency. This result suggests that individuals with higher semantic fluency may be less proficient at updating their implicit probabilistic representations in response to environmental changes. While this result might seem unexpected within the broader context of EF and habit change research52, it is consistent with the competitive learning and memory systems framework41,44,60,68,70. Given that verbal fluency tasks, especially semantic fluency, involves the controlled access to long term memory representations106,107,108, a possible interpretation for our result could be that individuals with greater verbal fluency rely more heavily on pre-existing cognitive mnesic models, which could impede their ability to integrate novel probabilistic information68. This negative relationship aligns with the competition hypothesis of dual-process neuro-computational models where a greater reliance on model-based cognitive functioning is associated with a smaller reliance on model-free processes39,42,44,109. In this context, proficiency in semantic fluency could reflect more efficient access to long-term memory, suggesting a heavier reliance on model-based processes. This reliance may hinder the acquisition of new probabilistic information, which may be dependent on the model-free system. Similar results were also reported, suggesting that the ability to acquire new probabilistic representations might be related to a reduced reliance on model-based functioning, as reflected by poorer performance on verbal fluency tasks68.

It is important to note, however, that while verbal fluency tasks are generally considered good indicators of EF110,111, their precise role within the Unity and Diversity model of EF47 is still debated. Studies examining verbal fluency tasks in the context of Miyake’s model have suggested that these tasks may reflect cognitive flexibility or set shifting110,112. Based on this, it could be hypothesized that the negative correlation between rewiring and verbal fluency found in our study reflects competition between updating probabilistic representations and cognitive flexibility. Nonetheless, this interpretation seems unlikely given our data, as no relationship was found between rewiring and BCST, a well-established set-shifting task86. Additionally, the lack of correlation between BCST and verbal fluency tasks in our analysis further suggests that verbal fluency tasks are not purely measures of cognitive flexibility. Semantic, lexical, and action fluency tasks likely involve different cognitive processes and may differ to what extent they reflect EF. For example, while cognitive flexibility has been shown to be the primary factor in the lexical fluency task, the semantic fluency task may also involve other components such as categorical production112, which likely reflects long-term memory access113. These distinctions support our interpretation that semantic fluency reflects model-based functioning, particularly within the context of the competition hypothesis of dual-process neuro-computational processes.

Verbal fluency tasks have also been argued to reflect working memory updating114,115,116. In the context of our results, this could imply that working memory updating proficiency might counteract the rewiring of probabilistic representations. However, this interpretation seems unlikely given our data, as we observed no relationship between the Counting Span task, a valid and reliable working memory task117, and rewiring or verbal fluency. In our previous study68, using exploratory factor analysis we have identified a common factor underlying both the Counting Span and verbal fluency tasks, which negatively correlated with SL. We have suggested that this factor could reflect either updating or access to long-term memory models. Based on our current results, the latter explanation seems more plausible. This interpretation aligns with results that suggest that the updating of implicit probabilistic representations depends exclusively on model-free processes37. It is also important to note that the updating referenced in the context of EF models differs from the updating involved in the probabilistic learning and neuro-computational modeling framework. The former involves the conscious maintenance and manipulation of information in working memory48,118, whereas the latter involves the implicit modification of informational priors119,120. This is consistent with our interpretation that not the updating in working memory, but the access to long-term memory affects rewiring performance.

Our correlational analyses results were further corroborated through multiple regression analysis, which revealed that inhibition, measured by the Go/No-Go task, positively predicted rewiring performance, while semantic fluency negatively predicted rewiring it. This statistical approach strengthened the validity of our results and provided additional support for the observed relationships between executive functions and the rewiring of probabilistic representations121. Our findings highlight the complex cognitive dynamics involved in the rewiring of implicit probabilistic representations. Successful rewiring appears to require a balance between inhibiting outdated cognitive schemas and acquiring new probabilistic information. Different cognitive functions, such as inhibition and verbal fluency, may contribute to these processes in distinct ways. Inhibition may facilitate the suppression of motor responses and memory representations related to outdated knowledge, facilitating the integration of new information101,102. Conversely, verbal fluency may reflect a greater reliance on model-based functioning, which could hinder the flexibility needed to acquire novel probabilistic representations44,68,107,108.

These findings carry significant theoretical implications for understanding the cognitive mechanisms involved in acquiring probabilistic information across various contexts. In the literature, there has been ongoing discourse regarding whether the acquisition of new information, whether under conditions of proactive interference or not, relies on similar or distinct neural and cognitive mechanisms15,122,123. The negative correlation we observed between semantic fluency and rewiring performance echoes findings from studies investigating the link between EF and learning67,68. This suggests a possible convergence in the neurocognitive mechanisms that support both learning and rewiring processes. Nonetheless, this hypothesis warrants further investigation, particularly through studies exploring the cellular and neural architectural dynamics underlying learning and rewiring of probabilistic information.

Our findings also highlight the potential for cognitive control interventions, particularly those targeting inhibitory control, to enhance the rewiring process. Research in fields such as food intake and addiction has shown that cognitive control training can effectively modify habitual behaviors52,97,98. Bolstering inhibitory control could enable individuals to better suppress outdated cognitive models and more easily integrate new information, facilitating adaptive learning and habit updating. Nonetheless, it is important to acknowledge the limitations of this study. For instance, the Rewiring Phase and executive function measurements were conducted on separate days. Intraindividual differences in the reliance on model-based or model-free functioning may be influenced by factors such as emotional state124,125 or hormonal fluctuations during the menstrual cycle126,127. These factors could contribute to variability in our findings. While other approaches such as Canonical Correlation Analysis could reveal powerful associations between our multivariate executive function and ASRT measures, offering deeper insights into how these distinct cognitive processes interrelate, our current sample size is too small for reliable results with this method. Furthermore, future studies could explore alternative metrics of learning beyond average performance, such as the rate of acquisition (i.e., the learning slope) or asymptotic performance levels, which may capture different aspects of learning dynamics and relate differently to executive functions. Future studies should address these limitations to improve the robustness and generalizability of the results. Additionally, incorporating neuroimaging techniques such as fMRI or EEG could elucidate the neural mechanisms underlying the interplay between these cognitive processes.

A further limitation of our study is the relatively small sample size (N = 59) used for the main analyses, which resulted from the necessary exclusion of participants who did not show evidence of initial learning. This sample size may limit the statistical power of our analyses and the stability of the correlation estimates. For instance, we found a specific negative relationship between rewiring and semantic fluency, but not lexical or action fluency, even though these tasks were inter-correlated. While we have offered a theoretical interpretation for this specificity, it is possible that this pattern is an artifact of our sample size. Therefore, our findings, particularly the nuanced relationships with specific EF measures, should be interpreted with caution and warrant replication in larger, more powered studies to ensure their robustness and generalizability.

To our knowledge, this study is the first to explore the relationship between the rewiring of implicit probabilistic information and a wide range of prefrontal-dependent executive functions. Our findings suggests a nuanced interaction between these cognitive processes, as revealed through the study of individual differences: while rewiring showed a positive relationship with cognitive inhibition, it was negatively associated with verbal fluency. This indicates that the rewiring of implicit probabilistic representations is a multifaceted cognitive process intricately intertwined with executive functions. It appears to necessitate both overcoming proactive interference from prior knowledge, facilitated by cognitive inhibition, and a reliance on model-free functioning to integrate novel probabilistic information, as evidenced by the negative relationship with semantic fluency. By shedding light on the relationship between executive functions and rewiring, this study provides a foundational understanding of the cognitive dynamics underlying habit change.