Learning to operate an imagined speech Brain-Computer Interface involves the spatial and frequency tuning of neural activity

Bhadra, Kinkini; Giraud, Anne-Lise; Marchesotti, Silvia

doi:10.1038/s42003-025-07464-7

Download PDF

Article
Open access
Published: 20 February 2025

Learning to operate an imagined speech Brain-Computer Interface involves the spatial and frequency tuning of neural activity

Communications Biology volume 8, Article number: 271 (2025) Cite this article

9264 Accesses
9 Citations
23 Altmetric
Metrics details

Subjects

Abstract

Brain-Computer Interfaces (BCI) will revolutionize the way people with severe impairment of speech production can communicate. While current efforts focus on training classifiers on vast amounts of neurophysiological signals to decode imagined speech, much less attention has been given to users’ ability to adapt their neural activity to improve BCI-control. To address whether BCI-control improves with training and characterize the underlying neural dynamics, we trained 15 healthy participants to operate a binary BCI system based on electroencephalography (EEG) signals through syllable imagery for five consecutive days. Despite considerable interindividual variability in performance and learning, a significant improvement in BCI-control was globally observed. Using a control experiment, we show that a continuous feedback about the decoded activity is necessary for learning to occur. Performance improvement was associated with a broad EEG power increase in frontal theta activity and focal enhancement in temporal low-gamma activity, showing that learning to operate an imagined-speech BCI involves dynamic changes in neural features at different spectral scales. These findings demonstrate that combining machine and human learning is a successful strategy to enhance BCI controllability.

Chisco: An EEG-based BCI dataset for decoding of imagined speech

Article Open access 21 November 2024

EEG-based BCI Dataset of Semantic Concepts for Imagination and Perception Tasks

Article Open access 15 June 2023

A large EEG database with users’ profile information for motor imagery brain-computer interface research

Article Open access 05 September 2023

Introduction

Neurological disorders of language such as aphasia, amyotrophic lateral sclerosis, and locked-in syndrome can disrupt natural speech dramatically impacting the quality of life for both patients and caregivers^1,2. A promising approach to restore language communication is to decode imagined speech directly from neurophysiological signals and translate them into text, synthesized speech or even moving avatars through a brain-computer interface (BCI). This approach has raised two important challenges: how the machine can decode neural signals, and how the patient can optimize its interaction with the decoder. For the latter, providing a feedback to the user in real-time is crucial.

Recent years have seen great advances in the field of speech-BCIs, most often through the decoding of attempted speech from intracranial electrophysiological recordings^{3,4,5,6,7,8,9}, which have led to impressive decoding speeds reaching about 78 words per minute⁴. Such an approach, however, is unlikely to be suitable for disorders of language where speech production areas are damaged, such as in post-stroke expressive aphasia. A more appropriate BCI for these disorders would require decoding imagined, rather than attempted speech. Also termed covert or inner speech, imagined speech consists of the internal production of speech without self-generated audible output^10,11, thus without the involvement of the musculoskeletal system. Depending on the brain damage location, different imagined speech strategies can be considered, from kinesthetic to abstract phonological ones. Although previous studies have characterized the neural correlates of imagined speech¹², mostly in comparison with overt speech^{13,14,15,16,17,18}, only a handful of BCI studies have attempted to decode imagined speech in real-time, with promising but often limited effectiveness^19,20,21,22. This is due to different challenges and limitations primarily pertaining to the weakness of imagined speech signals as compared to overt speech^{13,14,16,17,23}, the difficulty in precisely identifying the onset of speech imagery²³, inter-individual differences in the ability to control the BCI^24,25, and the technique employed to record brain activity.

The state-of-the-art approach to experimentally address imagined speech decoding is to exploit intracranial recordings such as electrocorticography (ECoG) and stereotactic EEG (sEEG), which allow neural sampling from key language regions with higher spatial resolution and to use higher-frequency neural activity than with surface recordings. Exploiting these experimental advantages, imagined speech decoding for BCI-control has been first attempted using ECoG to decode imagined phoneme pronunciation versus rest from the perisylvian area¹⁹. Although decoding accuracy was highly above chance, this study did not provide evidence that the method could be used to discriminate between two imagined speech units in real-time. More than a decade later, sEEG was used to synthesize imagined speech in real-time into continuous acoustic feedback from high-gamma activity in the frontal cortex and motor areas²¹. Although the reconstructed speech was unintelligible and less accurate for imagined than overt speech, this study was the first proof of concept that imagined speech could be used for naturalistic communication with a speech neuroprosthesis. More recently, impressive real-time control (up to 91%) was achieved by decoding eight imagined words from single-neuron activity in the supramarginal gyrus²², highlighting the superior effectiveness of decoding speech from individual neurons. Yet, using intracortical recordings for speech-BCIs remains a clinical and ethical challenge, owing to the high risk of clinical complications (loss of contacts, infection²⁶) potentially requiring explantation and the loss of the new communication means, a dramatic outcome for the patient²⁷. Much research and clinical efforts are still required to optimize the success of future speech-BCIs.

Capitalizing on its far greater ease of use, several studies have employed surface EEG for decoding offline (i.e., open-loop) a wide variety of imagined speech units such as phonemes, syllables, and words, most often in binary classification paradigms^10,28 (see for reviews^10,29,30). However, nearly all studies address imagined speech decoding from an engineering perspective, their main goal being the optimization of current classifiers to boost decoding accuracy (see³¹ for a review of classification methods). Despite the great amount of data and the possibility of applying computationally demanding decoders, offline classification accuracy from pre-recorded datasets remains below 80% when discriminating between two imagined speech units and around 60% for a three-class problem^{28,32,33,34,35}.

In the single BCI study that used EEG for online (i.e., closed-loop, real-time) speech imagery decoding²⁰, performance remained below 70% in discriminating between “yes” and “no”. Interestingly, however, this study pointed out important inter-individual differences in BCI-control, with accuracies varying between 53.75% and 95%²⁰. The variability in control abilities is well known in motor-imagery EEG-BCIs, in which up to 50% of participants are unable to achieve above chance BCI-control^24,36. Given the lack of speech-imagery EEG-BCI studies and the fact that invasive-BCI studies are mostly single-case, it remains to be assessed whether speech-BCI skills can improve with training. In the present study, we addressed speech-BCI controllability from a neurophysiological rather than neuroengineering perspective. We investigated whether BCI-control performance can be trained, and identified the neural and behavioral mechanisms underpinning the acquisition of these new skills. We designed a closed-loop BCI system based on EEG signals to decode in real-time the imagery of two syllables /fɔ/ and /gi/, chosen for their contrasted phonetic features, and trained 15 healthy participants to control the BCI for 5 consecutive days. We addressed the importance of feedback accuracy in BCI control learning by comparing these data with those obtained in a group of 10 healthy participants who trained with a discontinuous real-time feedback. This study thus targets both the variability and the dynamic range that can be achieved via training a whole brain EEG speech-imagery BCI.

Materials and methods

Participants

Fifteen healthy participants (5 females, mean age 23.9 years, SD ± 2.3, range 19–29) took part in this study which was approved by the local Ethics Committee (Commission Cantonale d’Éthique de la Recherche, project 2022-00451) and was performed in accordance with the Declaration of Helsinki. All ethical regulations relevant to human research participants were followed. All participants provided written informed consent and received financial compensation for their participation. All participants were right-handed.

Experimental paradigm and syllables imagery

Participants took part in the study daily for 5 consecutive days, at the same time of the day. To avoid a potential effect of circadian fluctuations on the participants’ performance, each of them began the training at the same time each day. Each session lasted approximately 2.5 h, amounting to a total duration of 12–13 h of experimental time per participant. The experiment took place in an optically, acoustically, and electrically shielded room.

On each day, participants performed a mental chronometry task followed by a BCI-control session, both involving the imagery of two syllables /fɔ/ or /gi/, chosen for their contrasted phonetic features regarding consonant manner (fricative vs plosive), place of articulation (labiodental vs velar), vowel place (mid back vs high front) and rounding (rounded vs unrounded). As different neural responses are associated with these distinct phonetic features^17,37,38,39, we expected to maximize the discriminability between the EEG signals associated with the imagery of each syllable. Participants were asked to focus on the kinesthetic sensation they would experience if they pronounced the syllable aloud. As the long-term goal of speech-BCI is to provide a means of communication for individuals who have lost the ability to speak, and consistent with the latest works of speech-BCI^4,52^,, participants were instructed to focus on how they would articulate speech rather than how speech would sound or look like in writing. Using imagined articulating speech, we expected to obtain a consistent neural response across the entire group and thus get reliable EEG analyses. Another technical advantage of exploring first this strategy is that kinesthetic imagery recruits more superficial brain areas than imagined speech perception thus more accessible with surface EEG⁴⁰.

At the end of the last and 5^th day of training, participants were asked to report the strategy they used during the BCI-control session (see Supplementary Table 1 for individual reports).

Mental chronometry

The mental chronometry test is a well-known experimental approach to empirically evaluate motor imagery skills (see for instance⁴¹) that has previously been used to evaluate motor imagery abilities for BCI-control²⁵. According to this previous literature, the temporal congruency between the time required to perform the motor imagery and its actual execution indicates good imagery abilities (and vice-versa). Here we applied this methodology for the first time to the speech domain to probe a possible relationship between interindividual variability in speech imagery timing (acutely and across training days) and BCI performance, and to gain insights into the pace used to perform the imagery.

To do so, we asked our participants to either repeat aloud (i.e., overt) or imagine pronouncing (i.e., covert) five times one of two syllables used for the BCI control. Participants were instructed to verbally report the moment in which they began and completed the task by saying respectively “start” and “stop”. The time between these two verbal indications was measured with a chronometer by the experimenter. There were a total of four experimental conditions with modality (speak/imagine) and syllable (/fɔ/ or /gi/) as factors, each of which was repeated 10 times. Participants were instructed to keep a constant rhythm for repeating the syllables throughout the trials.

EEG acquisition and BCI loop

EEG recording

Neural data were recorded using a 64-channel ANT Neuro system (eego mylab, ANT Neuro, Hengelo, Netherlands) at a sampling rate of 512 Hz using electrode AFz as ground and CPz as reference. Channels’ impedance was kept below 20 kΩ throughout the experiment. Electromyography signals (EMG) were recorded from the right side of the participant’s face to measure potential articulatory muscles’ activation despite our explicit instructions to avoid any movement⁴². The zygomaticus major and the orbicularis oris were targeted as these are most prominently involved in the place of articulation of the two syllables (respectively for /gi/ and /fɔ/⁴²) and the right side was determined by the participants’ handedness (all right-handed), typically matching the dominant side of the face, and therefore tends to exhibit more pronounced movements during speech production^43,44. EEG and EMG data were acquired using Lab Streaming Layer (LSL, https://github.com/sccn/labstreaminglayer).

During the EEG recording and while operating the BCI, participants sat comfortably on a chair in front of a computer screen while keeping their hands on their thighs and were instructed to avoid any physical movement.

BCI loop

The EEG-BCI loop was developed using an adapted version of the framework Neurodecode (Fondation Campus Biotech Geneva, https://github.com/fcbg-hnp/NeuroDecode), already used in previous BCI studies^45,46,47. On each training day, the BCI-control included two sessions, an offline session in which data were recorded for the classifier’s calibration and an online part where participants controlled a visual feedback in real-time. Therefore, the classifier was different on each experimental day. Participants were asked to use the same imagery strategy throughout the entire duration of the BCI training (see Syllable imagery during BCI-training section).

Syllable imagery during BCI-training

Participants were instructed to imagine repeating each of the two syllables (such as “/fɔ/-/fɔ/-/fɔ/…” or “/gi/-/gi/-/gi/-…” using the cognitive strategy described in the above section “Experimental paradigm and syllables imagery”) while keeping a constant pace. Unlike the mental chronometry test, they were not instructed to imagine a specific number of syllable repetitions, and the experimenter made no reference to this previous cognitive task. Participants were explicitly told to avoid any movements during imagery, especially those involving the face, and not to mouth nor whisper. They were informed that the muscle activity of the face was being monitored with EMG electrodes throughout the entire experiment to control for such movements.

Offline session and classifier calibration

EEG data were acquired while participants performed syllable imagery without receiving any real-time feedback (offline runs) and subsequently used to calibrate the classifier. Each offline trial began with a text indicating the trial number (1 s), followed by a fixation cross (2 s), and a written cue indicating which of the two syllables participants had to imagine pronouncing (2 s). After the cue disappeared, an empty battery appeared on the screen, which then progressively filled for 5 s (Fig. 1a). Participants were instructed to start imagining pronouncing the syllable immediately after the battery appeared on the screen and stop when the battery was filled. They were explicitly told that the battery filling was independent of their brain signals. At the end of the 5 s imagery period, the battery was displayed as it appeared at the last filling level, for 2 additional seconds, with the tip of the battery turning yellow to indicate the participant to stop imagining. Participants had 5 s to rest while the instruction ‘Rest’ was displayed on the screen. There were a total of 40 trials per syllable, arranged in 4 blocks each consisting of 10 trials per syllable, with short breaks in between blocks. The offline session lasted approximately 25–30 min.

**Fig. 1: Experimental paradigm, *online* BCI performance, and decoding accuracy.**

Offline data was then used to calibrate the decoder. Features were extracted by computing the power spectral density (PSD) of the EEG signal from 1 to 70 Hz (with a 2 Hz resolution) using a sliding window of 500 ms and 20 ms overlap. The PSD was calculated for each EEG channel excluding the three electrodes placed over the mastoid region bilaterally and the reference channel, leading to 61 channels. Therefore, there were a total of 2135 features (61 channels and 35 frequencies), each of which consisted of a channel-frequency pair. These features were fed to a random forest (RF) algorithm to extract the classifier parameters (i.e., the covariance matrix). This nonlinear classifier has already been proven effective in previous two-class BCI studies⁴⁸ and is known to be robust to overfitting⁴⁹. The RF classifier assigns a weight (expressed in percentage) to each feature, indicating its relative contribution to the classification. An 8-fold cross-validation (CV) was performed to test the model validity and calculate the offline CV accuracy.

Online BCI-control

During the online part of the experiment, the RF classifier was applied in real-time to the EEG data to decode which of the two syllables the participant was imagining, and accordingly, provided a continuous real-time feedback to the user to inform them about their neural performance. Trials were the same as during the offline part and participants were instructed to fill the battery by performing the same imagery task as before, keeping the same pace. The mapping of the decoder output to the battery feedback at each time sample was done in such a way that if the probability output by the classifier changed in the direction of the cued syllable, the battery’s filling would increase, and it would decrease if it didn’t. The real-time control went on until the battery was full or until a 5 s timeout. The delay between the recorded data and the feedback presentation was on average 100 ms. As for the offline session, there were 40 trials per syllable, divided into 4 blocks.

To boost participants’ motivation and keep them engaged in the task throughout the entire training period, we provided monetary bonuses based on performance. Participants received an additional 10 CHF for each experimental day in which their CV accuracy during the offline session was above chance (50%) and was higher than on the previous day.

Experiment with discontinuous real-time feedback

Ten healthy participants (5 females, mean age 24.9, SD ± 4.38 years, range 18–35, different individuals from those included in the main study) took part in a separate experiment in which the real-time feedback was experimentally altered. As in the main experiment, participants trained to control a BCI for 5 consecutive days, through the imagery of the same two syllables (/fɔ/ and /gi/). The sequence of the different experimental parts, imagery instructions, EEG acquisition, and BCI-system were the same as the one described in the previous paragraph, except that the real-time feedback was discontinuous, i.e., not systematically related to the classifier output and displayed only positive changes (see Supplementary Fig. 1 and Supplementary Methods for more details about the paradigm). Unlike in the main experiment, participants were presented with an auditory cue similar to the sound of a metronome, imposing a pace for the syllable repetition arbitrarily set at 1.4 Hz.

Data analyses

BCI Performance and CV accuracy

Participants’ BCI-control performance was calculated by considering, for each trial, the percentage of the classifier’s outputs that corresponded to the cued syllable. To probe for an increase in BCI-control performance, we performed a planned contrast analysis, considering the average performance during each training day for each participant, and testing for a linear increase or decrease from day 1 to 5 (numeric contrast using as weights −2, −1, 0, 1, and 2, respectively for day 1 to 5). This set of contrasts was tested in a linear mixed model (LMM), with participants as a random factor. The same statistical approach was used to probe changes across days in other dependent variables (e.g., features weight and power modulation evolution), and it is hereinafter referred to as “LMM with planned contrast”.

We investigated whether the learning dynamics (i.e., the evolution of performance across the 5 training days) were related to the individual’s ability to control the BCI. To do so, we fitted, separately for each participant, a linear model considering the average performance on each day and extracted the individual learning slope. We then computed the Pearson correlation between the average BCI-control performance across the whole training period and the learning slope.

Additionally, we tested for differences in performance between the two syllables (2-tailed paired t-test) and between blocks (one-way repeated measures ANOVA with Block number as within-participant factor, and 2-tailed paired t-tests for post hoc comparisons).

To further assess potential learning mechanisms across training, we considered the CV accuracy of the classifier obtained using the offline data (same as “classifier calibration” section) and applied the same method to the online session data. This approach pools together all data from an individual session (offline or online) and thus differs from the method used to compute the BCI-control performance, which is based on individual samples at the single-trial level. We tested for a linear increase in CV accuracy using the LMM with planned contrast, separately for the offline and online sessions. We compared the CV accuracy between the two sessions, averaged across days, with a 2-tailed paired t-test.

To test for differences in learning between offline and online sessions, we computed the training slope considering the CV accuracy across the 5 days for each participant and each session (offline/online) using the same method as for the BCI-control performance. We then assessed differences in CV-accuracy slope between offline and online sessions by performing a 2-tailed paired t-test.

Impact of the discontinuous real-time feedback on learning

To address whether the accuracy and consistency between the decoded brain patterns and the real-time feedback were a critical factor in BCI-control learning, we performed a subset of the analyses carried out for the main experiment, on the data acquired using the discontinuous feedback (behavioral and classifier’s data). First, we evaluated the presence of the learning pattern in BCI-control performance (%) across the 5 days of training as observed in the main experiment, using the approach defined above as “LMM with planned contrast”.

Next, we compared CV accuracies between the two groups by running a linear mixed model with fixed factors the planned contrast modeling a linear increase (“LMM with planned contrast”), the Group (continuous/discontinuous feedback), and the Session (offline/online). We performed post-hoc comparisons by performing a two-tailed paired t-test for within-group comparisons and a two-tailed unpaired t-test for between-group comparisons.

Classifier features

Next, we investigated which brain regions and frequency bands contribute most to BCI-control and studied changes in decoding patterns across training days. For this, we considered the feature weights of the classifier used during real-time BCI-control and computed based on the offline session data. Each feature refers to a specific channel-frequency pair leading to a total of 61 (channels) x 35 (frequencies) features. The weight of each feature is expressed as a percentage, where higher values indicate a stronger contribution in discriminating between the two syllables.

First, we addressed whether better BCI-control is associated with higher features’ weights, reflecting a better discriminability between the two classes. To do so, we considered, separately for each participant, the sum of the weights across the first 200 features (i.e., irrespective of frequency or channel location, ranked according to their weight). This sub-sampling was necessary since the sum of all features’ weights would have led to the same value of 100% across all participants. This subset size was chosen based on the cumulative sum of the first 200 features’ weights exceeding on average 50% and on its standard deviation across training days increasing up to the 170th ranking place (Supplementary Fig. 2a). This shows that only a part of higher ranking features is most prominently involved in training. We performed a Pearson correlation coefficient between the feature’s weight sum, averaged across days, and the average individual BCI-control performance (obtained as described in the BCI Performance and CV accuracy section).

Next, we investigated the topography of the most discriminant features, as well as their frequency distribution. We considered the average weight over the 5 training days (1) separately for each individual frequency disregarding to which electrode the features belonged and (2) across the scalp separately for each frequency value and frequency band, to obtain a topographical representation of the feature weights.

The frequency distribution as well as the topography for the most discriminat frequency interval was also computed for the dataset acquired using the discontinuous feedback.

Evolution of features over training

Subsequently, we quantified the evolution of the features’ weight over the five training days. First, to assess global changes in the weights across training, we considered the weights’ sum of the first 200 features for each training day and ran the “LMM with planned contrast” analysis, testing for a linear change in the weight with training. A positive relationship between BCI-control performance and weight over the 5 days would reflect a behavioral improvement.

Next, we explored changes due to training more specifically at the level of the decoding frequencies and brain regions. We first inspected changes by visualizing feature pairs as individual elements in a 2D map, with frequency values on the x-axis and individual channels on the y-axis.

We quantified changes over time separately in the frequency and spatial domain (i.e., at the topographic level). To limit the number of multiple comparisons, we guided the spatial analyses by the results obtained in the frequency domain.

To identify frequency-wise changes to the discrimination between the two syllables, we considered, separately for each individual frequency value the average weight across all features and performed a “LMM with planned contrast”. We assessed the statistical significance of the linear change across the 5 days of training with 1000 permutations, assigning randomly the day to which data belonged, within participants. Based on these results, we defined frequency intervals, over which performing the analysis in the spatial domain. We then considered the average weight across all features and ran a “LMM with planned contrast” for each electrode and each interval. The statistical significance was assessed with 1000 permutations, again by randomizing the factor Day on a single-participant basis.

Next, we evaluated the relationship between changes in BCI-control performance across days and changes in the features space. To quantify the global feature changes in a single index per participant, we considered the Euclidean distance between the feature weights of two consecutive days according to the following formula (Eq. 1):

$${dist}({DayN}+1,{DayN})=\sqrt{{\sum} _{{iFeature}}{\left({Day}{N+1}_{{iFeature}}{{{\rm{\hbox{-}}}}}{{DayN}}_{{iFeature}}\right)}^{2}}$$

(1)

where N indicates the experimental Day and iFeature a Frequency-Channel pair (Fig. 2f). The four distance matrices (one for each couple of consecutive days) were then averaged to obtain an individual index per participant, referred to hereafter as the global index. Last, we correlated the global index with the average BCI performance computed considering all training days.

**Fig. 2: Random forest classifier features and their evolution over training.**

EEG data preprocessing and analysis

EEG data recorded during the online session were preprocessed using Fieldtrip⁵⁰ and the Semi-Automatic Selection of Independent Component Analysis (SASICA⁵¹) toolboxes within the MATLAB environment (version R2018b; The MathWorks, Natick, MA, USA). The data were first filtered using a zero-phase Butterworth bandpass filter with cutoff frequencies of 1 and 70 Hz. They were then divided into epochs of 12 s centered around the syllable imagery onset, including 4 s pre-stimulus (during the fixation cross and the cue presentation) and 8 s post-imagery onset (5 s of online BCI control and 3 s of rest). Noisy channels and epochs were removed via visual inspection, after which the data were re-referenced to the common average (which also served the purpose of retrieving data from the reference channel). Principal Component Analysis (PCA) was used to identify and remove ocular and muscular artifacts. The choice of the components to be removed was guided by different metrics such as autocorrelation, focal trial activity, and dipole fit residual, computed through the SASICA toolbox. Last, noisy channels that were initially removed were added back to the dataset by interpolation.

Power changes during BCI-control

To investigate oscillatory modulations associated with speech imagery and BCI control, we computed the power change for each individual frequency and each channel during the entire trial with respect to the baseline activity in the −3 to −2 s pre-imagery onset (i.e., during the fixation cross presentation). For this, we used Morlet wavelets to decompose the preprocessed EEG time series in the time-frequency domain. The power values were baseline-normalized at the single trial level and separately for each frequency band, expressing the change in percentage.

The statistical significance of the power modulation averaged across the 5 training days was assessed separately in several frequency bands of interest (namely theta: 4–7 Hz, alpha: 8–13 Hz, beta: 14–26 Hz, low-gamma: 27–40 Hz and high-gamma 41–70 Hz) at the level of scalp topography by considering the average over the 5 s of real-time BCI-control. To do so, we used a Monte Carlo test with 1000 permutations, shuffling baseline and BCI-control labels, and a 2-tailed paired t-test to compare the average power over the two intervals. We used a standard cluster-based correction to account for multiple comparisons over the scalp⁵⁰. To assess the statistical significance of the identified clusters, their size was compared to the distribution of cluster sizes expected under the null hypothesis. Clusters that had a p value < 0.05 (two-tailed) were considered significant.

Changes in EEG power across training

To assess the evolution of the neural activity during BCI-control over the 5 training days, we first averaged the power over the 5 s of real-time BCI-control, separately for each channel, day, and participant. Next, a “LMM with planned contrast” was fitted for each channel and frequency band separately. The statistical significance was assessed with 1000 permutations, randomly permuting which day data belonged to, within participants. Multiple comparison correction was performed based on the same clustering method as mentioned in the previous section.

Evolution of Brain-Behavior over 5 days of training

Next, we addressed how the relationship between the power modulation in each frequency band and BCI-control performance changes over the training period. We performed separately for each frequency band and channel, a linear mixed model with BCI performance as a dependent variable, the “LMM with planned contrast” and EEG power as independent variables, and participants as a random factor, leading to the following model: BCI_performance ~ EEG_power*day + (1|Participant). Significance was calculated using permutation tests followed by cluster-based multiple comparison correction (see previous section). In particular, we considered the coefficients for the interaction term “EEG_power*day”: a positive coefficient indicates that the effect of power on BCI-control performance becomes stronger with training, and vice versa.

Electromyography

We assessed the potential contribution of muscular activity by computing the classifier’s features and CV accuracy considering exclusively data recorded from the two EMG electrodes, separately for the offline and online session. We then compared these CV accuracies, averaged across training days, with those obtained with the EEG data using a two-tailed t-test. We tested for a linear improvement in CV accuracy computed based on the EMG data throughout training during the online and offline sessions (“LMM with planned contrast”).

Next, we investigated differences in the evolution of the CV accuracy across the 5 days by computing the training slopes for the EMG data, and both sessions. We compared these values with those previously obtained from the EEG data (as displayed in Fig. 1e) by performing a linear mixed model analysis using the Data (EMG/EEG) and the Session (offline/online) as factors.

We assessed the relationship between the improvement in BCI-control performance and EMG activity by calculating the Pearson correlation coefficient between the slope of the CV accuracy on the EMG-online data and the learning slope of BCI-control performances.

Last, we evaluated the similarity between the feature’s weight frequency-wise extracted with the EMG-classifier and EEG-classifier: an overlap between the frequency profiles would likely indicate strong muscular contamination in the discrimination between the two imagined syllables.

Mental chronometry data analysis

First, we considered the average time required to complete the mental chronometry imagery and speaking task for the two modalities and ran a linear mixed model with fixed factors: the Modality (imagine/speak), the Group (continuous/discontinuous feedback), and the “LMM with planned contrast”. This latter linear trend was further probed separately in the two groups of participants by considering the average across both modalities (“LMM with planned contrast”).

Next, we computed an index of deviation from isochrony as the ratio between the average duration of the imagined and spoken syllable repetition according to the following formula already used in a previous BCI study²⁵ (Eq. 2):

$${deviation\; from\; isochrony}={abs}\left(1-\frac{{Imagine}}{{Speak}}\right)$$

(2)

In this formula, the isochrony corresponds to equal time required to perform both tasks, hence the ratio between the two modalities equals 1. According to previous studies, higher values of the deviation from isochrony index are expected to be associated with lower imagery skills.

We probed the evolution of this index across training and between the two groups by performing a linear mixed model with fixed effect the Group (continuous/discontinuous feedback) and the “LMM planned contrast”. The linear trend across days was further tested separately in each group.

Last, we computed the slope of the deviation from isochrony index by fitting, separately for each participant, a linear model considering the deviation from isochrony index measured on each day of training. To investigate whether BCI-learning dynamics could be reflected by isochrony, we computed the Pearson correlation coefficient between this slope and the learning slope modeling the improvement in BCI-control performance previously obtained. In addition, we calculated the Pearson correlation coefficient between the average BCI-control performance and the average deviation from isochrony to probe a relationship independent from training.

Statistics and reproducibility

All statistical analyses were carried out using MATLAB (version R2018b, The MathWorks, Natick, MA, USA) and R version 4.1 (R Core Team, 2021). The sample size was 15 for the main experiment and 10 for the control experiment. Changes across the five days of training were analysed using linear mixed models (LMMs) with a planned contrast to model a linear trend throughout training (“LMM with planned contrast”) as the fixed effect and participants as a random factor. At the individual level, learning was assessed by calculating the slope of a linear fit (learning slope and training slope). Differences between two conditions were evaluated using two-tailed t-tests: paired t-tests for within-group comparisons and unpaired t-tests for between-group comparisons (e.g., comparing data from the main experiment and the control experiment). Differences in performance between blocks in the main experiment were tested using one-way repeated-measures ANOVA. Relationships between two variables were assessed using Pearson correlation coefficients. The significance threshold for all statistical tests was set at 0.05.

Statistics on EEG data were performed using Monte Carlo tests with 1000 permutations, combined with two-tailed paired t-tests and cluster-based corrections to account for multiple comparisons across the scalp. The statistical significance of clusters was determined by comparing their sizes to the distribution of cluster sizes expected under the null hypothesis. Clusters with a p value < 0.05 (two-tailed) were considered significant. Neural changes across training were investigated using the same “LMM with planned contrast” approach described above.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Results

Training improves BCI-control abilities and decoding accuracy

Testing for a change in BCI-control performance throughout 5 training days with the continuous feedback, we observed a linear increase in average performance from Day-1 to -5 (F_1,59 = 5.92, p = 0.018, η²_p = 0.09, Fig. 1b), indicating that imagined speech abilities improved with training, with however marked inter-individual differences (improvement in 11/15 participants, Fig. 1c). Interestingly, we found a positive correlation between the average BCI-control performance obtained considering the entire training period, and individual learning slopes (r = 0.55, p = 0.034, Fig. 1d), showing that the best performers were those who also benefited the most from training.

As expected, there was no difference in performance between the two syllables (T₁₄ = 1.64, p = 0.12, d = 0.42). We found a trend towards significance when testing for differences in performance between the four blocks (F_3,42 = 2.54, p = 0.07, η²_p = 0.15) due to a marginally lower performance during the second block than in the third (T₁₄ = 2.2, p = 0.045, d = 0.57) and fourth (T₁₄ = 2.2, p = 0.043, d = 0.57).

The linear performance improvement trend was confirmed by an increase in the cross-validation accuracy obtained when computing the classifier parameters on offline data (F_1,59 = 9.35, p = 0.0033, η²_p = 0.14, Fig. 1e-left, blue line). This measure exclusively considers neural data, as no feedback is provided during the first part of the experiment. Using the same post-processing approach, we also found a significant improvement over the 5-days of training in the online data (F_1,59 = 17.79, p = 8.6 × 10⁻⁵, η²_p = 0.23, Fig. 1e-left, red line). Across training days, the CV accuracy was significantly higher in the online than the offline session (T₁₄ = 8.3, p = 8.8 × 10⁻⁷, d = 2.14). This difference was not due to a difference in the learning dynamics, as we found no statistical difference between the learning slope in the offline and online sessions (T₁₄ = 1.5, p = 0.15, d = 0.38, Fig. 1e-right). All participants except two received at least one monetary bonus, reflecting performance improvement from one day to the next.

Analysis of the classifier’s features

In a second step, we jointly analyzed participants’ performance and the features used by the classifier to distinguish the imagery of the two syllables. We found a marked correlation between the individual BCI-control performance and the features’ weight (r = 0.83, p = 0.00013, Fig. 2a). This indicates, as expected, that participants performing better present more discriminant features.

We then investigated which frequency bands contribute most prominently to the syllable discrimination and found a peak in contribution straddling the alpha and the lowest end of the beta interval (8–16 Hz), as well as the gamma band (Fig. 2b). Notably there was a linear increase in the average feature weights throughout the entire gamma interval, with the highest values for the highest frequencies up to 70 Hz. The topographical representation of the features according to their frequency shows that the first peak was associated with a cluster over the left central region (Fig. 2c-left), whereas the gamma band contribution originated from temporal regions, bilaterally, and posterior-occipital areas (Fig. 2c-right and Supplementary Fig. 3). A third, distinctive spatial pattern was found in the theta band, characterized by a strong contribution from frontal and temporal regions (Supplementary Fig. 3).

Importantly, the decoded frequencies associated with BCI-learning with continuous (Fig. 2b) and discontinuous feedback (Supplementary Fig. 1b) were similar, with peaks over the alpha and low-beta bands and an increasing contribution in the high-gamma band, and also showed similar topographies (Fig. 2c and Supplementary Fig. 1b). Similarities between two datasets including different participants show that attempted BCI-control based on syllables imagery engaged consistent neural features.

Dynamics of neural features associated with BCI-control learning

First, we considered the global evolution of the features (sum of the first 200 features) and found a significant linear increase in the weights over the course of the training (F_1,59 = 8.62, p = 0.0047, η²_p = 0.13, Supplementary Fig. 2b).

Next, we qualitatively inspected the change in feature weights both in frequency and spatially across the scalp. We found that across participants the most discriminant features consistently localized over temporal regions and involved most prominently frequencies above 30 Hz on each training day (Fig. 2d). While the feature weight distribution on the first two training days was rather scattered over the whole frequency-channel space, with more training it narrowed down to the most discriminant feature clusters (Fig. 2d).

To quantify this change, we first investigated the weights’ evolution frequency-wise and found two frequency ranges that presented a linear change throughout the training period: while the contribution of the 2–10 Hz interval decreased, that of the 52-66 Hz interval increased with training (Fig. 2e).

To identify which regions underpinned this effect, we considered the average over each of these two intervals and ran the same analysis at the individual electrode level. We found a decrease in low-frequency contribution over bilateral temporal and frontal regions, together with an increase in the high-gamma band over left fronto-temporal regions (Fig. 2e)

We then investigated the link between the change in BCI-control performance and the change in the feature space. To do so, we extracted, separately for each participant, a global index representing the amount of change both in the frequency and spatial domains, computed as the Euclidean distance between the weight of two consecutive days (Fig. 2f). By visually inspecting the Euclidean distance for each feature (averaged across participants), we observed a strong overlap of this index (Fig. 2g) and the frequency-channel feature maps (Fig. 2d). We found a strong correlation between the average BCI performance the global index (r = 0.85, p = 0.007 × 10^-2, Fig. 2h), indicating that participants performing better during the real-time control of the visual feedback were also those whose features changed most during training.

Power modulation during BCI-control and neural changes over training days

We subsequently investigated neural changes occurring throughout the BCI-control training considering the power modulation in each frequency band during the 5 s of BCI-control (online session). We first inspected changes during real-time control with respect to the baseline activity (i.e., during the last second of the fixation-cross presentation) and found a significant power decrease over frontal and left-central electrodes in the alpha band, and a similar but more widespread pattern in the beta range (Supplementary Fig. 4) together with enhanced power over posterior-occipital electrodes in the high-gamma band (Supplementary Fig. 4). Next, we investigated linear changes in power modulation throughout training. Overall power increased from day 1 to 5 on all frequency bands (Fig. 3a). In particular, theta and low-gamma bands showed the strongest and most widespread increase across training (Fig. 3a-b). Smaller clusters of power increase were also found in other frequency bands, namely alpha, beta, and high-gamma bands.

**Fig. 3: Evolution of power modulation over the 5 training days and relation with BCI-performance dynamics.**

BCI-control performance and neural changes over training

We then investigated the link between BCI-control performance and power modulation over the 5 training days. We used a linear mixed model with BCI-performance as a dependent variable and, as predictors, the power in each frequency band and the planned contrast modeling a linear trend throughout training. The analysis revealed several clusters of electrodes showing a positive interaction between the two predictors indicating that power variations in these clusters increase their impact on BCI performance with training (Fig. 3c). In other words, specific regions and frequencies show dynamic changes in the direction of a stronger contribution in determining BCI-control. These included clusters located over frontal and central regions in the theta band and over the left temporal region in the gamma band. We found additional smaller significant clusters over the central region in the alpha band, and the left posterior regions in the beta band.

Role of real-time feedback in learning

To assess the importance of accurate real-time feedback in BCI-control improvement, we analyzed the dataset acquired with the discontinuous feedback. Unlike in the main experiment carried out using the continuous feedback, there was no significant increase in BCI-control performance throughout training (F_1,39 = 0.83, p = 0.36, η²_p = 0.02, Fig. 1f).

We computed differences in online vs offline CV accuracy in the discontinuous feedback group and analyzed it together with the CV accuracy in the continuous feedback group. We found a marked linear increase in accuracy across training (main effect of Planned contrast: F_1,23.3 = 10.29, p = 0.003, η²_p = 0.31, Supplementary Fig. 1c), higher accuracy during the online than offline session (main effect of Session: F_1,28.8 = 42.76, p = 3.7 × 10⁻⁰⁷, η²_p = 0.6) and a significant interaction between Session and Group (F_1,28.8 = 9.51, p = 0.004, η²_p = 0.6, Supplementary Fig. 1d). We further developed this interaction with posthoc t-test and found that the marked difference in the offline vs online session was mainly driven by the higher CV accuracy in the online session with the continuous feedback (continuous-online > continuous-offline: T₁₄ = 8.3, p = 8.8 × 10⁻⁰⁷, d = 2.14; continuous-online > discontinuous-offline: T₂₃ = 2.8, p = 0.009, d = 1.15; continuous-online > discontinuous-online: T₂₃ = 1.82, p = 0.082, d = 0.74; continuous-offline < discontinuous-online: T₂₃ = 1.9, p = 0.067, d = 0.78, discontinuous-offline < discontinuous-online: T₉ = 2.3, p = 0.04, d = 0.73; continuous-offline < discontinuous-offline: T₂₃ = 0.53, p = 0.6, d = 0.21).

Comparison of syllable decoding from EEG and EMG

As we asked participants to imagine saying the syllables, a residual muscular activity cannot be excluded. We thus probed the potential contribution of EMG activity in discriminating between the two syllables by computing the CV accuracy of a model using exclusively EMG signals during the offline and online sessions. The average CV accuracy was above chance for both sessions (Fig. 4a). We then compared the EMG-based CV accuracy values to the EEG-based CV accuracy and found opposite effects for the two sessions. While CV accuracy for EMG data was higher than for EEG during the offline session (T₁₄ = 2.2, p = 0.044, d = 0.57, Fig. 4a-left), the opposite was found for the online session with higher CV for EEG than EMG data (T₁₄ = 2.77, p = 0.014, d = 0.71, Fig. 4a-right). Overall, CV accuracy based on EEG-online data was the highest (as compared to both EMG sessions and EEG-offline). Importantly, while EEG-online accuracy showed a strong linear increase throughout training (F_1,59 = 17.79, p = 8.6 × 10⁻⁵, η²_p = 0.23, as already reported at the beginning of the Results section), the same analysis performed with EMG-online data revealed no statistically significant change (F_1,59 = 2.53, p = 0.11, η²_p = 0.04). Data from the EMG-offline session showed that the CV accuracy increased linearly across training (F_1,59 = 8.69, p = 0.0045, η²_p = 0.13).

**Fig. 4: Decoding based on electromyographic (EMG) signals.**

We further investigated differences in the evolution of the CV accuracy across training by computing the training slopes for the EMG data, in both sessions (Fig. 4b), as previously done for the EEG data (Fig. 1e). We found that overall, the slope of the CV accuracy was higher with the EEG data (F_1,14 = 4.5, p = 0.05, η²_p = 0.24), and that the interaction between Data and Session was nearly significant (F_1,14 = 3.36, p = 0.08, η²_p = 0.19). This latter effect was mainly driven by higher values of the learning slope in the EEG-online than EMG-online (T₁₄ = 2.45, p = 0.02, d = 0.63), and EMG-offline data (T₁₄ = 1.78, p = 0.09, d = 0.46). There was no main effect of Session (F_1,14 = 0.57, p = 0.46, η²_p = 0.039).

Next, we tested for a relationship between the slope obtained considering the EMG-online dataset and the behavioral improvement in BCI-control (learning slope) and found a close to significant positive correlation (r = 0.5, p = 0.058, Fig. 4c).

Additionally, we found the distribution of the average weights obtained from the EMG classifier (Fig. 4d) to be markedly different from the histogram obtained considering the EEG features (Fig. 2b). Of note, the different magnitude in the range of the average weights between the EEG and EMG feature space in Figs. 2b, 4d is due to the lower number of EMG features (there were only two EMG channels versus 61 EEG channels).

Mental chronometry results

First, we explored differences and changes across training (“LMM with planned contrast”) in the time required to perform the mental chronometry task considering the two modalities (imagery/speak) and the two groups (continuous/discontinuous feedback, Fig. 5a) with a linear mixed model. We found a marked difference between the two groups in the evolution over time of the task duration for both modalities (main effect of Group: F_1,23 = 10.25, p = 0.004, η²_p = 0.31, interaction Planned Contrast × Group: F_1,23 = 9.11, p = 0.006, η²_p = 0.28, Fig. 5a): participants using the continuous feedback displayed a decrease in the duration of both imagined and spoken tasks, while the opposite trend was found in the group that trained with a discontinuous feedback. There was no statistically significant difference between the two modalities and no other interactions. We further explored the linear trend separately in both groups considering the average duration across the two modalities and found a statistically significant linear increase in the continuous feedback group (F_1,59 = 10.76, p = 0.001, η²_p = 0.15) and a linear decrease in the discontinuous feedback group (F_1,39 = 10.44, p = 0.002, η²_p = 0.21).

Next, we considered the deviation from isochrony index: we found a statistically significant linear increase across training (main effect of Planned Contrast: F_1,98 = 5.6, p = 0.019, η²_p = 0.05, Fig. 5b) and barely significant higher deviation from isochrony in the dataset with continuous feedback than in the discontinuous feedback one (main effect of Group: F_1,23 = 3.96, p = 0.058, η²_p = 0.15, Fig. 5b). The Planned Contrast x Group interaction was not significant (F_1,98 = 0.003, p = 0.95, η²_p = 3.54 × 10⁻⁰⁵).

We further assessed the linear trend across training separately in the two groups and found that the linear increase was statistically significant in the group with discontinuous feedback (F_1,39 = 5.25, p = 0.02, η²_p = 0.12) but not in the group with continuous feedback (F_1,59 = 2.46, p = 0.12, η²_p = 0.04).

Next, we tested for a relationship between BCI-control and the deviation from isochrony and found no statistically significant correlation neither considering the learning slopes of BCI-control performance and isochrony (r = 0.051, p = 0.81, Supplementary Fig. 5a), nor between the average across the training days (r = −0.08, p = 0.69, Supplementary Fig. 5b).

Discussion

The study shows that healthy individuals can learn to control an EEG speech-BCI by training over 5 consecutive days, and uncovers the neural mechanisms related to the acquisition of BCI-control skills. Learning to operate a BCI based on covertly executed tasks has hitherto been investigated almost exclusively in the motor domain^52,53,54. In the field of speech-BCIs, previous studies show that it is possible to improve operating an intracranial BCI through attempted speech^3,4 but not yet via imagined speech. Here, we found that real-time BCI-control performance increased from 55% to 70% over the 5 training days when participants received a real-time and accurate feedback, and that this increase was paralleled with higher discriminability of the neural signals (CV accuracy) during the online than offline session. These results demonstrate that closing the loop is essential to the learning process. In addition, using a discontinuous feedback, we show that accuracy and consistency of the real-time feedback are key features to achieve optimal performance. Feedback alteration consisted of visually informing the subject only when the decoded syllable was the cued one; displaying exclusively successful changes resulted in a discontinuous feedback. Better learning with continuous than discontinuous real-time feedback aligns with previous observations made during motor-BCI training^55,56 and indicates that accurate and high-rate feedback enables an optimal error-driven strategy to control the BCI.

Although the majority of the participants who trained with the continuous feedback (11 out of 15) improved their performance, there were marked inter-individual differences both in control skills and in learning slope, extending to speech-based BCI-control the phenomenon of “BCI-illiteracy”, well-known in motor-imagery BCIs^24,36,57. Poor BCI-operability seems to affect even more severely imagined speech than attempted speech, as on the first training day performance was below chance in most participants, with no outstanding performer. This effect is likely due to several factors including the relatively weak neural signals elicited by speech imagery^14,17, their limited spatial separability, and the restricted access to deeper speech brain regions with surface EEG, a situation that sharply contrasts with the easily decodable focal and superficial patterns elicited by hand motor imagery. Quite predictably, steeper learning was found in better performers. A dichotomy in learners versus non-learners has previously been reported during a single training session of volitional control of individual neurons in mnemonic structures⁵⁸. Here, we show that this dichotomy remains present when training is carried over a longer time period but can however be mitigated by repeating the task over multiple sessions.

Individual factors likely play an important role in acquiring BCI skills, including cognitive, affective, and somatic aspects^25,59. A critical factor in determining BCI-control is arguably a cognitive one, namely the ability to imagine syllables. We thus probed whether learning to control the BCI was accompanied by changes in imagery abilities, and attempted to quantify it using the mental chronometry task. According to this test, good imagery skills would be reflected in an equal time (i.e., isochrony) to perform the imagery and the overt execution. As suggested by a previous motor imagery study⁶⁰, isochrony is expected to be associated with improved BCI-performance over training. The isochrony hypothesis was not confirmed in the group who used the continuous feedback, however, the time taken to repeat imagined and spoken syllables decreased with training, indicating that the task became easier over the training days⁶¹. This facilitation effect might partly underpin the BCI-control improvement in this group. Interestingly, in the group that employed the discontinuous feedback, the difference between imagery and execution increased with training, along with the time taken to repeat the syllables.

These only partly conclusive findings and the lack of a correlation between mental chronometry and BCI-control performance suggest that the isochrony test is probably not the best index to quantify imagined speech learning, as it might be affected by a ceiling effect due to the hyperautomaticity of syllable repetition.

Given extended reports indicating residual EMG activity during inner speech and even the possibility of above-chance EMG decoding (see for a review⁶²), we tested whether EMG signals alone could be classified by computing the CV accuracy in post-processing. The underlying hypothesis is that learning effects should be absent or at least less pronounced in EMG than in EEG data. Consistently, we found no decoding improvement during the online session with EMG, and no distinctive decoding features. However, the above-chance decoding on some of our EMG datasets and a significant increase in CV accuracy offline, indicate that speech imagery was likely accompanied by subthreshold motor activation, a finding that is compatible with the fact that participants were instructed to imagine pronouncing the syllables (rather than e.g., hearing syllables). The presence of residual EMG activity during mental imagery has been the subject of debate for almost a century^63,64 and there are still contrasting results in the field of speech imagery^{42,62,65,66,67}. Our results are in line with the Motor Simulation View (in contrast with the Abstraction View) of inner speech, where peripheral muscular activity would result from imperfect inhibition of motor commands^62,64, possibly accounting for the selectivity of EMG signals to specific phonemes^{42,68,69,70,71}. Above-chance decoding found on average on some days confirms that EMG activity is more than a merely non-specific tonic activation^42,66, and is subject to marked inter-individual differences⁴². EMG activity during imagery is also modulated by the intensity of the mental effort^64,72, an effect that given the participants’ reported experience, likely contributes to the observed correlation with the learning slope. Critically, during the online session, EEG-based decoding achieved significantly higher accuracy than EMG-based decoding, and showed an improvement across training, unlike EMG-based decoding. This, along with the distinct patterns of decoding frequencies, indicates that potential contamination of EEG signals by EMG activity (either as muscle artifacts or neural activity elicited by overt speech) did not interfere with the acquisition of BCI-skills. While EMG signals likely contained information about the syllable choice, learning involved changes occurring at the level of neural activity elicited by covert speech. Significant learning effects might also be observed by providing a feedback based exclusively on EMG, given the high-decoding accuracy achieved with a speech-BCI based on EMG signals⁶⁶. This kind of closed-loop system could benefit patients who retain some residual orofacial movements, and for non-invasive solutions, might have some efficacy. This question remains however outside the scope of the present study.

Here, one of the main goals was to explore the evolution of neural features throughout the learning process. The BCI-control improvement was accompanied by specific changes in the decoding features and in the EEG power. On average, the most discriminant features were located over the temporal regions bilaterally in the gamma band, and over the left sensorimotor cortex in the 8–16 Hz range, overlapping with key speech areas^37,73 previously exploited as decoding sites^6,29. These neural features were qualitatively similar whether the subject got continuous or discontinuous online feedback, indicating that the basic set of neural features mobilized by operating a BCI with syllable imagery was independent of the experimental specificities. The feedback dynamics however was key to the learning process.

Over the 5 training days, we qualitatively observed a pruning effect within the features’ space, with the least discriminant features progressively decreasing their contribution in favor of more focal clusters around the features contributing most to the classification. Specifically, lower frequencies in frontal and temporal regions decreased their contribution in favor of a stronger involvement of the high-gamma band in frontal and left centro-temporal regions. Similar pruning effects have been observed with fMRI-neurofeedback training, resulting in a reduction of redundant connections while strengthening the relevant ones in a restricted set of brain regions⁷⁴. Whether this effect is more pronounced when the action is performed without motor output, such as in an imagined speech BCI, has yet to be determined. Importantly, higher BCI-control over the 5 training days was associated with stronger changes in the feature space, indicating that features’ dynamics play a crucial role in the learning process. From a technical viewpoint, this implies that the classifiers should be set to grasp the individual learning profile by a dynamic calibration of their parameters while the BCI is being operated, such as with adaptive classifiers that are able to account for learning-related changes in real-time⁷⁵.

The power of the neural activity elicited by BCI control (irrespective of the imagined syllable) also substantially increased with training over the entire spectrum, most prominently in the theta and low-gamma band. Both frequency bands are highly relevant in speech perception and production, respectively underpinning syllabic and phonemic processing^76,77. Interestingly, we found that these same two frequency bands were prominently influencing BCI performance as learning progressed, specifically over fronto-central regions for theta power and the left temporal area for the low-gamma band. The fact that the contribution of the theta band as a decoding feature decreased (Fig. 2e) while undergoing substantial power increase across training (Fig. 3a) in relation to performance improvement (Fig. 3c) might appear contradictory. The increase in theta power might however reflect non-syllable specific mnemonic encoding⁷⁸ via changes in synaptic plasticity⁷⁹ known to occur over the same time scale as the training duration in our experiment⁸⁰. Further analyses are necessary to elucidate a potential top-down role of the theta band on higher frequency bands involved in discriminating between the two syllables.

The present study fills an important gap in the field of imagined speech BCI, which traditionally suffers from low performance, by showing that controllability can be improved with training, even when starting from chance-level performance. It provides solid neurophysiological grounds to improve current BCI systems based on speech-imagery, notably by enhancing decoding using a pre-defined subset of brain regions over temporal and fronto-central areas and frequency bands, which we found to be implicated in both decoding and learning. Although surface EEG-based BCIs are unlikely to become stand-alone communication devices, they will find valuable applications in the field of neurotechnology for language disorders. Training to perform real-time control could be used to select those patients who would benefit most from invasive BCIs for communication⁸¹. To validate such an approach, future work is required to 1- establish the correspondence between surface and intracranial EEG recordings of the neural activity elicited by imagined speech, as previously suggested for sensorimotor rhythms in patients with locked-in syndrome (LIS)⁸², and 2- take into consideration the potential reduced BCI-controllability in patients as compared to healthy users⁸³. Indeed, BCIs based on imagined speech might not be suitable for all patients in the long term, such as those in which motor impairments are accompanied by progressive cognitive decline (e.g., LIS⁸⁴). They might however benefit many individuals in whom these functions are spared, for instance, patients with post-stroke aphasia, where attention and global control are generally preserved⁸⁵ and importantly, imagery skills are better retained than spoken language^86,87,88,89. In these patients, rehabilitative interventions based on closing the loop on imagery attempts with real-time feedback could be expected to mobilize residual neural patterns and promote neural plasticity. Importantly, such interventions will have to be adapted to each individual’s residual speech ability and specific impairment.

The future of speech-imagery BCIs holds promise for a variety of purposeful scenarios, particularly those that rely on human-machine co-adaptation.

Data availability

Source data underlying the graphs can be found at this link: https://osf.io/vr26k/. Raw data that support the findings of this study are available from the corresponding author upon reasonable request.

Code availability

Custom-made scripts for the analyses can be found at this link: https://osf.io/vr26k/⁹⁰.

References

Hilari, K., Cruice, M., Sorin-Peters, R. & Worrall, L. Quality of life in aphasia: state of the art. Folia Phoniatr. Logop. 67, 114–118 (2015).
Article PubMed Google Scholar
Rousseau, M. C. et al. Quality of life in patients with locked-in syndrome: evolution over a 6-year period. Orphanet J. Rare Dis. 10, 4–11 (2015).
Article Google Scholar
Guenther, F. H. et al. A wireless brain-machine interface for real-time speech synthesis. PLoS ONE 4, e8218 (2009).
Article PubMed PubMed Central Google Scholar
Metzger, S. L. et al. A high-performance neuroprosthesis for speech decoding and avatar control. Nature 620, 1037–1046 (2023).
Article CAS PubMed PubMed Central Google Scholar
Metzger, S. L. et al. Generalizable spelling using a speech neuroprosthesis in an individual with severe limb and vocal paralysis. Nat. Commun. 13, 6510 (2022).
Article CAS PubMed PubMed Central Google Scholar
Moses, D. A. et al. Neuroprosthesis for decoding speech in a paralyzed person with anarthria. N. Engl. J. Med. 385, 217–227 (2021).
Article PubMed PubMed Central Google Scholar
Wilson, G. H. et al. Decoding spoken English from intracortical electrode arrays in dorsal precentral gyrus. J. Neural Eng. 17, 066007 (2020).
Article PubMed PubMed Central Google Scholar
Willett, F. R. et al. A high-performance speech neuroprosthesis. Nature 620, 1031–1036 (2023).
Article CAS PubMed PubMed Central Google Scholar
Card, N. S. et al. An accurate and rapidly calibrating speech neuroprosthesis. N. Engl. J. Med. 391, 609–618 (2024).
Article PubMed PubMed Central Google Scholar
Cooney, C., Folli, R. & Coyle, D. Neurolinguistics research advancing development of a direct-speech brain-computer interface. iScience 8, 103–125 (2018).
Article PubMed PubMed Central Google Scholar
Martin, S. et al. Decoding inner speech using electrocorticography: progress and challenges toward a speech prosthesis. Front. Neurosci. 12, 1–10 (2018).
Article Google Scholar
Ikeda, S. et al. Neural decoding of single vowels during covert articulation using electrocorticography. Front. Hum. Neurosci. 8, 1–8 (2014).
Article Google Scholar
Brumberg, J. S. et al. Spatio-temporal progression of cortical activity related to continuous overt and covert speech production in a reading task. PLoS ONE 11, e0166872 (2016).
Article PubMed PubMed Central Google Scholar
Leuthardt, E. C. et al. Temporal evolution of gamma activity in human cortex during an overt and covert word repetition task. Front. Hum. Neurosci. 6, 1–12 (2012).
Article Google Scholar
Martin, S. et al. Word pair classification during imagined speech using direct brain recordings. Sci. Rep. 6, 25803 (2016).
Article CAS PubMed PubMed Central Google Scholar
Pei, X. et al. Spatiotemporal dynamics of electrocorticographic high gamma activity during overt and covert word repetition. Neuroimage 54, 2960–2972 (2011).
Article PubMed Google Scholar
Proix, T. et al. Imagined speech can be decoded from low- and cross-frequency intracranial EEG features. Nat. Commun. 13, 48 (2022).
Article CAS PubMed PubMed Central Google Scholar
Soroush, P. Z. et al. The nested hierarchy of overt, mouthed, and imagined speech activity evident in intracranial recordings. Neuroimage 269, 119913 (2023).
Article PubMed Google Scholar
Leuthardt, E. C. et al. Using the electrocorticographic speech network to control a brain–computer interface in humans. J. Neural Eng. 8, 036004 (2011).
Article PubMed PubMed Central Google Scholar
Sereshkeh, A. R., Trott, R., Bricout, A. & Chau, T. Online EEG classification of covert speech for brain–computer interfacing. Int. J. Neural Syst. 27, 1750033 (2017).
Article PubMed Google Scholar
Angrick, M. et al. Real-time synthesis of imagined speech processes from minimally invasive recordings of neural activity. Commun. Biol. 4, 1055 (2021).
Article PubMed PubMed Central Google Scholar
Wandelt, S. K. et al. Representation of internal speech by single neurons in human supramarginal gyrus. Nat. Hum. Behav. 8, 1136–1149 (2024).
Article PubMed PubMed Central Google Scholar
Martin, S. et al. Decoding spectrotemporal features of overt and covert speech from the human cortex. Front. Neuroeng. 7, 1–15 (2014).
Article Google Scholar
Ahn, M. & Jun, S. C. Performance variation in motor imagery brain-computer interface: a brief review. J. Neurosci. Methods 243, 103–110 (2015).
Article PubMed Google Scholar
Marchesotti, S., Bassolino, M., Serino, A., Bleuler, H. & Blanke, O. Quantifying the role of motor imagery in brain-machine interfaces. Sci. Rep. 6, 24076 (2016).
Article CAS PubMed PubMed Central Google Scholar
Maiseli, B. et al. Brain–computer interface: trend, challenges, and threats. Brain Inform. 10, 20 (2023).
Article PubMed PubMed Central Google Scholar
Blabe, C. H. et al. Assessment of brain–machine interfaces from the perspective of people with paralysis. J. Neural Eng. 12, 043002 (2015).
Article PubMed PubMed Central Google Scholar
Panachakel, J. T., Ramakrishnan, A. G. & Ananthapadmanabha, T. V. A novel deep learning architecture for decoding imagined speech from EEG. arXiv Prepr. arXiv:2003, (2020).
Panachakel, J. T. & Ramakrishnan, A. G. Decoding covert speech from EEG-A comprehensive review. Front. Neurosci. 15, 642251 (2021).
Article PubMed PubMed Central Google Scholar
Cooney, C., Folli, R. & Coyle, D. Opportunities, pitfalls and trade-offs in designing protocols for measuring the neural correlates of speech. Neurosci. Biobehav. Rev. 140, 104783 (2022).
Article PubMed Google Scholar
Lopez-Bernal, D., Balderas, D., Ponce, P. & Molina, A. A state-of-the-art review of EEG-based imagined speech decoding. Front. Hum. Neurosci. 16, 1–14 (2022).
Article Google Scholar
D’Zmura, M., Deng, S., Lappas, T., Thorpe, S. & Srinivasan, R. Toward EEG Sensing of Imagined Speech. In Lecture Notes in Computer Science, Vol. 5610 (ed. Jacko, J.A.) 40–48 (Springer, 2009).
DaSalla, C. S., Kambara, H., Sato, M. & Koike, Y. Single-trial classification of vowel speech imagery using common spatial patterns. Neural Netw. 22, 1334–1339 (2009).
Article PubMed Google Scholar
Wang, L., Zhang, X., Zhong, X. & Zhang, Y. Analysis and classification of speech imagery EEG for BCI. Biomed. Signal Process. Control 8, 901–908 (2013).
Article Google Scholar
Nguyen, C. H., Karavas, G. K. & Artemiadis, P. Inferring imagined speech using EEG signals: a new approach using Riemannian manifold features. J. Neural Eng. 15, 016002 (2018).
Article PubMed Google Scholar
Blankertz, B. et al. Neurophysiological predictor of SMR-based BCI performance. Neuroimage 51, 1303–1309 (2010).
Article PubMed Google Scholar
Bouchard, K. E., Mesgarani, N., Johnson, K. & Chang, E. F. Functional organization of human sensorimotor cortex for speech articulation. Nature 495, 327–332 (2013).
Article CAS PubMed PubMed Central Google Scholar
Chartier, J., Anumanchipalli, G. K., Johnson, K. & Chang, E. F. Encoding of articulatory kinematic trajectories in human speech sensorimotor cortex. Neuron 98, 1042–1054.e4 (2018).
Article CAS PubMed PubMed Central Google Scholar
Mesgarani, N., Cheung, C., Johnson, K. & Chang, E. F. Phonetic feature encoding in human superior temporal gyrus. Sci. 343, 1006–1010 (2014).
Article CAS Google Scholar
Alderson-Day, B. & Fernyhough, C. Inner speech: development, cognitive functions, phenomenology, and neurobiology. Psychol. Bull. 141, 931–965 (2015).
Article PubMed PubMed Central Google Scholar
Guillot, A. & Collet, C. Duration of mentally simulated movement: a review. J. Mot. Behav. 37, 10–20 (2005).
Article CAS PubMed Google Scholar
Nalborczyk, L., Grandchamp, R., Koster, E. H. W., Perrone-Bertolotti, M. & Lœvenbruck, H. Can we decode phonetic features in inner speech using surface electromyography? PLoS ONE 15, e0233282 (2020).
Article CAS PubMed PubMed Central Google Scholar
Graves, R., Goodglass, H. & Landis, T. Mouth asymmetry during spontaneous speech. Neuropsychologia 20, 371–381 (1982).
Article CAS PubMed Google Scholar
Campbell, R. Asymmetries in moving faces. Br. J. Psychol. 73, 95–103 (1982).
Article CAS PubMed Google Scholar
Lee, K., Liu, D., Perroud, L., Chavarriaga, R. & Millán, J. D. R. A brain-controlled exoskeleton with cascaded event-related desynchronization classifiers. Rob. Auton. Syst. 90, 15–23 (2017).
Article Google Scholar
Thenaisie, Y. et al. Principles of gait encoding in the subthalamic nucleus of people with Parkinson’s disease. Sci. Transl. Med. 14, eabo1800 (2022).
Wu, S., Bhadra, K., Giraud, A. & Marchesotti, S. Adaptive LDA classifier enhances real-time control of an EEG brain–computer interface for decoding imagined syllables. Brain Sci. 14, 196 (2024).
Article CAS PubMed PubMed Central Google Scholar
Steyrl, D., Scherer, R., Faller, J. & Müller-Putz, G. R. Random forests in non-invasive sensorimotor rhythm brain-computer interfaces: a practical and convenient non-linear classifier. Biomed. Eng. Biomed. Tech. 61, 77–86 (2016).
Article Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Oostenveld, R., Fries, P., Maris, E. & Schoffelen, J.-M. FieldTrip: open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data. Comput. Intell. Neurosci. 2011, 1–9 (2011).
Article Google Scholar
Chaumon, M., Bishop, D. V. M. & Busch, N. A. A practical guide to the selection of independent components of the electroencephalogram for artifact correction. J. Neurosci. Methods 250, 47–63 (2015).
Article PubMed Google Scholar
Green, A. M. & Kalaska, J. F. Learning to move machines with the mind. Trends Neurosci. 34, 61–75 (2011).
Article CAS PubMed Google Scholar
Orsborn, A. L. & Pesaran, B. Parsing learning in networks using brain–machine interfaces. Curr. Opin. Neurobiol. 46, 76–83 (2017).
Article CAS PubMed PubMed Central Google Scholar
Kumar, S., Alawieh, H., Racz, F. S., Fakhreddine, R. & Millán, J. D. R. Transfer learning promotes acquisition of individual BCI skills. PNAS Nexus 3, 1–15 (2024).
Article Google Scholar
Roc, A. et al. A review of user training methods in brain computer interfaces based on mental tasks. J. Neural Eng. 18, 011002 (2021).
Article Google Scholar
Neuper, C., Schlögl, A. & Pfurtscheller, G. Enhancement of left-right sensorimotor EEG differences during feedback-regulated motor imagery. J. Clin. Neurophysiol. 16, 373–382 (1999).
Article CAS PubMed Google Scholar
Vidaurre, C. & Blankertz, B. Towards a cure for BCI illiteracy. Brain Topogr. 23, 194–198 (2010).
Article PubMed Google Scholar
Patel, K., Katz, C. N., Kalia, S. K., Popovic, M. R. & Valiante, T. A. Volitional control of individual neurons in the human brain. Brain 144, 3651–3663 (2021).
Article PubMed PubMed Central Google Scholar
Blankertz, B., Lemm, S., Treder, M., Haufe, S. & Müller, K.-R. Single-trial analysis and classification of ERP components—A tutorial. Neuroimage 56, 814–825 (2011).
Article PubMed Google Scholar
Liepert, J., Stürner, J., Büsching, I., Sehle, A. & Schoenfeld, M. A. Effects of a single mental chronometry training session in subacute stroke patients—a randomized controlled trial. BMC Sports Sci. Med. Rehabil. 12, 66 (2020).
Article PubMed PubMed Central Google Scholar
Guillot, A., Hoyek, N., Louis, M. & Collet, C. Understanding the timing of motor imagery: recent findings and future directions. Int. Rev. Sport Exerc. Psychol. 5, 3–22 (2012).
Article Google Scholar
Perrone-Bertolotti, M., Rapin, L., Lachaux, J.-P., Baciu, M. & Lœvenbruck, H. What is that little voice inside my head? Inner speech phenomenology, its role in cognitive performance, and its relation to self-monitoring. Behav. Brain Res. 261, 220–239 (2014).
Article CAS PubMed Google Scholar
Jacobson, E. Electrophysiology of mental activities. Am. J. Psychol. 44, 677–694 (1932).
Article Google Scholar
Guillot, A., Di Rienzo, F., MacIntyre, T., Moran, A. & Collet, C. Imagining is not doing but involves specific motor commands: a review of experimental data related to motor inhibition. Front. Hum. Neurosci. 6, 1–22 (2012).
Article Google Scholar
Oppenheim, G. M. & Dell, G. S. Motor movement matters: the flexible abstractness of inner speech. Mem. Cogn. 38, 1147–1160 (2010).
Article Google Scholar
Kapur, A., Kapur, S. & Maes, P. AlterEgo: a personalized wearable silent speech interface. In Proc. 23rd International Conference on Intelligent User Interfaces, 43–53. https://doi.org/10.1145/3172944.3172977 (ACM, 2018).
Meltzner, G. S. et al. Speech recognition for vocalized and subvocal modes of production using surface EMG signals from the neck and face. In Interspeech, 2667–2670. https://doi.org/10.21437/Interspeech.2008-661 (ISCA, 2008).
McGuigan, F. J. & Dollins, A. B. Patterns of covert speech behavior and phonetic coding. Pavlov. J. Biol. Sci. 24, 19–26 (1989).
Article CAS PubMed Google Scholar
Locke, J. L. & Fehr, F. S. Subvocal rehearsal as a form of speech. J. Verbal Learn. Verbal Behav. 9, 495–498 (1970).
Article Google Scholar
Livesay, J., Liebke, A., Samaras, M. & Stanley, A. Covert speech behavior during a silent language recitation task. Percept. Mot. Skills 83, 1355–1362 (1996).
Article CAS PubMed Google Scholar
Nalborczyk, L. et al. Orofacial electromyographic correlates of induced verbal rumination. Biol. Psychol. 127, 53–63 (2017).
Article PubMed Google Scholar
Slade, J. M., Landers, D. M. & Martin, P. E. Muscular activity during real and imagined movements: a test of inflow explanations. J. Sport Exerc. Psychol. 24, 151–167 (2002).
Article Google Scholar
Hickok, G. & Poeppel, D. The cortical organization of speech processing. Nat. Rev. Neurosci. 8, 393–402 (2007).
Article CAS PubMed Google Scholar
Lee, S. et al. Detection of cerebral reorganization induced by real-time fMRI feedback training of insula activation. Neurorehabil. Neural Repair 25, 259–267 (2011).
Article PubMed Google Scholar
Shenoy, P., Krauledat, M., Blankertz, B., Rao, R. P. N. & Müller, K.-R. Towards adaptive classification for BCI. J. Neural Eng. 3, R13–R23 (2006).
Article PubMed Google Scholar
Giraud, A.-L. & Poeppel, D. Cortical oscillations and speech processing: emerging computational principles and operations. Nat. Neurosci. 15, 511–517 (2012).
Article CAS PubMed PubMed Central Google Scholar
Gross, J. et al. Speech rhythms and multiplexed oscillatory sensory coding in the human brain. PLoS Biol. 11, e1001752 (2013).
Article PubMed PubMed Central Google Scholar
Herweg, N. A., Solomon, E. A. & Kahana, M. J. Theta oscillations in human memory. Trends Cogn. Sci. 24, 208–227 (2020).
Article PubMed PubMed Central Google Scholar
Greenstein, Y. J., Pavlides, C. & Winson, J. Long-term potentiation in the dentate gyrus is preferentially induced at theta rhythm periodicity. Brain Res 438, 331–334 (1988).
Article CAS PubMed Google Scholar
Rioult-Pedotti, M. S., Friedman, D., Hess, G. & Donoghue, J. P. Strengthening of horizontal cortical connections following skill learning. Nat. Neurosci. 1, 230–234 (1998).
Article CAS PubMed Google Scholar
Chaudhary, U. et al. Spelling interface using intracortical signals in a completely locked-in patient enabled via auditory neurofeedback training. Nat. Commun. 13, 1–9 (2022).
Article Google Scholar
Hnazaee, M. F. et al. Towards predicting ECoG-BCI performance: assessing the potential of scalp-EEG. J. Neural Eng. 19, 046045 (2022).
Article Google Scholar
Séguin, P., Maby, E. & Mattout, J. Why BCIs work poorly with the patients who need them the most? In Proc. 8th Graz Brain-Computer Interface Conference. https://doi.org/10.48550/arXiv.2302.06312 (Graz University of Technology, Austria, 2019).
Séguin, P. et al. The challenge of controlling an auditory BCI in the case of severe motor disability. J. Neuroeng. Rehabil. 21, 9 (2024).
Article PubMed PubMed Central Google Scholar
Brownsett, S. L. E. et al. Cognitive control and its impact on recovery from aphasic stroke. Brain 137, 242–254 (2014).
Article PubMed Google Scholar
Fama, M. E., Hayward, W., Snider, S. F., Friedman, R. B. & Turkeltaub, P. E. Subjective experience of inner speech in aphasia: preliminary behavioral relationships and neural correlates. Brain Lang. 164, 32–42 (2017).
Article PubMed Google Scholar
Fama, M. E. & Turkeltaub, P. E. Inner speech in aphasia: current evidence, clinical implications, and future directions. Am. J. Speech Lang. Pathol. 29, 560–573 (2020).
Article PubMed Google Scholar
Sierpowska, J. et al. The black box of global aphasia: neuroanatomical underpinnings of remission from acute global aphasia with preserved inner language function. Cortex 130, 340–350 (2020).
Article PubMed Google Scholar
Stark, B. C., Geva, S. & Warburton, E. A. Inner speech’s relationship with overt speech in poststroke aphasia. J. Speech Lang. Hear. Res. 60, 2406–2415 (2017).
Article PubMed Google Scholar
Custom-code. Available at: https://doi.org/10.17605/OSF.IO/VR26K.

Download references

Acknowledgements

We thank the Human Neuroscience Platform of the Fondation Campus Biotech Geneva and Shizhe Wu for technical advice. This study has been supported by the National Center of Competence in Research “Evolving Language”, Swiss National Science Foundation Agreement #51NF40_180888 and the Fondation pour l’Audition.

Author information

These authors contributed equally: Anne-Lise Giraud, Silvia Marchesotti.

Authors and Affiliations

Department of Basic Neurosciences, Faculty of Medicine, University of Geneva, Geneva, Switzerland
Kinkini Bhadra, Anne-Lise Giraud & Silvia Marchesotti
Université Paris Cité, Institut Pasteur, AP-HP, Inserm, Fondation Pour l’Audition, Institut de l’Audition, IHU reConnect, Paris, France
Anne-Lise Giraud

Authors

Kinkini Bhadra
View author publications
Search author on:PubMed Google Scholar
Anne-Lise Giraud
View author publications
Search author on:PubMed Google Scholar
Silvia Marchesotti
View author publications
Search author on:PubMed Google Scholar

Contributions

Kinkini Bhadra: conceptualization, methodology, software, investigation, data curation, formal analysis, writing - original draft and editing, visualization. Anne-Lise Giraud: conceptualization, supervision, writing – reviewing and editing, funding acquisition. Silvia Marchesotti: conceptualization, methodology, software, formal analysis, data curation, writing – original draft and editing, visualization, supervision.

Corresponding author

Correspondence to Silvia Marchesotti.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Biology thanks Maxime Verwoert and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: Christian Beste and Joao Valente. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Transparent Peer Review file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bhadra, K., Giraud, AL. & Marchesotti, S. Learning to operate an imagined speech Brain-Computer Interface involves the spatial and frequency tuning of neural activity. Commun Biol 8, 271 (2025). https://doi.org/10.1038/s42003-025-07464-7

Download citation

Received: 09 October 2023
Accepted: 03 January 2025
Published: 20 February 2025
Version of record: 20 February 2025
DOI: https://doi.org/10.1038/s42003-025-07464-7

This article is cited by

Non-Invasive Brain-Computer Interfaces: Converging Frontiers in Neural Signal Decoding and Flexible Bioelectronics Integration
- Sheng Wang
- Xiaobin Song
- Linwei Yu
Nano-Micro Letters (2026)
EEG Resting-state Microstate Dynamics in Children and Adolescents with Avoidant/Restrictive Food Intake Disorder (ARFID)
- Kinkini Bhadra
- Antony A. Janakiram
- Cristina Berchio
Brain Topography (2025)