Introduction

Cognitive decline poses a significant challenge, making the implementation of timely detection methods essential1,2. The advent of disease-modifying therapies such as Aducanumab3,4 and Lecanemab5, which target amyloid plaques, a hallmark of Alzheimer’s Disease (AD), offers potential to alter disease progression. These FDA-approved therapies have demonstrated efficacy primarily when administered in the initial phases of AD. Identifying subtle changes in cognitive function before significant deficits occur is paramount for maximizing the therapeutic benefits of these drugs, ultimately aiming to preserve cognitive function and improve quality of life in at-risk individuals6. Standard tools such as the Mini-Mental State Examination (MMSE)7 and the Montreal Cognitive Assessment (MoCA)8, are widely used in clinical settings. Despite its widespread use, the MMSE is criticized for limited sensitivity and susceptibility to confounding factors such as education levels, age and socioeconomic background9,10. Research shows that this lack of sensitivity allows many patients with cognitive impairments to go unnoticed, undermining the need for cognitive rehabilitation11. Consequently, while the MMSE remains a valuable initial screening tool, its limitations underscore the importance of supplementing it with additional assessments to ensure a more comprehensive evaluation of cognitive function, particularly during the early stages of cognitive decline12.

In this regard, biomarkers derived from cerebrospinal fluid (CSF) and blood offer complementary diagnostic perspectives, providing a window into the molecular changes occurring in the brain. Their levels can reflect distinct stages of neurodegeneration, allowing clinicians and researchers to assess disease progression and stratify patients for targeted interventions13. Beta-amyloid (Aβ), for instance, is a pivotal biomarker in AD research. Advances in assay technologies have recently facilitated the detection of Aβ biomarkers in blood, addressing challenges such as low concentrations and interference from peripheral sources. The CSF Aβ42/Aβ40 ratio is widely regarded as the gold standard for identifying amyloid pathology. Furthermore, emerging studies on plasma biomarkers for AD have demonstrated their potential to differentiate AD from other neurodegenerative disorders, predict the onset of AD dementia in its early stages, and provide valuable insights into disease progression13,14.

While CSF and blood biomarkers offer valuable insights for understanding disease pathology, they are often invasive, resource-intensive, and less suited for capturing dynamic brain activity during cognitive tasks. Electroencephalography (EEG) provides a non-invasive window into real-time cognitive processes, offering insights into the neurophysiological disruptions associated with cognitive decline. This approach complements molecular biomarkers by bridging the gap between disease pathology and functional brain dynamics, enabling a more comprehensive evaluation of cognitive health. EEG is effectively identifying changes in power spectral density, disruptions in functional connectivity, and altered coherence patterns associated with cognitive decline and AD15,16. These disruptions in neural processing and connectivity underscore the complexity of neurophysiological changes linked to declining cognitive functions17. Previous research has extensively explored the impact of cognitive load on various frequency bands, particularly within the frontal lobe. Frontal Theta activity, for example, increases with the growing demands on memory retention across various cognitive tasks like the n-back18,19,20. Diminished Theta power has been observed in individuals with mild cognitive impairment (MCI) and AD during auditory and visual oddball tasks compared to healthy controls21. Similarly, Delta activity has been shown to play a role in inhibiting potential interferences that may affect performance in high-load cognitive tasks22, while diminished Delta power has been observed in MCI and AD patients during oddball tasks23. Gamma waves, associated with prefrontal cortex activation during cognitive processing, exhibit positive correlations with the fMRI-BOLD signal in frontal regions24. In middle-aged adults, heightened frontal Gamma activity was observed during the high cognitive load level of verbal n-back task compared to younger adults, suggesting age-related modulation of Gamma oscillations25. However, reduced Gamma oscillations in elderly individuals (mean age 75) indicate that Gamma activity may increase until midlife and starts to decline in older age26. Research shows cognitive decline often involves reduced amplitude and synchronization of Gamma wave activity27. AD patients demonstrate elevated Gamma activity during performance of cognitive tasks, potentially reflecting increased resource allocation under cognitive load compared to healthy seniors28. Beta waves, linked to cognitive processes such as working memory maintenance and information inhibition, have also shown correlations with fMRI-BOLD signals in the prefrontal cortex and exhibited a positive load effect specifically during cognitive working memory tasks24. In the prefrontal cortex, heightened Beta activity aids in processes like information erasure from working memory and preserves contents during delay periods29. Furthermore, while behavioral performance was similar between young and healthy elderly participants in an auditory memory task study, notable differences in Beta band desynchronization during retrieval suggest age-related influences on Beta responses during working memory task30. Notably, Beta power was higher in MCI patients compared to controls, both at rest and during working memory tasks31. Understanding how cognitive load influences frequency bands in the frontal lobe contributes valuable insights into the neural mechanisms underlying cognitive processes and can shed light on cognitive decline. Building on these insights, EEG biomarkers enable clinicians to monitor activity in real time, capturing dynamic changes across various tasks and functions. This facilitates a deeper understanding of patient challenges, which can also be correlated with self-reported data. Furthermore, EEG’s non-invasive nature, high accuracy, and straightforward interpretability enhance its utility in clinical settings, providing objective, quantifiable data to identify abnormal brain patterns and support timely diagnosis and intervention for individuals at risk32. Advancements in device design and analytical approaches have further improved the accessibility and clinical relevance of EEG biomarkers. The present study highlights the development of novel biomarkers derived from a single-channel EEG to effectively assess cognitive states. Single-channel EEG retains the key advantages of multichannel systems—being non-invasive, cost-effective, and portable—while offering additional benefits, such as forehead placement, which eliminates the need for a full cap, enhancing comfort and simplifying the setup for participants and clinicians. This streamlined approach supports more frequent recordings and may be particularly suitable for elderly individuals and clinical settings, reducing the need for hospital visits. Notably, single-channel EEG achieves these benefits without compromising data quality33.

In recent years, machine learning (ML) and deep learning (DL) approaches have been increasingly applied to EEG data to predict early cognitive decline, yielding high accuracy results. A review analyzing 209 studies found that DL models, particularly convolutional neural networks (CNNs) and support vector machines (SVMs), can achieve accuracies exceeding 93% in distinguishing between cognitive decline stages34. Another review of 116 studies on the progression from MCI to AD reported that ML techniques, including SVMs, random forests, and CNNs, delivered classification accuracies up to 95% and AUC values of 0.98 for EEG-based predictions35. Additionally, ensemble methods and feature selection techniques like Lasso and ElasticNet were frequently used to enhance model performance36. While these results show potential, the effectiveness of ML and DL techniques applied to EEG data is often limited by small sample sizes, impacting generalizability and increasing the risk of overfitting. Studies with larger EEG cohorts have demonstrated the potential of these approaches to detect cognitive deficits, enhancing their applicability to broader populations and widespread use in clinical settings. For instance, a study involving frontotemporal EEG data from 120 participants showed that EEG multifractal analysis, combined with ML models, effectively detected MCI in healthy individuals, correlating well with normal MMSE scores (≥ 26)37. Another study analyzed resting-state prefrontal EEG biomarkers from 496 elderly individuals and used various ML methods (including WLS, Ridge, ElasticNet and Lasso) to predict cognitive impairment. These models achieved moderate AUC (0.84) and accuracy (0.75), effectively differentiating between individuals at risk of MCI and those with cognitive deficits38. These findings underscore the potential of advanced ML and DL methods for accurate, non-invasive early diagnosis of cognitive decline.

This study includes pre-extracted EEG features, identified as significant markers of cognitive function, validated in prior research conducted on diverse populations, including young, healthy subjects39,40,41, and elderly experiencing various cognitive states42,43. These features were utilized in our pilot study and are further applied in the current studies to assess their effectiveness in detecting cognitive decline. Our previous pilot study44 included 50 seniors with MMSE scores ranging from 10 to 30, divided into three groups: healthy individuals (MMSE > 27), individuals at risk of MCI (MCI-R, MMSE 24–27), and individuals with mild dementia (MD, MMSE < 24). EEG data was collected during an auditory cognitive assessment with varying cognitive load levels and at rest. Pre-extracted EEG features showed significant correlations with MMSE scores, particularly ST4 and A0, across task difficulty levels. Furthermore, these features effectively distinguished between seniors with high vs. low MMSE scores. EEG features Theta, Delta, A0, and VC9 increased with higher cognitive load levels, indicating different activity patterns between young and senior participants in different cognitive states, particularly notable for VC9, which differentiated between all levels of cognitive load. This pilot study demonstrated that single-channel wearable EEG and ML features can effectively evaluate cognitive states and align with clinical measurements for detecting cognitive decline.

The recent FDA approval of drugs designed to slow beta amyloid buildup in AD, results in seniors increasingly seeking evaluations for eligibility for these new treatments, placing a substantial burden on clinicians. This situation intensified the demand for highly specific AI-based assessments that can accurately distinguish between healthy individuals and those who may need further evaluation. Motivated by this need, the first study presented here focuses on the high range of MMSE scores (24–30, typically considered healthy), aiming to distinguish between cognitively healthy individuals and those who may be at risk for early cognitive decline, with a cutoff score of 27. The goal was to identify subtle cognitive changes that may signal the onset of decline among elderly individuals using EEG biomarkers. To enhance the rigor of the study, participants underwent two EEG recording sessions conducted on separate days within a seven-day period. This dual-session design was implemented to test the reliability and consistency of the EEG data across sessions for each participant, as well as to examine the effects of learning and adaptation over time compared to stable neural patterns observable in both sessions. We hypothesized that EEG biomarkers can distinguish between cognitively healthy individuals and those at risk for early cognitive decline with MMSE scores in the high range (24–30).

The second study aims to validate the outcomes of the pilot study by incorporating additional clinical diagnostic tools, such as MoCA. While MMSE is a reliable tool, its sensitivity can be limited by educational level variations. A review of over 50 studies indicates that MoCA exhibits greater sensitivity than MMSE in detecting subtle early-stage MCI deficits45. Additionally, previous work demonstrated that a single-channel EEG approach successfully extracted features comparable to MoCA scores46. To further enhance the clinical aspect of our assessment, we introduced functional tasks in the second study protocol. Two tasks from the Performance Assessment of Self-Care Skills (PASS), which evaluates functional status and change47. A study found significant associations between PASS tasks focusing on cognitive skills and performance in verbal memory and executive function, effectively differentiating MCI subjects from healthy controls48. We hypothesized that a combined approach using single-channel EEG, MMSE, and MoCA scores will provide a more accurate assessment of early cognitive decline, particularly in individuals who are at risk for MCI.

Finally, we conducted a meta-analysis of data gathered in all three studies, including additional healthy controls (n = 349), to achieve a comprehensive perspective on the relationships between EEG features, cognitive assessments, and functional tasks in the elderly population.

Methods

Participants

The first study recruited 80 patients from the inpatient rehabilitation department at Dorot Geriatric Medical Center, with a mean age of 73.51 (SD: 10.45) years, evenly distributed between males and females. Both groups exhibited a diverse age range. An age difference was observed between the healthy male group and the MCI-R male group. This difference is primarily attributed to the presence of a male participant in the MCI-R group, whose age (101 years) is more than two standard deviations above the group average. When this outlier is excluded from the analysis, the age difference between the male subjects is no longer significant (t = -1.86, p = 0.07). Other than that, no significant age differences were found between the groups.

The second study included 77 patients from the same department, with a mean age of 74.17 (SD: 8.90) years, comprising 52% females and 48% males. Each group displayed a wide age range. Differences in age were observed between the MD group and the other two groups (MCI-R and Healthy), particularly among female participants. Full demographic details are provided in Tables 1 and 2.

In both studies, clinical staff identified potential participants during hospital admissions. Participants were selected based on study inclusion criteria and had MMSE scores of 24–30 (first study), or 10–30 (second study).

For the meta-analysis, a total of 112 healthy participants with mean age of 37.6 (SD: 15.4) years, comprising 62% females and 38% males were recruited. They completed auditory cognitive tasks similar to the study groups.

All participants included in this paper provided informed consent in line with the Declaration of Helsinki. Individuals who objected or had neurological comorbidities, scalp or skull damage, facial skin irritation, significant hearing impairments, or a history of significant drug abuse were excluded.

Ethics approval, accordance statement and informed consent

This study was conducted in accordance with the Declaration of Helsinki and relevant institutional and national regulations. Ethical approval was granted by the Ethics Committee (EC) of Dorot Geriatric Medical Center. The first study received approval on September 07, 2020 (NIH Clinical Trials Registry number: NCT04683835), and the second study received approval on March 01, 2022 (NIH Clinical Trials Registry number: NCT05528445). Ethical approval for healthy participants included in the meta-analysis was obtained from Tel Aviv University. All participants provided informed consent before their inclusion in the study.

Study groups

Figure 1 illustrates the group allocation and analysis details for each part of the study.

In the first study, participants were divided into two groups based on their MMSE scores: Healthy group (MMSE scores of 28–30, n = 40); and MCI-R group (MMSE scores of 24–27, n = 40);

In the second study, participants were divided into three groups based on their MMSE scores: Healthy group (MMSE scores of 28–30, n = 30); and MCI-R group (MMSE scores of 24–27, n = 30); mild dementia (MD) group (MMSE scores of 10–23, n = 17).

We used MMSE score cutoffs of 24 and 27 for group allocation, focusing on timely detection of cognitive decline. Previous evidence suggests that a higher cutoff score enhances diagnostic accuracy49. Additionally, research indicates that educated individuals scoring below 27 on the MMSE are at increased risk of developing dementia50.

Finally, the meta-analysis included data from both studies and additional healthy participants, totaling 237 elderly individuals (allocated as in the second study) and 112 healthy young participants.

Fig. 1
figure 1

Study design with group allocation and analytical approach at each step.

Clinical and demographic data

To enhance the validation of clinical assessments and cognitive states of participants, additional evaluations were conducted alongside the MMSE in both studies. In the first study, participants underwent Instrumental Activities of Daily Living (IADL)51 assessments, which measures daily living tasks across eight domains, with scores ranging from 0 (low functioning) to 23 (high functioning). The IADL is self-reported and assessed through interviews and has seldom been linked to objective measures like brain activity. However, a study using single-channel EEG effectively classified elderly subjects based on IADL scores52.

In the second study, several clinical assessment methods were collected including the Montreal Cognitive Assessment (MoCA)8, the Geriatric Depression Scale (GDS)53 for depression diagnosis, and the Executive Clock Drawing Task (CLOX)54 for assessing cognitive impairment. Additionally, demographic and sleep-related data were collected in the second study (see full details in Supplementary Material Table S17).

Designed for the detection of MCI or early Dementia by healthcare professionals, the MoCA evaluates various cognitive domains including visuospatial abilities, memory, attention, and delayed recall55. The interpretation of scores, including the threshold for normal cognitive function (commonly 26 or higher), should consider adjustments based on the individual’s years of education and age56. The GDS, designed for elderly individuals, consists of “yes” or “no” questions about the past week’s emotional experiences, scores from 0 to 15, with higher scores indicating more severe depression57. The CLOX task, involving drawing and replicating a clock, scores from 0 to 15, with lower scores indicating greater cognitive impairment54.

EEG recording and auditory battery

EEG recording followed the previously described protocols43,44, and was conducted in a quiet, well-lit room. A research assistant prepared the sanitized Neurosteer EEG equipment, including the disposable electrode patch, sensor, EEG monitor, and clicker. After the electrode patch was attached to the participant’s forehead, the recording commenced. Participants remained seated throughout the session, receiving auditory instructions through a speaker linked to the EEG monitor. The session lasted approximately 30 min, including a brief baseline activity recording to ensure accurate evaluation.

This battery consisted of pre-recorded tasks: musical detection, musical n-back, and resting state tasks as outlined in prior studies43,44. Initial instructions were provided to minimize further guidance and reduce potential bias. The detection task comprised a sequence of melodies performed by a violin, trumpet, and flute. Participants used a clicker to respond to auditory cues based on specific instructions provided at the beginning of each block. The task included two levels of difficulty to examine escalating cognitive demands. In Level 1, melodies lasted 3 s and repeated throughout the block. Participants clicked the button for each repetition of the melody, with three 90-second trials (one per instrument). Each melody appeared 5–6 times, interspersed with 10–18 s of silence. In Level 2, melodies were shortened to 1.5 s, and all three instruments were intermixed within a single block. Participants clicked only for the specified instrument, ignoring others. Each trial included 6–8 melodies, with 8–14 s of silence between them, and the target melody appeared 2–3 times.

The n-back task further tested cognitive load with three levels of difficulty (0-back, 1-back and 2-back). Participants were presented with a sequence of melodies played by different instruments and instructed to respond using the clicker. In the 0-back condition, participants clicked the button each time a melody was played, with one 90-second block consisting of nine trials, where each melody lasted 1.5 s and was followed by 6–11 s of silence. In the 1-back condition, participants clicked the button each time a melody repeated itself (n = 1). This condition consisted of two 90-second blocks, with 12–14 trials per block. Melodies lasted 1.5 s, followed by 4–6 s of silence. Approximately 30–40% of trials in this condition included target stimuli, where the melody repeated. In the 2-back condition, participants were asked to click the button each time a melody repeated itself two steps apart (n = 2). A 45-second block consisting of 12 trials was played, with melodies of 1.5 seconds, followed by 4–6 seconds of silence. Approximately 25% of trials in this condition included target stimuli (where the melody that was played two steps ago repeated).

Resting state tasks were interspersed within the auditory battery to provide baseline measurements. The first resting state task occurred between the detection and n-back tasks, during which participants were instructed to keep their eyes open and think of positive autobiographical thoughts for 45 s. The second resting state task followed the auditory assessment, where participants closed their eyes and allowed their minds to wander for 60 s.

In the first study, participants underwent two EEG recording sessions with different randomizations of the auditory battery trials. Each patient was re-examined under the same conditions over the next seven days, with sessions at least one day apart.

In the second study, participants completed two additional C-IADL51 sub-tasks from PASS: telephone use and medication management. Each task is rated on a 4-point scale (0–3), and patients receive three types of scores: independence, safety, and adequacy (quality)47.

EEG device

EEG data were collected using the high dynamic range Neurosteer® Recorder (hdrEEG), which employs a single-channel setup. Each participant was fitted with a three-electrode patch placed on the forehead, utilizing dry gel to ensure good signal transmission. The non-invasive electrodes were arranged in the prefrontal cortex, with the single channel of EEG obtained from the differential signal between electrodes at Fp1 and Fp2, according to the International 10/20 system, and a reference electrode at Fpz. The recordings were digitized at a continuous sampling rate of 500 Hz.

Power spectrum and frequency bands

The EEG power spectrum was obtained through the fast Fourier transform (FFT) of the EEG signals within a 4-second window, using a Hamming window to minimize spectral leakage. Power spectral density was calculated from the frontal channel (Fp1-Fp2) and transformed to dB (logarithm base 10), to reflect log-transformed absolute power (in µV²/Hz). Delta (0.5–4 Hz), Theta (4–7 Hz), Alpha (8–15 Hz), Beta (16–31 Hz), and lower Gamma (32–45 Hz) frequency bands were included in the analysis. The frequency band ranges employed in the present study fall within established conventions in EEG analysis58,59.

Signal processing and EEG features

In recent years, a time-frequency approach has been adopted for analyzing EEG data to characterize brain states in AD60,61. In line with this approach, our study employs an advanced time-frequency method to process the EEG signal, as previously described40,43,44. The EEG features are produced by a secondary layer of machine learning applied to labeled datasets previously gathered by Neurosteer, to derive several linear combinations. Specifically, the EEG features VC9 and A0 were calculated employing the linear discriminant analysis (LDA) technique62. LDA is designed to identify an optimal linear transformation that maximizes class separability. Previous studies employing LDA models on imaging data have demonstrated success in predicting the development of cognitive decline. Simple LDA models using MRI and PET data were shown to predict cognitive decline or stability up to four years prior to the manifestation of decline symptoms63. The calculation of EEG feature ST4 utilized principal component analysis (PCA)64, a technique employed for reducing feature dimensionality before classification. Research indicates that features extracted through PCA exhibit a significant correlation with MMSE scores and effectively distinguish individuals with AD from healthy subjects65,66,67. Notably, all three EEG features were derived from datasets different from those analyzed in the current study, to avoid overfitting the data. Consequently, the weight matrices previously determined were applied to transform the data acquired in the present study.

In studies conducted on young healthy participants, VC9 feature showed increased activity with escalating levels of cognitive load manipulated by a numeric n-back task39. Furthermore, during an arithmetic task, VC9 activity decreased in response to external visual interruptions41. Additionally, in a surgery simulator task performed by medical interns, VC9 activity declined with task repetition, correlating with individual performance40. VC9 demonstrated greater sensitivity than Theta particularly for tasks with lower cognitive load, making it more suitable for clinical and elderly populations. Notably, in the preceding pilot study44, higher cognitive load levels resulted in increased VC9 activity exclusively in the healthy young group compared to the healthy senior group, highlighting different activity patterns between young and senior participants across various cognitive states. In clinical settings, VC9 activity correlated with the auditory mismatch negativity (MMN) component in minimally responsive patients68.

EEG feature A0, previously identified as a classifier for distinguishing cognitive load from rest in healthy subjects, has proven to be a robust predictor of cognitive decline in individuals with mild-to-moderate impairment44. Furthermore, A0 effectively differentiates between healthy controls and Parkinson’s disease (PD) patients, with higher activity observed in healthy individuals43.

EEG feature ST4 was found to correlate with individual performance in the numeric n-back task, specifically correlating the disparity in RTs between high and low cognitive load levels to differences in ST4 activity per participant39. In the preceding pilot study44, ST4 demonstrated the ability to differentiate between individuals with low MMSE scores, those with scores between 24 and 27, and those with scores above 28, as well as healthy young participants. This suggests that ST4 can detect subtle changes in cognitive states, indicating its potential as a sensitive marker of cognitive functioning.

Statistical analysis

Overview

The statistical analysis was conducted separately for the first and second studies, followed by a meta-analysis incorporating data from a total of 349 participants from both studies and previously collected data.

In the first study, the analysis began with dimensionality reduction using Lasso, Elastic Net, Ridge, and SVM with RBF kernel models to identify key features correlated with MMSE scores. This was followed by Linear Mixed Model (LMM) analyses, typically used when the dependent variable is continuous, and when there are repeated measurements for the same subject. In the current study, we used LMMs to assess the relationships between EEG variables, MMSE groups, and cognitive load levels with the following variables: MMSE group (numeric, between), visit (categorical, within), and cognitive load level (numeric, within). Separate LMMs were then conducted for each visit, considering MMSE group and cognitive load.

In the second study, LMM analyses incorporated the MMSE group (numeric, between) and cognitive load level (numeric, within) variables. Additionally, correlation models were employed to examine the associations between EEG variables and clinical test scores. Logistic regression models were applied to predict both MMSE and MoCA results based on brain activity features and collected clinical data (e.g., CLOX, GDS, and PASS scales). Logistic regression was chosen for its ability to model the relationship between predictors and outcomes in a probabilistic framework, providing interpretable insights into how different features influence prediction.

The significance level for all analyses was set at p < 0.05. Post-hoc effects with Benjamini-Hochberg correction69 were applied following significant main effects and interactions. All analyses were carried out using Python Statsmodel70.

Variables

These studies included EEG variables, performance data, and clinical scales. EEG variables comprised frequency bands: Delta, Theta, Alpha, Beta and lower Gamma, as well as three EEG features: VC9, ST4, and A0 (normalized to a scale of 0-100). All EEG variables were calculated every second using a moving window of four seconds, and mean activity per condition was analyzed. Behavioral variables included mean response accuracy and mean RTs per participant. The independent variable representing cognitive load was constructed as follows: tasks performed during resting state; Detection task level 1 and 0-back were categorized as cog_load 1; Detection task level 2 and 1-back were categorized as cog_load 2. Finally, 2-back was categorized as cog_load 3.

Results

Demographic and clinical results

To ensure proper adjustment for age and gender, mean ages were compared within each MMSE group using the Welch Two Sample t-test, both overall and separately by gender (see Tables 1 and 2 for detailed results).

In the first study, a significant positive correlation between MMSE and IADL scores was observed (r = 0.26, p = 0.03), as expected based on previous literature71,72. Significant correlations were also found between MMSE and IADL scores and A0 biomarker activity during both cognitive and resting tasks (detection task: r = -0.25, p = 0.04; n-back task: r = -0.29, p = 0.02; and resting state tasks: r = -0.39, p = 0.003), suggesting that higher A0 activity might be associated with greater cognitive decline, as indicated by lower IADL and MMSE scores.

The second study included additional demographic and clinical data (see full details in Supplementary Material table S17). No significant differences in education level, years of employment, average sleep hours, sleep quality, or tiredness were found between groups (MD, MCI-R, and Healthy, p > 0.05). MoCA scores showed significant differences between all groups (all ps < 0.05, see Table 2).

Table 1 Demographic information for the first study groups, including mean ages and Mini-Mental state examination (MMSE) scores for total participants, and separately for males and females (SDs presented in brackets). The table also includes t and p values comparing mean ages between healthy and MCI-R groups, both overall and by gender. Additionally, t and p values comparing ages between genders are shown in the final row.
Table 2 Demographic information for the second study groups, including mean ages, Mini-Mental state examination (MMSE) scores, and Montreal cognitive assessment (MoCA) scores for total participants, as well as for males and females separately (SDs presented in brackets). The table provides t and p values comparing mean ages between healthy, MCI-R, and MD groups, both overall and by gender. The final row presents t and p values comparing ages between genders.

First study results

The first study aimed to detect early cognitive decline in healthy seniors. Initially, dimensionality reduction techniques (Lasso, Elastic Net, Ridge, and SVM with RBF kernel) were used to identify features correlated with MMSE scores. Subsequently, linear mixed models (LMM) were employed to examine relationships between EEG variables, MMSE groups, and cognitive load levels.

Dimensionality reduction

To identify a combination of features that would result in the highest correlation with MMSE scores, mean feature activity as well as reaction times (RTs), and accuracy were calculated for each auditory task per participant. Since the focus was on detection of timely cognitive decline in the healthy elderly population (typically associated with MMSE > 24), the aim was to differentiate between healthy individuals (MMSE > 27) and those at risk for MCI (MMSE between 24 and 27).

Multiple linear predictors and one nonlinear predictor were tested, including ridge, Lasso, and Elastic regression, linear kernel RBF, and SVM with RBF kernel. Lasso and Elastic Net yielded slightly better results than ridge regression, indicating the usefulness of both L1 and L2 penalties in feature selection. We set the number of features to analyze at 30, based on individual R2 values. The data was then analyzed using cross-validated binary prediction of MMSE scores. Each cross-validation group produced an ensemble average over multiple regularization parameters to improve reliability73. The average R2 was 0.31, corresponding to an r > 0.55. See Table 3 for results from four models.

Table 3 Performance metrics for four predictive models used to detect early cognitive decline in healthy elderly population. SDs are presented in parentheses.

These results indicate that our approach effectively predicts cognitive performance as measured by MMSE scores, achieving a good balance between sensitivity and specificity. Specifically, Lasso and ElasticNet models achieved the highest sensitivity (0.90), indicating excellent detection of true positives. Both Lasso and ElasticNet models yielded the highest F1 score (0.77), indicating a strong balance between precision and sensitivity. Elastic Net achieved the highest AUC (0.74) with Lasso closely following (0.73), demonstrating superior overall ability to distinguish between classes. Figure 2 illustrates the predictions of these two models, showing a correlation of r = 0.38 and r = 0.35, with lower variability in the higher MMSE scores (27–30).

Fig. 2
figure 2

Prediction of Lasso model (left) and ElasticNet model (right), of Mini-Mental State Examination (MMSE) scores. The red trend line indicates a positive correlation. Shaded areas highlight clusters of higher MMSE scores (27–30) with corresponding higher predicted values and lower variability.

Linear mixed model (LMM) results

For the complete LMM results of all studies, including standard deviations, p- and z-values, refer to Supplementary Material Sect. 3. This study involved two recording sessions across consecutive visits, each featuring a comparable auditory battery with tasks of varying cognitive load. The initial LMM analysis included data from both visits, with MMSE score (numeric, between-subjects), visit (categorical, within-subjects), and cognitive load (numeric, within-subjects) as variables. No significant main effects or interactions were found for any of the features analyzed. Consequently, further analyses were conducted for each visit separately.

Analysis of the first visit data revealed no main effects between the groups. However, significant interactions between group and cognitive load were found for VC9, ST4, and Theta, with the healthy group showing higher activity at higher cognitive loads: cognitive load 2 vs. rest for VC9 (p = 0.014), ST4 (p = 0.016), and Theta (p = 0.028); and cognitive load 1 vs. rest for VC9 (p = 0.016) and ST4 (p = 0.018). No differences in cognitive load were detected in the MCI-R group.

In the second visit, A0 showed a significant main effect of group, with higher activity in the MCI-R group compared to the healthy group (p = 0.033). Additionally, VC9, Theta, Delta, Alpha, and Beta exhibited significant main effects of cognitive load (all ps < 0.001), with similar cognitive load effects observed across both groups (see Fig. 3 and Supplementary Material Sect. 3.1.2).

Fig. 3
figure 3

Mean values per participant for A0, VC9, Theta, and Delta during visit 1 (left) and visit 2 (right) in Healthy and MCI-R groups, across different cognitive load levels: resting state (purple), cognitive load level 1 (red), and cognitive load level 2 (green). Asterisks denote significant effects (*p < 0.05, **p < 0.01 and ***p < 0.001).

Inter-patient variability results

Refer to supplementary material Sect. 2 for the full details and results. Testing reliability between the two visits for all EEG features and frequency bands revealed moderate to excellent reliability (ICCs 0.5–0.8) for the n-back task, and moderate to good reliability (ICCs 0.5–0.75) for the detection task. Findings from Pearson correlations revealed significant correlations between the two visits across both detection and n-back tasks for all EEG features and frequency bands (all p-values < 0.01). In summary, the low within-patient variability observed between the two visits in the first study enhances the validity of our measurement method.

Second study results

The second study included a single recording session with cognitive assessments involving musical tasks of varying cognitive loads. To gain a deeper insight into participants’ clinical status, additional clinical information and measurements were collected.

Correlation with clinical measures

Pearson correlations were calculated between each EEG feature per cognitive load, and the MMSE score and the MoCA score (full correlation results are provided in Supplementary Material Table S18). A0 and Gamma demonstrated strong correlations with MMSE scores across all tasks, and with the MoCA scores for most tasks (see Fig. 4).

Fig. 4
figure 4

A0 (left) and Gamma (right) correlations to Mini-Mental State Examination (MMSE) score (top panel) and Montreal Cognitive Assessment (MoCA) scores (bottom panel), as a function of cognitive load level: low (red), medium (blue), high (green), and resting state (purple).

Linear mixed model (LMM) results

In the LMM model with group (3 levels, categorical, between), and cognitive load (3 levels, categorical, within) variables, A0, Gamma and Beta exhibited significant differences between the groups (see Fig. 5 for individual means of A0 and Gamma per group and cognitive load). Post-hoc analyses showed that the difference between Healthy and MD groups was significant for A0 (padj = 0.017) and Beta (padj = 0.002). For Gamma, the difference between Healthy and MD groups (padj = 0.001), as well as MCI-R and MD groups (padj = 0.0431) showed significance.

The main effect for cognitive load was significant for A0, VC9, Delta, Theta, Beta and Gamma. Post-hoc analysis revealed that for most features, the differences between cognitive load levels were highly significant for the Healthy and MCI-R groups, but not significant for the MD group (see Supplementary Material Table S21).

Fig. 5
figure 5

The mean activity per participant in the second study for A0 (top), and Gamma (bottom), in healthy participants, MCI-R, and MD groups, as a function of cognitive load level: resting state (blue), cognitive load level 1 (red), cognitive load level 2 (green), and cognitive load level 3 (purple). Asterisks denote significant effects (*p < 0.05, **p < 0.01 and ***p < 0.001).

Logistic regression model results

To incorporate the clinical data gathered in the second study, logistic regression models were created to predict the MMSE scores based on EEG features and clinical data. This approach contrasts with the linear regression models used in the first analysis, which focused on healthy participants (with MMSE > 24), aiming to identify early signs of cognitive decline. The logistic regression approach here provides a broader understanding of cognitive impairment across a wider spectrum of MMSE scores (18–30) with multiple clinical measures included.

Two linear regression analyses were conducted to identify significant predictors of MMSE score. Both regressions included potential predictors from EEG features (i.e., A0, ST4, VC9, and Delta, Beta and Gamma), demographic factors (i.e., age, gender, years of education), cognitive task performance (i.e., accuracy, response time), and clinical measures (i.e., CLOX, GDS, and PASS scales). In the first regression, we tested all predictors across the different cognitive load levels, and the second regression was repeated for each cognitive load level. All regressions were created with a backward elimination process, first inserting all variables and then sequentially removing the non-significant variables based on their p-values (> 0.05). After backward elimination, the final first model predicting MMSE score across cognitive load levels, had an R2 value of 0.988, with three significant predictors: A0 (p = 0.009), Gamma (p < 0.001), and accuracy (p < 0.001). For the full results and figures, see Supplementary Material Sect. 3.2.4.

To identify the factors influencing cognitive load, a series of regression models were constructed for each cognitive load level, using the same methods as the first regression. For the highest level of cognitive load level (i.e., brain activity during 2-back), seven significant predictors were retained: A0 (p < 0.001), Gamma (p < 0.001), ST4 (p = 0.046), CLOX (p = 0.015), PASS - drugs safety (p < 0.001), years of employment (p = 0.011), and tiredness (p = 0.019). For the mid-high cognitive load level (i.e., detection level 2 and the 1-back), the significant predictors included A0 (p = 0.001), Beta (p < 0.001), CLOX (p = 0.003), age (p = 0.04), PASS - drugs safety (p < 0.001), marital status (p = 0.002), years of employment (p < 0.001), living arrangements (p = 0.011), and tiredness (p = 0.004). In the low cognitive load level (i.e., 0-back and detection level 1), the significant variables were: A0 (p < 0.001), ST4 (p = 0.026), Beta (p < 0.001), PASS - drugs quality (p = 0.024), PASS - drugs safety (p < 0.001), marital status (p = 0.002), years of employment (p = 0.009), living arrangements (p = 0.020), tiredness (p = 0.018), and accuracy (p = 0.003). Interestingly, the resting state model, the variables who were found significant were the EEG features of VC9 (p < 0.001) and Theta (p < 0.001), and CLOX (p = 0.002), PASS - drugs safety (p = 0.003), marital status (p = 0.002), years of employment (p = 0.001), living arrangements (p = 0.000), tiredness (p = 0.018), and accuracy (p = 0.003) as clinical variables.

In conclusion, while each cognitive load level displayed a distinct set of significant predictors, there were shared factors such as A0 and years of employment consistently identified across models as significant. Conversely, certain variables like Gamma played a crucial role in specific cognitive load levels but did not demonstrate universal applicability across all levels.

Meta analysis

In the final stage of our analysis, we combined the data from both studies with previously collected data44 that included seniors with different MMSE scores and a cohort of healthy young participants. This integration enabled a thorough meta-analysis, incorporating a total of 237 elderly individuals (categorized as healthy seniors n = 121, MCI-R n = 84, and MD n = 32), along with healthy young controls (n = 112). All participants completed similar tasks with the same levels of cognitive load, allowing for analysis of differences in mean brain activity between groups and across cognitive load levels, and their interactions. The population distributions of mean A0 activity levels during both rest and cognitive resource allocation were also computed and are provided in Supplementary Material figures S2 and S3.

Initially, our focus was directed toward the elderly population (senior participants who underwent MMSE evaluation, n = 203). Pearson correlations were calculated for each EEG feature, MMSE score and cognitive load level. A0 exhibited a significant correlation with MMSE score (r = -0.25, p < 0.001), which remained significant across all cognitive load levels after Bonferroni correction (all corrected ps < 0.05). Similarly, Gamma band exhibited a significant correlation with MMSE score (r = -0.23, p < 0.001), with significance maintained only for rest after Bonferroni correction (corrected p = 0.004). Refer to Fig. 6 and Supplementary Material Table S26 for the full results.

Fig. 6
figure 6

Correlation between Mini-Mental State Examination (MMSE) scores and EEG features A0 (left), and Gamma (right) as a function of cognitive load level.

Next, we constructed an LMM that integrated the group variable (including all senior groups and healthy young controls), with the cognitive load levels (see Fig. 7 and Supplementary Material Sect. 4.2 for all LMM and post-hoc results). Significant main effects of group were found for A0, Delta and Gamma, indicating lower activity levels the healthier and younger the group. Subsequent post-hoc comparisons revealed significant differences for A0 between the healthy young group and all other groups (all ps = 0.001), as well as between healthy seniors and the MD group (p = 0.001), and MCI-R group (p = 0.037). For Gamma, the MD group showed significantly higher activity compared to the healthy young group (p = 0.002). Delta showed a significant difference between the MCI-R and healthy young groups (p = 0.001).

An interaction between group and cognitive load was observed for VC9, ST4, Theta, Alpha, Beta and Gamma. Simple effect comparisons indicated that differences between cognitive load levels were generally more pronounced in cognitively healthier groups, with significant differences between cognitive load levels and rest for healthy seniors and healthy young participants (all ps < 0.001). Complete results are available in Supplementary Material Sect. 4.2.

Fig. 7
figure 7

A0 activity (top) and VC9 activity (middle), and Gamma (bottom) of the groups in the meta-analysis, as a function of cognitive load level. Asterisks denote significant effects (*p < 0.05, **p < 0.01 and ***p < 0.001).

Discussion

Timely detection of cognitive decline is crucial for effective intervention, highlighted by the recent FDA approval of two new AD drugs3,4. EEG serves as a valuable tool for identifying abnormal brain activity patterns that may indicate cognitive impairment. Building on our previously published pilot study44, which explored neural activity using a single-channel EEG, this paper presents two follow-up studies and a meta-analysis involving 237 seniors and 112 healthy young subjects, aimed at extending the scope and broadening the applicability and relevance of the results. An auditory assessment protocol was implemented to evaluate cognitive function under varying load conditions, facilitating a comprehensive exploration of EEG pattern changes to identify reliable biomarkers for timely detection.

The first study aimed to detect subtle cognitive differences within the healthy elderly population by examining cognitively healthy participants. The Lasso regression model effectively selected relevant EEG and behavioral features, demonstrating its utility in identifying individuals at risk for MCI, with results showing strong predictive relationships between EEG patterns and cognitive performance. This approach aligns with research showing Lasso regression can predict the relationship between working memory ability and frontal brain activity through EEG signal processing74. Another study used Lasso regression to select functional brain indicators associated with cognitive impairment, effectively classifying participants into groups based on MoCA and MMSE scores75. An additional study suggested that a predictive model for MMSE scores based on Lasso regression, highlighting the effectiveness of EEG biomarkers, particularly from the prefrontal regions, in indicating early cognitive decline38. These findings highlight the potential of this approach, though further refinement and additional variables may improve precision. Temporal patterns in EEG biomarkers revealed differences in cognitive adaptability and load-related responses between the two study groups. Cognitive adaptability differed between the study groups, with the Healthy group (MMSE 28–30) demonstrating more pronounced differences in VC9, ST4, and Theta activity across cognitive load levels compared to the MCI-R group (MMSE 24–27). While these differences were initially absent in the MCI-R group, they became evident over time, suggesting a delayed or limited adaptation process. This potential learning effect over time is consistent with the multiday learning curve approach, which indicates that assessing learning over multiple days can reveal early Aβ-related memory declines before conventional AD symptoms appear76,77. Higher A0 activity in the MCI-R group compared to the Healthy group further supports this reduced adaptability. Cognitive load effects were more pronounced in the Healthy group, suggesting cognitive load effects are more pronounced in healthy individuals and may indicate a greater risk for MCI with lower initial MMSE scores. Theta and Delta bands showed consistent increases during cognitively demanding tasks relative to rest, consistent with our previous findings39,44 as well as recent literature regarding increased frontal activity of Theta18,20,78 and Delta22,79 during the performance of cognitive demanding tasks. High within-patient consistency of EEG features across visits highlights the stability of these biomarkers. Collectively, the first study demonstrated good intra-group consistency and notable inter-group variability among healthy seniors. Our hypothesis was realized, as EEG biomarkers successfully identified subtle changes in individuals at risk for cognitive decline, highlighting the potential of EEG measures to track cognitive dynamics and differentiate between healthy aging and potential early decline.

The second study aimed to validate and extend these findings by incorporating additional demographic data and clinical evaluations using MoCA8 and PASS47. Addressing a limitation of the pilot study, the inclusion of information regarding the education of senior participants in this study revealed no significant group differences in education levels, a factor that can influence individual MMSE scores45,80. Other factors were also ruled out as confounding effects, including years of employment, sleep patterns, and GDS scores, however, their inclusion improved statistical models, enhancing the results related to the novel EEG biomarkers. A0 and Gamma activity emerged as consistent predictors of MMSE score across different cognitive load levels, while other clinical and demographic factors (such as CLOX, PASS drug safety, and years of employment) showed varying significance depending on cognitive load levels. Lower MMSE and MoCA scores (indicative of greater cognitive impairments) were associated with increased Gamma activity during the performance of cognitive tasks. Although previous studies showed decreased Gamma band synchronization in AD81,82, the increased Gamma band power observed during task performance persists even in cognitive decline patients, possibly indicating heightened resource allocation under cognitive load28,83. Studying the effects of cognitive load on brain wave patterns can provide crucial insights into the processes underlying cognitive decline, enhancing our understanding of the mechanisms involved. These results highlight the nuanced interplay between EEG features, clinical measures, and cognitive performance, confirming our hypothesis that combining these tools enhances the accuracy of early cognitive decline assessment and provides a more comprehensive understanding of cognitive health and the mechanisms underlying cognitive decline.

The consistent negative correlation between A0 activity and MMSE scores, initially identified in the pilot study44, was replicated in both the second study and the meta-analysis, strengthening the evidence for A0 as a biomarker of cognitive function. This correlation was also extended to MoCA and IADL scores, fulfilling the primary objective of the second study. These associations to clinical measures further validate A0 as a biomarker related to cognitive state, as previously described43,44. Moreover, A0 effectively differentiates cognitive groups across all analyses, thus achieving the primary goal of successfully identifying changes between cognitively healthy individuals and individuals at risk of decline (who initially scored lower with MMSE scores between 27 and 24). This is further supported by the finding that individuals exhibiting more advanced decline showed even higher A0 levels, suggesting a gradient of A0 activity that correlates with cognitive health. Collectively, these findings provide further evidence that A0 is sensitive to subtle cognitive changes and is potentially suitable as a cognitive decline biomarker.

One of the central findings presented here highlights the relationship between EEG activity and cognitive load, with healthier individuals demonstrating greater adaptability to increasing task demands. A0 activity varied most significantly across cognitive load levels within healthy individuals, whereas the individuals at-risk for MCI exhibited significant differences only between rest and high cognitive load conditions. In contrast, individuals with existing cognitive deficits showed no significant changes in A0 across cognitive load levels. This pattern of diminishing responsiveness in individuals with cognitive decline suggests reduced flexibility in managing cognitive demands. Similarly, VC9 revealed a graded response across cognitive groups, where differences between task and rest conditions diminished as cognitive health declined. Notably, individuals at risk of MCI demonstrated reduced activity during higher load conditions compared to lower ones, a pattern not seen in cognitively healthier participants. This plateauing of EEG activity indicates that individuals with early cognitive decline may activate additional neural resources during lower loads but face difficulty sustaining this activation as task complexity increases. These findings align with previous research showing compromised adaptability in brain activity among seniors experiencing cognitive impairment84,85, highlighting potential limitations in cognitive reserve.This study adds to existing diagnostic approaches, including blood and CSF biomarkers, by offering a non-invasive, real-time assessment of brain activity that could enhance timely detection. EEG biomarkers offer a cost-effective screening tool that may integrate with pathology-specific methods to improve predictive accuracy. Additionally, EEG-based tools could ease the diagnostic workload for clinicians by serving as an initial screening method in general or geriatric care settings. Fully automated algorithms for interpreting EEG data may streamline this process, reducing the need for specialized training.

While showing promising results, further research is needed to address limitations encountered in our studies. For instance, the small sample size of the MD group (n = 32), challenges robust comparisons with the larger MCI-R (n = 84) and Healthy (n = 121) groups, increasing the risk of Type II error. Expanding the sample size for the MD group in future studies would enhance statistical power, allowing for more thorough analyses and interpretations of differences in cognitive states. Additionally, the age-related confound among females between MD and other groups should be taken into account when considering the results. While the prevalence of dementia generally increases with age86,87,88, which may partly explain the observed differences, the age differences should be considered a limitation of the study. The significant EEG variables identified in the logistic regression model could be utilized in future research to predict MMSE and MoCA scores of elderly participants, allowing for comparison with actual clinical assessment scores to assess their predictive power. In the second study, the PASS drugs safety score emerged as a key predictor for MMSE scores in logistic regression models across cognitive load levels. Despite its promise in assessing functional competence and distinguishing between subjects with cognitive decline and healthy controls48, our findings did not reveal significant differences between study groups or correlations with EEG feature activity during PASS performance. Future investigations could explore alternative PASS sub-tasks, such as shopping or checkbook balancing, known for their robust discriminative capabilities89. Furthermore, while this paper focuses on the timely detection of cognitive decline, long-term studies could provide deeper insights into the predictive power of our biomarkers. Tracking individuals at risk for MCI over time could reveal how early biomarkers relate to the actual development of cognitive impairment, enhancing understanding of disease progression and potential early intervention.

In summary, this paper highlights the effectiveness of EEG biomarkers in detecting cognitive function among healthy elderly individuals. The integration of additional diagnostic tools and identification of key predictors further enhances our understanding of cognitive impairment. We demonstrated the capability of EEG features to distinguish between cognitively healthy individuals and those at risk. Collectively, our findings underscore the potential of EEG features as a non-invasive, cost-effective and reliable approach for better understanding cognitive states and facilitating timely diagnosis to improve clinical outcomes.