Introduction

Epilepsy is a life-threatening chronic neurological disorder. Whether manifesting as overt symptoms or as subtle pre-seizure states, epilepsy represents one of the most critical conditions within the realm of neurology1,2. Severe seizures can have a profound impact on patients, potentially leading to injury3. Furthermore, patients with epilepsy and their families may also experience stigma and discrimination, which can have a detrimental effect on their quality of life. The World Health Organization (WHO) estimates that approximately 180,000 new cases of epilepsy are reported globally each year4. The prevalence of active epilepsy in the general population at a given time is estimated to range from 4 to 10 per 1,000 individuals. Although anti-epileptic drugs are an effective treatment for the condition, one third of patients continue to experience seizures despite receiving treatment5. Moreover, approximately 75% of seizures are not preventable. Accurate seizure detection is pivotal for ensuring patient safety, enhancing quality of life, and determining long-term prognosis; it therefore represents not only a significant technical advancement but also a beacon of hope for numerous individuals6. The distinctive patterns observed in electroencephalogram (EEG) signals (e.g., triphasic waves, biphasic waves, generalized sharp-and-slow waves, and polyspikes) serve as indicators of various seizure types and play an essential role in identifying seizure-related brain abnormalities, diagnosing specific seizure categories, and pinpointing treatment targets7,8. In conventional epilepsy care, seizure segments are identified primarily by epilepsy specialists through visual inspection of long-term EEG recordings and subsequent manual labeling. However, this manual approach scales poorly to long-term continuous EEG, and expert fatigue increases the risk of misjudgment. Consequently, automated and expedient seizure detection algorithms warrant exploration.

Despite extensive research over the past two decades on automated seizure detection using EEG, a significant challenge persists: the imbalance between the seizure and non-seizure categories. This imbalance arises because the duration of seizures is considerably shorter than that of non-seizure periods9. Class imbalance is not exclusive to seizure detection; it is also commonly observed in large real-world datasets, which has motivated numerous methods for handling imbalanced label distributions. These classical methods can be divided into two categories: algorithm-level and data-level. Siddiqui et al.10 employed a cost-sensitive learning method with random forest, the most classical of the algorithm-level methods. However, both the method of Siddiqui et al. and the sensitivity-specificity weighted cross-entropy (SSWCE) method of Ingolfsson et al.11, which trades off specificity against sensitivity on top of the cross-entropy loss, require domain experts to construct a cost matrix or loss function from task-specific prior knowledge. A cost matrix or loss function designed for a particular problem is applicable only to that task and may not ensure optimal performance when applied to other tasks. Moreover, reweighting methods that assign distinct weights to distinct categories have been developed, including the weighted extreme learning machine (ELM) and penalty strategies12,13. Whereas the aforementioned methods only improve the model's recognition of the minority class, Yanna Zhao et al.14 used the focal loss to redefine the loss function of a seizure detection model based on the linear graph convolutional network (LGCN), allowing the model to pay more attention to samples that were difficult to classify or misclassified.

Data-level approaches include oversampling methods such as the K-means synthetic minority oversampling technique (SMOTE)15,16, SMOTE + TomekLinks17, ADASYN18, and the borderline nearest neighbor synthetic minority oversampling technique (BNNSMOTE)19. Beyond these, the random channel ordering (RCO) method proposed by Xuyang Zhao et al.20 generated EEG images by randomly shuffling the channel order, and the EEGAug method employed time-frequency conversion to transform data from disparate frequency bands of disparate samples into time-domain samples21. Furthermore, Bin Gao et al.22 employed a generative adversarial network (GAN) to augment the data. These approaches can all be classified as minority-generation methods. However, when applied to large datasets, they may incur increased computational overhead, slower training, and a risk of overfitting. Consequently, reducing samples from the majority class is a favorable alternative, for example through undersampling23,24 or by treating non-seizure data as outlier samples via anomaly sampling, as demonstrated by Nizar Islah et al.25. Xiaoshuang Wang et al.26 employed sliding-window segmentation to augment seizure data and randomly selected non-seizure data. Zhen Jiang et al.27 combined the TomekLink technique with undersampling to address the imbalance ratio between seizure and non-seizure data. Nevertheless, such hybrid methods may inherit the shortcomings of both types of data-level methods.

Traditional imbalanced learning methods often rely on heuristic assumptions derived from the training data. They typically assume that instances with higher training errors provide more valuable information for learning, as in the focal loss and the RUSBoost algorithm used with K-double cross-validation by Nipun Bhanot et al.28. Another common assumption is that generating synthetic samples around minority-class instances facilitates learning. However, under the former assumption, outliers may produce false reinforcement, whereas the latter cannot accommodate excessive overlap between minority- and majority-class samples. Meta-learning can address complex practical tasks without such heuristic assumptions. Therefore, a framework combining meta-learning and ensemble learning is introduced to address the imbalance inherent in seizure detection. Indeed, some studies have demonstrated the potential of ensemble learning to address imbalance in seizure detection. Yao Guo et al.29 employed the EasyEnsemble algorithm within a supervised learning framework to enable robust seizure detection for unseen subjects, potentially mitigating the generalization error associated with seizure-free segments. Tiejia Jiang et al.30 introduced the Ensemble Classification Model (ECM), which partitions data into several relatively balanced subsets based on the imbalance ratio, trains an independent classifier on each subset, and integrates the outputs via vote-based decision-making. Despite these advances, both methods remain rooted in heuristic design, which may limit their performance. Additionally, EasyEnsemble uses AdaBoost as its base classifier, and boosting-based methods are known to be sensitive to noise, which can adversely affect classification accuracy. The proposed framework combining meta-learning and ensemble learning avoids these problems. In recent years, numerous seizure detection algorithms based on deep learning have also emerged. Xu et al.31 introduced a method for labeling cross-period samples with soft labels, integrated it with a three-dimensional convolutional neural network (3D-CNN) architecture, and further developed a multi-scale feature extraction technique based on the short-time Fourier transform (STFT). Beyond conventional CNN models, hybrid models have become prevalent, including CNN + long short-term memory (LSTM) networks32, LSTM + multi-scale Atrous-based deep convolutional neural networks (MSA-DCNN)33, and graph neural network (GNN)-based hybrids such as graph attention networks (GAT) + Transformer34, graph convolutional networks (GCN) + Transformer35, and attention-based GNNs36. However, compared with deep learning, machine learning methodologies offer distinct advantages in model universality and feature interpretability. Using interpretable features as inputs enables medical experts to map features to corresponding biological or chemical factors, thereby facilitating the formulation of appropriate treatment plans9.

In this study, epileptic seizures were detected using meta-learning and ensemble learning to address the imbalanced classification in this field. On the one hand, a meta-sampler was employed to autonomously learn adaptive undersampling strategies from EEG data, mitigating instability and high computational cost; the non-differentiable optimization issue associated with the meta-sampler was addressed with the Soft Actor-Critic algorithm. On the other hand, the method decoupled model training from meta-training, rendering it broadly applicable to a diverse array of classifiers, and the use of task-independent meta-states maintained high robustness against label corruption in EEG data. In addition, EEG features were extracted in five frequency bands (δ, θ, α, β, and γ) from three distinct perspectives: time-domain analysis, nonlinearity assessment, and entropy-based measurement. Subsequently, Semi-JMI was employed to select the EEG features fed into the classification framework. The proposed seizure detection system demonstrated superior classification performance compared with other published methods on two publicly accessible EEG databases, yielding encouraging results.

The remainder of this article is organized as follows: section “Materials and methods” elaborates on the datasets, data preprocessing, the proposed imbalanced classification framework based on meta-learning and ensemble learning, and the experimental settings. Section “Results” presents the experimental results. Finally, sections “Discussion” and “Conclusion” provide the discussion and conclusions, respectively.

Materials and methods

Experimental datasets

TUSZ dataset

The Temple University Hospital (TUH) EEG Corpus (TUEG) is publicly available from the Neural Engineering Data Consortium (NEDC) website (www.nedcdata.org). The TUEG Seizure Corpus (TUSZ) is a subset of the TUH clinical EEG signal database and is notable for its extensive patient population and diverse configurations37. The files comprise male and female patients (51% women) with ages ranging from 1 to 90 years (mean age 51.6). Sampling rates range from 250 to 1024 Hz, with 250 Hz data accounting for 87% of the total. The electrodes are arranged on the scalp in the standard 10–20 format. Most recordings have 22 or 23 EEG channels, although, in rare cases, a recording may contain 20 channels.

A randomly selected subset of the data, comprising recordings with 22 or 23 channels from the latest version (v2.0.1) of TUSZ, was employed as the dataset in this study. The data were uniformly downsampled to 250 Hz. Subsequently, all data were transformed into a temporal central parasagittal (TCP) montage of 22 channels (Fig. 1)38. Owing to the extensive volume of data, the recordings were divided into 4-second, non-overlapping segments. Single-channel data were employed as samples, so each segment consisted of 22 samples of 1,000 data points. Examination of the seizure and non-seizure data revealed that most records in the TUSZ validation set consisted of a single seizure or non-seizure event with distinct characteristics. Accordingly, the TUSZ validation set was employed as the training set of the experiment, and the TUSZ training set was used as the validation set of the experiment. TUSZ's non-seizure signal duration is considerably longer than that of any seizure type, rendering it an appropriate choice for the algorithm. The selected dataset exhibited an imbalance ratio of 10 for both the training and validation sets, meaning that the majority class (non-seizure) contains ten times as much data as the minority class (seizure). Table 1 lists details of the experimental dataset.
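As a minimal sketch of this segmentation step, assuming a NumPy array of shape (channels, samples) already downsampled to 250 Hz (function and variable names here are illustrative, not the authors' code):

```python
import numpy as np

def segment_recording(eeg, fs=250, win_sec=4):
    """Split a (channels, samples) EEG array into non-overlapping
    4-second windows; each single-channel window becomes one sample.

    Returns an array of shape (n_windows * n_channels, win_sec * fs)."""
    n_channels, n_samples = eeg.shape
    win = win_sec * fs                       # 1,000 points at 250 Hz
    n_windows = n_samples // win             # drop the trailing remainder
    segments = eeg[:, :n_windows * win].reshape(n_channels, n_windows, win)
    # Reorder so each row is one single-channel sample: every 4-s segment
    # contributes 22 samples of 1,000 points, as described above.
    return segments.transpose(1, 0, 2).reshape(-1, win)
```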

Table 1 Statistical overview of the TUSZ dataset used in our seizure detection experiments.
Fig. 1
figure 1

Representation of the temporal central parasagittal (TCP) montage of 22 channels.

SWEZ dataset

The SWEC-ETHZ (SWEZ) short-term dataset comprises 100 intracranial EEG (iEEG) recordings from 16 patients with drug-resistant epilepsy and is available for download at http://ieeg-swez.ethz.ch/. The data had undergone digital band-pass filtering between 0.5 and 150 Hz and were sampled at 512 Hz39. Each seizure was stored as one record that included 3 min of data before seizure onset and 3 min after seizure termination, as shown in Fig. 2. The number of seizures per patient ranged from 2 to 14, with durations ranging from 10 to 1002 s. Patient 14 was removed because the provided non-seizure data were insufficient. Details of the experimental dataset are provided in Table 2. Because seizure durations varied considerably across patients, a K-fold cross-validation approach was used: for each patient (with k records), one record was designated as the test set, and the remaining records were divided equally into training and validation sets. The data were divided into non-overlapping 5-second intervals, with single-channel data serving as samples.
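A minimal sketch of this per-patient splitting scheme; the exact fold-assignment procedure is not published here, so the helper names and the random assignment of the remaining records are our assumptions:

```python
import numpy as np

def record_splits(n_records, rng=None):
    """Yield (train, val, test) record indices for one patient: each
    record serves once as the test set, and the remaining records are
    divided equally into training and validation sets."""
    rng = rng or np.random.default_rng()
    for test in range(n_records):
        rest = rng.permutation([r for r in range(n_records) if r != test])
        half = len(rest) // 2
        yield rest[:half].tolist(), rest[half:].tolist(), [test]

# Example: a patient with 4 records yields a 4-fold split.
for tr, va, te in record_splits(4):
    print(tr, va, te)
```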

Table 2 Information on 15 patients from the SWEC-ETHZ short-term dataset.
Fig. 2
figure 2

Selected intracranial EEG channels from patient 7's first seizure in the SWEZ dataset, with seizure data inside the red lines.

Data preprocessing

Butterworth filter

Signal components below 0.5 Hz or above 60 Hz were removed from the EEG signals with a sixth-order Butterworth band-pass filter40. The 0.5 Hz low cutoff effectively eliminates the direct current (DC) component of the EEG signal, while the 60 Hz high cutoff ensures that the filtered signal retains the typical EEG rhythms: δ (0.5–4 Hz), θ (4–8 Hz), α (8–13 Hz), β (13–30 Hz), and γ (30–60 Hz)41.
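A SciPy sketch of this filtering stage; zero-phase filtering via `sosfiltfilt` and the helper names are our assumptions, not stated in the paper:

```python
import numpy as np
from scipy.signal import butter, sosfiltfilt

def bandpass_eeg(eeg, fs, low=0.5, high=60.0, order=6):
    """Sixth-order Butterworth band-pass (0.5-60 Hz) applied along time.

    `eeg` is a (channels, samples) array; zero-phase filtering via
    sosfiltfilt is an implementation choice."""
    sos = butter(order, [low, high], btype="bandpass", fs=fs, output="sos")
    return sosfiltfilt(sos, eeg, axis=-1)

# Band definitions used downstream for per-band feature extraction.
BANDS = {"delta": (0.5, 4), "theta": (4, 8), "alpha": (8, 13),
         "beta": (13, 30), "gamma": (30, 60)}

def split_bands(eeg, fs, order=6):
    """Return a dict mapping band name -> band-limited signal."""
    return {name: bandpass_eeg(eeg, fs, lo, hi, order)
            for name, (lo, hi) in BANDS.items()}
```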

Feature extraction

After filtering, EEG data for the δ, θ, α, β, and γ waves were obtained in five distinct frequency bands. For each band, 29 classical statistical features were extracted and organized in the order of the δ, θ, α, β, and γ bands, yielding a feature vector of 29 × 5 = 145 features. This process transforms EEG signals into informative features, thereby achieving data dimensionality reduction. Because EEG signals are inherently nonlinear and irregular, nonlinear features must be included in the analysis of EEG activity. The 29 univariate features were selected from three categories: time-domain features, nonlinear features, and entropy-based features. Feature numbering follows the δ–γ band order (e.g., feature 30 is the first feature of the θ band, θ-1). Table 3 provides further details on these features, and Table 4 presents their mathematical foundations and descriptions.

Table 3 List of all the 29 extracted features and their respective abbreviations.
Table 4 More details about some of the features used in this work.
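For illustration, a sketch computing a handful of representative time-domain features per band, reusing the band names from the filtering sketch above; the full 29-feature list is in Table 3, and the specific features chosen below are only an illustrative subset:

```python
import numpy as np
from scipy.stats import skew, kurtosis

def hjorth_parameters(x):
    """Hjorth activity, mobility, and complexity of a 1-D signal."""
    dx, ddx = np.diff(x), np.diff(x, n=2)
    activity = np.var(x)
    mobility = np.sqrt(np.var(dx) / activity)
    complexity = np.sqrt(np.var(ddx) / np.var(dx)) / mobility
    return activity, mobility, complexity

def band_features(x):
    """A few illustrative time-domain features for one band-limited sample."""
    feats = [np.mean(x), np.std(x), skew(x), kurtosis(x),
             np.sqrt(np.mean(x ** 2)),          # root mean square
             np.mean(np.abs(np.diff(x)))]       # mean absolute first difference
    feats.extend(hjorth_parameters(x))
    return np.asarray(feats)

def feature_vector(bands):
    """Concatenate per-band features in delta..gamma order, mirroring
    the paper's 29 x 5 = 145 layout (here with fewer features)."""
    order = ["delta", "theta", "alpha", "beta", "gamma"]
    return np.concatenate([band_features(bands[b]) for b in order])
```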

Feature selection

Mutual information is widely adopted for feature selection49. This study presents a comparative analysis of four filter-based feature selection methods: maximum mutual information (MIM), joint mutual information (JMI), and their semi-supervised variants (Semi-MIM and Semi-JMI)50. Features were ranked by the scoring metric, and those with higher scores were selected. The MIM criterion is:

$$J_{MIM}\left(X_{j}\right)=\hat{I}\left(X_{j};Y\right)$$
(1)

Although mRMR is a widely used criterion that accounts for redundancy between features, Brown et al. (2012) found it quite unstable in an extensive empirical study. Accordingly, we adopted the JMI criterion:

$$J_{JMI}\left(X_{j}\right)=\sum_{X_{k}\in\mathbf{X}_{\theta}}\hat{I}\left(X_{j};Y\mid X_{k}\right)$$
(2)

The binary random variable s annotates the original EEG dataset D, thereby constructing a semi-supervised EEG dataset containing both labeled and unlabeled instances. The true label y was recorded only when s = 1; otherwise, it was marked as "missing." The proportion of labeled instances was controlled by \(p_{s1}\), the probability that s = 1. Consequently, what could be observed was not Y itself but a "surrogate" variable \(\tilde{Y}\). "Soft" prior knowledge was employed to determine the optimal surrogate for ranking the features, i.e., whether the labels marked as missing should all be assumed positive (surrogate \(\tilde{Y}_1\)) or all assumed negative (surrogate \(\tilde{Y}_0\)). Once the more informative surrogate was identified, ranking could be conducted using either the MIM or the JMI criterion on this variable. The Semi-MIM and Semi-JMI scoring metrics are as follows:

$$\text{Semi-MIM}:\quad J_{MIM}^{\tilde{Y}_0/\tilde{Y}_1}\left(X_{j}\right)=\hat{I}(X_{j};\tilde{Y}_0)\ \text{or}\ \hat{I}(X_{j};\tilde{Y}_1)$$
(3)
$$\text{Semi-JMI}:\quad J_{JMI}^{\tilde{Y}_0/\tilde{Y}_1}\left(X_{j}\right)=\sum_{X_{i}\in\mathbf{X}_{\theta}}\hat{I}(X_{j};\tilde{Y}_0\mid X_{i})\ \text{or}\ \sum_{X_{i}\in\mathbf{X}_{\theta}}\hat{I}(X_{j};\tilde{Y}_1\mid X_{i})$$
(4)
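A plug-in estimator sketch of the greedy JMI ranking in Eq. (2), using equal-width binning; the Semi-MIM/Semi-JMI variants simply substitute the surrogate \(\tilde{Y}_0\) or \(\tilde{Y}_1\) for y. The binning choice and helper names are illustrative assumptions:

```python
import numpy as np

def discretize(X, bins=10):
    """Equal-width binning so plug-in entropy estimates are usable."""
    edges = np.linspace(X.min(axis=0), X.max(axis=0), bins + 1)
    return np.stack([np.digitize(X[:, j], edges[1:-1, j])
                     for j in range(X.shape[1])], axis=1)

def entropy(*cols):
    """Joint entropy (nats) of one or more discrete columns."""
    _, counts = np.unique(np.stack(cols, axis=1), axis=0, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log(p))

def cond_mi(xj, y, xk):
    """I(Xj; Y | Xk) = H(Xj,Xk) + H(Y,Xk) - H(Xj,Y,Xk) - H(Xk)."""
    return entropy(xj, xk) + entropy(y, xk) - entropy(xj, y, xk) - entropy(xk)

def jmi_rank(X, y, n_select):
    """Greedy JMI ranking: seed with the MIM winner, then repeatedly add
    the feature with the largest summed I(Xj; Y | Xk) over the selected set."""
    Xd = discretize(X)
    n_feat = Xd.shape[1]
    mim = [entropy(Xd[:, j]) + entropy(y) - entropy(Xd[:, j], y)
           for j in range(n_feat)]                   # I(Xj; Y), Eq. (1)
    selected = [int(np.argmax(mim))]
    while len(selected) < n_select:
        remaining = [j for j in range(n_feat) if j not in selected]
        scores = [sum(cond_mi(Xd[:, j], y, Xd[:, k]) for k in selected)
                  for j in remaining]                # Eq. (2)
        selected.append(remaining[int(np.argmax(scores))])
    return selected
```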

Imbalanced classification framework

The framework comprised two distinct parts: meta-training, which optimizes the meta-sampler, and meta-sampling with ensemble training, which trains the ensemble model. Both are described in this section. The overall structure of the seizure detection system is depicted in Fig. 3(a).

Meta-state

Inspired by the concept of gradient/hardness distributions51,52, we used the histogram distribution of prediction errors on both the training and validation sets as the meta-state representation of the entire framework. This representation is agnostic to task-specific properties (e.g., dataset size and feature space); it not only provides ensemble training information to the meta-sampler but also enables adaptive resampling across tasks. The prediction error was defined as the absolute difference between the predicted probability of the ensemble classifier G and the true label, and it was approximated by a histogram with n bins. Specifically, the \(i\)-th component of the error histogram E for the dataset D was calculated as follows:

$$E_{\mathcal{D}}^{i}=\frac{\left|\left\{(x,y)\in\mathcal{D}\;\middle|\;\frac{i-1}{n}\le\left|G_{j}(x)-y\right|<\frac{i}{n}\right\}\right|}{\left|\mathcal{D}\right|}\cdot n,\quad 1\le i\le n.$$
(5)

The concatenated prediction error histograms of the training and validation sets formed the meta-state \(S=[E_{tr},E_{v}]\). It intuitively reflected the fit of the model to the training set and its generalization on unseen data. When n > 2, the histogram showed at finer granularity how "easy" samples (errors close to 0) and "hard" samples (errors close to 1) were distributed, and thus contained more information to guide the resampling process. In addition, the histogram mass near the hard-sample end indicated bias, whereas the degree of histogram concentration indicated variance. Combining the training and validation set histograms revealed, to some extent, whether the model was underfitting or overfitting, providing decision support for the sampling process. Since the meta-state provided the input to meta-sampling, the prediction probabilities and labels were calculated for the majority (non-seizure) portion of the dataset.
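A minimal sketch of Eq. (5) and the meta-state construction (names are illustrative):

```python
import numpy as np

def error_histogram(proba, y, n_bins):
    """Eq. (5): histogram of |G(x) - y| over a dataset, scaled by n.

    `proba` holds the ensemble's predicted seizure probabilities."""
    errors = np.abs(proba - y)
    hist, _ = np.histogram(errors, bins=n_bins, range=(0.0, 1.0))
    return hist / len(y) * n_bins

def meta_state(proba_tr, y_tr, proba_val, y_val, n_bins):
    """Concatenated train/validation error histograms, S = [E_tr, E_v]."""
    return np.concatenate([error_histogram(proba_tr, y_tr, n_bins),
                           error_histogram(proba_val, y_val, n_bins)])
```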

Meta-sampling

We employed a Gaussian function to streamline both the meta-sampling process and the sampler itself. Specifically, let \(\hat{s}\) be a meta-sampler that produces a scalar \(a\in[0,1]\) from the input meta-state S. For a sample with a current model prediction error of x, the (unnormalized) probability of being sampled is:

$$g_{a,\sigma}\left(x\right)=\frac{1}{\sigma\sqrt{2\pi}}\exp\left(-\frac{1}{2}\left(\frac{x-a}{\sigma}\right)^{2}\right)$$
(6)

where \(\sigma\) is a hyperparameter. The majority-class data (non-seizure) in the training set were undersampled to match the amount of minority-class data (seizure), with the sampling weight expressed as:

$$w_{i}=\frac{g_{a,\sigma}\left(\left|G\left(x_{i}\right)-y_{i}\right|\right)}{\sum_{(x_{j},y_{j})\in M_{a}}g_{a,\sigma}\left(\left|G\left(x_{j}\right)-y_{j}\right|\right)}$$
(7)

Combining the resampled majority data (non-seizure) with the minority data (seizure) produced a balanced training subset. In essence, the sampler tended to retain samples with prediction errors close to a. As a approached 1, the sampling behavior resembled hard example mining; as a approached 0, it resembled robust learning. Additionally, a could be produced in two modes during meta-training: the "rand" mode selected a randomly from the action space, and the "mesa" mode output a from the Gaussian policy network45. For details on the network output process, refer to section "Meta-training".
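A sketch of Eqs. (6)–(7) realized as a weighted undersampling step; `meta_sample` and its arguments are illustrative names:

```python
import numpy as np

def gaussian_weights(errors, a, sigma):
    """Eqs. (6)-(7): unnormalized Gaussian sampling probabilities centred
    at the action a, normalized over the majority-class samples."""
    g = np.exp(-0.5 * ((errors - a) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))
    return g / g.sum()

def meta_sample(X_maj, errors_maj, n_min, a, sigma, rng=None):
    """Undersample the majority class down to the minority-class size
    using the meta-sampler's weights; a close to 1 emphasises hard
    samples, a close to 0 emphasises easy (robust) samples."""
    rng = rng or np.random.default_rng()
    idx = rng.choice(len(X_maj), size=n_min, replace=False,
                     p=gaussian_weights(errors_maj, a, sigma))
    return X_maj[idx]
```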

Ensemble training

The ensemble training procedure trained multiple base classifiers serially. With the meta-sampler \(\hat{s}\) and the meta-sampling strategy, we iteratively trained each new base classifier on a balanced training set produced by meta-sampling. Specifically, at the j-th iteration, the unbalanced original dataset and the current ensemble model \(G_j(\cdot)\) yielded the meta-state S. The sampler then dynamically undersampled the original data according to the current S to obtain a balanced training set \(D_{j+1,tr}\). A new base classifier \(g_{j+1}(\cdot)\) was trained on this balanced set and added to the ensemble to obtain \(G_{j+1}(\cdot)\). Finally, an ensemble model consisting of m base classifiers was obtained. The prediction probability of the new ensemble model was a weighted average of the ensemble model \(G_j(\cdot)\) and the base classifier \(g_{j+1}(\cdot)\), maintaining consistency of the prediction probabilities and preventing old predictions from exerting excessive influence on new ones. The first base classifier \(g_1(\cdot)\) was trained with random undersampling. Ensemble training thus served both as part of meta-training and as the training procedure for the final ensemble model once the optimal meta-sampler was obtained.
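Putting the pieces together, a sketch of this cascaded training loop, reusing `error_histogram` and `gaussian_weights` from the earlier sketches; `policy` stands in for the meta-sampler, and the simple average over members is one way to realize the weighted-average update described above:

```python
import numpy as np
from sklearn.base import clone

def train_ensemble(X_maj, X_min, X_val, y_val, base_clf, m, policy,
                   sigma, n_bins, rng):
    """Cascaded ensemble training sketch. `policy` maps a meta-state to
    an action a in [0, 1]. Majority label = 0 (non-seizure),
    minority label = 1 (seizure)."""
    members = []

    def proba(X):  # average of members' positive-class probabilities
        return np.mean([g.predict_proba(X)[:, 1] for g in members], axis=0)

    for j in range(m):
        if j == 0:
            weights = None                     # g_1: random undersampling
        else:
            err_maj = np.abs(proba(X_maj) - 0.0)
            S = np.concatenate([
                error_histogram(proba(X_maj), np.zeros(len(X_maj)), n_bins),
                error_histogram(proba(X_val), y_val, n_bins)])
            weights = gaussian_weights(err_maj, policy(S), sigma)
        idx = rng.choice(len(X_maj), size=len(X_min), replace=False, p=weights)
        X_bal = np.vstack([X_maj[idx], X_min])
        y_bal = np.r_[np.zeros(len(X_min)), np.ones(len(X_min))]
        members.append(clone(base_clf).fit(X_bal, y_bal))
    return members, proba
```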

Meta-training

As mentioned earlier, we aimed for a framework that learns the most appropriate sampling strategy (sampler parameters) directly from the data so as to optimize the final generalization performance of the ensemble model53. In each iteration of ensemble training, the meta-sampler interacted with the external environment: the environment provided a meta-state S, the meta-sampler took an action a based on the current S, and the ensemble model was updated to yield a new meta-state \(S'\). Because we wanted the sampler to learn from its interaction with the ensemble training process, the non-differentiable optimization problem of training the meta-sampler could naturally be solved with reinforcement learning (RL).

Specifically, the ensemble training system was treated as the environment (ENV) in our RL setting. Within each episode (EPI), the environment was initialized and the meta-state S was obtained. Subsequently, through meta-sampling and ensemble training at each environment step, the action a, the new state \(S'\), and the reward r were determined, with r based on the change in the performance measure of G on the validation set before and after the update. Soft Actor-Critic (SAC) was used to optimize the parameters of the meta-sampler. This process continued until the number of base classifiers in the ensemble model reached m (done flag d = 1), at which point the next episode began.

The SAC algorithm is an off-policy actor-critic deep RL algorithm based on the maximum entropy RL framework54. It consists of 1 actor network, 2 Q critic networks, and 2 target soft Q-networks. The actor policy network of the meta-sampler consists of a fully connected layer of 55 neurons, followed by 2 branches that output the mean and log standard deviation, respectively. A Gaussian distribution was used to obtain \(a\in[0,1]\) and \(\log\pi(\cdot)\) through the reparameterization trick and the tanh function. The critic consisted of two identical Q-networks, each a multilayer perceptron (MLP) of sizes {2n + 1, 55, 55, 1}. The parameters of the target soft Q-networks were obtained as an exponential moving average of the Q-network parameters. In addition, the tuple (S, a, r, \(S'\), d) was stored in the experience replay buffer B to reduce the correlation between successive actions55. During neural network training, a batch of experiences was randomly sampled from B to increase training efficiency. Following standard practice, we used the Adam optimizer for the policy and critic networks, as shown in Fig. 3(b).
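A PyTorch sketch of the actor described above; the 55-unit hidden layer and the two output heads follow the text, while the clamping bounds, rescaling, and other details are our assumptions:

```python
import torch
import torch.nn as nn

class MetaSamplerActor(nn.Module):
    """SAC policy network of the meta-sampler: one 55-unit hidden layer,
    then two heads for the mean and the log standard deviation."""

    def __init__(self, state_dim, hidden=55):
        super().__init__()
        self.body = nn.Sequential(nn.Linear(state_dim, hidden), nn.ReLU())
        self.mu_head = nn.Linear(hidden, 1)
        self.log_std_head = nn.Linear(hidden, 1)

    def forward(self, state):
        h = self.body(state)
        mu = self.mu_head(h)
        log_std = self.log_std_head(h).clamp(-20, 2)   # numerical safety
        dist = torch.distributions.Normal(mu, log_std.exp())
        u = dist.rsample()                  # reparameterization trick
        a = torch.tanh(u)                   # squash to (-1, 1)
        # Change-of-variables correction for the tanh squashing.
        log_prob = dist.log_prob(u) - torch.log(1 - a.pow(2) + 1e-6)
        return (a + 1) / 2, log_prob        # rescale the action to [0, 1]
```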

Fig. 3
figure 3

Seizure detection system structure diagram. (a) System flow chart, including the specific process for training the ensemble model of the framework. (b) Soft Actor-Critic (SAC) parameter update process in meta-training.

Channel decision fusion

The output for a sample was the predicted label of one channel within a segment. The predicted labels of all channels were summed and compared with a predefined threshold to obtain the predicted label of the segment: if the sum was greater than or equal to the threshold, the segment was labeled as a seizure; otherwise, it was labeled as non-seizure. The threshold was chosen from the range \((0.5C-0.25C,\;0.5C+0.25C)\), i.e., \((0.25C,\;0.75C)\), where C is the number of channels. The channel decision fusion scheme is shown in Fig. 4.
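A minimal sketch of this voting rule (names are illustrative):

```python
import numpy as np

def fuse_channels(channel_labels, threshold):
    """Sum per-channel predicted labels for one segment and compare with
    the threshold: >= threshold -> seizure (1), else non-seizure (0)."""
    return int(np.sum(channel_labels) >= threshold)

# Example with C = 22 channels: the threshold is chosen from
# (0.25 * C, 0.75 * C), i.e. (5.5, 16.5) for the TCP montage.
votes = np.array([1, 0, 1, 1, 0, 1, 1, 0, 1, 1, 0, 1,
                  1, 0, 1, 1, 0, 0, 1, 1, 0, 1])
print(fuse_channels(votes, threshold=11))  # -> 1 (seizure)
```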

Fig. 4
figure 4

Explanation of channel decision fusion.

Evaluation method

In this study, we employed sensitivity, specificity, accuracy, F1 score, and the Matthews correlation coefficient (MCC) as performance metrics for the model.

Sensitivity (SEN) represents the proportion of seizures correctly identified:

$$SEN=\frac{TP}{TP+FN}$$
(8)

Specificity (SPE) represents the proportion of correctly identified non-seizures among all non-seizures:

$$SPE=\frac{TN}{TN+FP}$$
(9)

Accuracy (ACC) is defined as the proportion of correctly classified samples:

$$ACC=\frac{TP+TN}{TP+FN+FP+TN}$$
(10)

The F1 score is a composite metric that integrates precision and sensitivity (recall) to provide an overall assessment:

$$F1=\frac{2TP}{2TP+FP+FN}$$
(11)

The Matthews correlation coefficient (MCC) is a crucial metric for evaluating binary classification models, especially on imbalanced datasets. MCC values range from −1 to +1: +1 indicates a perfect prediction, 0 a random prediction, and −1 complete disagreement between the prediction and the actual observation:

$$\text{MCC}=\frac{TP\times TN-FP\times FN}{\sqrt{(TP+FP)(TP+FN)(TN+FP)(TN+FN)}}$$
(12)

where TP denotes seizure samples correctly classified as seizure, FN seizure samples incorrectly classified as non-seizure, TN non-seizure samples correctly classified as non-seizure, and FP non-seizure samples incorrectly classified as seizure.
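For reference, Eqs. (8)–(12) computed from confusion-matrix counts (a straightforward sketch):

```python
import numpy as np

def metrics(tp, fn, tn, fp):
    """Eqs. (8)-(12) from confusion-matrix counts."""
    sen = tp / (tp + fn)
    spe = tn / (tn + fp)
    acc = (tp + tn) / (tp + fn + fp + tn)
    f1 = 2 * tp / (2 * tp + fp + fn)
    mcc = (tp * tn - fp * fn) / np.sqrt(
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return dict(SEN=sen, SPE=spe, ACC=acc, F1=f1, MCC=mcc)
```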

Preprocessing was conducted in MATLAB R2021b, while the algorithm was implemented in Python 3.7 using the PyTorch framework. The experiments ran on Windows 10 with an Intel(R) Core i7-7700 CPU @ 3.60 GHz. A portion of the hyperparameter settings used in the experiments is presented in Table 5. The TUSZ dataset used a fixed entropy regularization coefficient (α) of 0.1, whereas for the SWEZ dataset α was adjusted automatically to enforce the entropy constraint.

Table 5 Partial hyperparameter settings for TUSZ dataset and SWEZ dataset.

Results

Feature selection experiment

The four feature selection methods produced very similar results, so six features were selected for comparative experiments on the TUSZ dataset. In addition, a fifth method (mixture) was created by integrating the features common to the four selections. The experimental results of these five methods showed minimal disparity, with the Semi-JMI method proving most effective; detailed results are presented in Table 6. The Semi-JMI feature selection method was therefore used in all subsequent experiments.

Table 6 Performance results for the sensitivity, specificity, accuracy, and F1 score of the features selected according to the five feature selection methods.

Scalp EEG (sEEG) dataset experiment

We conducted cross-patient experiments on the sEEG dataset. Five experiments were performed using a decision tree as the base classifier, with the number of features set to 6, 9, 12, 30, and 60. The results are depicted in Fig. 5, which shows a notable decline in performance at 30 and 60 features, along with increases in running time and model complexity as the number of features grew. After weighing time cost against performance, we concluded that the model achieved optimal performance with 6 features, yielding a sensitivity, specificity, accuracy, and F1 score of 92.58%, 92.51%, 92.52%, and 75.54%, respectively.

Fig. 5
figure 5

A comparison of the sensitivity (SEN), specificity (SPE), accuracy (ACC), and F1 score of the five feature-set sizes in the model based on the decision tree classifier.

Our model decoupled model training from meta-training, enabling its application to a diverse range of machine learning models. Consequently, with 6 features, we employed four distinct classifiers as base classifiers: k-nearest neighbors (KNN), Gaussian naive Bayes (GNB), decision tree (DT), and adaptive boosting (AdaBoost). The number of meta-training episodes (EPI) was set to 70 because KNN required a longer execution time than the other three methods. The model attained optimal performance with DT as the base classifier, as illustrated in Fig. 6: sensitivity, specificity, accuracy, F1 score, and MCC were 92.58%, 92.51%, 92.52%, 75.54%, and 0.7303, respectively. Note that all classifier parameters used the default settings of the scikit-learn package; for the ensemble classifier (AdaBoost), n_estimators was set to 10.
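Because training is decoupled from meta-training, swapping base classifiers amounts to swapping scikit-learn estimators; a sketch with the defaults stated above (the dictionary layout is ours):

```python
from sklearn.tree import DecisionTreeClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.ensemble import AdaBoostClassifier

# Default scikit-learn settings, as in the text; AdaBoost capped at 10.
base_classifiers = {
    "DT": DecisionTreeClassifier(),
    "KNN": KNeighborsClassifier(),
    "GNB": GaussianNB(),
    "AdaBoost": AdaBoostClassifier(n_estimators=10),
}
# Each candidate can be dropped into train_ensemble() from the earlier
# sketch unchanged.
```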

Fig. 6
figure 6

Comparison of sensitivity (SEN), specificity (SPE), accuracy (ACC), F1 score, and Matthews correlation coefficient (MCC) of a classifier based on decision tree, k-nearest neighbor, Gaussian naive Bayes and adaptive boosting at episode = 70.

Intracranial EEG (iEEG) dataset experiment

The iEEG dataset experiment was a patient-specific study that evaluated the sensitivity, specificity, accuracy, and F1 score of a model using a DT as the base classifier for fifteen patients. The model achieved an average sensitivity of 98.56%, average specificity of 98.82%, average accuracy of 98.7%, and average F1 score of 96.85%. The results for each patient are illustrated in Fig. 7. Sensitivity, specificity, and accuracy exceeded 95% for fourteen patients, whose F1 scores surpassed 90%. Notably, all metrics for patients 3, 7, 8, 10, and 11 achieved optimal performance. Conversely, patient 13 exhibited suboptimal outcomes, which might be attributed to a combination of frontal lobe epilepsy and an excessive number of electrode channels during channel decision fusion56. Consequently, the final prediction label for this patient was determined by combining the threshold value with the five adjacent detection results.

Fig. 7
figure 7

Performance of the respective cross-validation experiments on sensitivity, specificity, accuracy, and F1 score in fifteen SWEZ patients.

Concurrently, excessively long records were split into two segments because of the disparate durations of each patient's records, adding an extra fold to enhance the overall detection effect, e.g., the 5th fold of patient 6 (see Fig. 8). The results of the 14-fold cross-validation for patient 4 are presented in Fig. 9. All indices for the 3rd and 4th folds reached 100%, indicating that Semi-JMI selected highly representative features and accelerated the convergence of the model.

Fig. 8
figure 8

The 5-fold cross-validation result for patient 6, with the fifth fold added by splitting the excessively long record into two parts.

Fig. 9
figure 9

The 14-fold cross-validation result for patient 4.

Discussion

Robustness to corrupted labels

In practice, label contamination and damage are unavoidable in the preprocessing of large raw datasets and in real-world medical applications, and missing label data poses a significant challenge to the algorithmic process. The \(p_{s1}\) values were set to three distinct levels to create semi-supervised datasets with labeled proportions of 75%, 50%, and 25% under the missing-completely-at-random (MCAR) condition. The unlabeled data were treated as corrupted-label data, whereas the training subsets comprised the labeled data at proportions of 75%, 50%, and 25%, respectively. These subsets were then used to assess the robustness of the algorithm. The experimental results with GNB as the base classifier are depicted in Fig. 10. The 25% training subset exhibited the worst performance, with sensitivity, specificity, F1 score, and accuracy of 0.9053, 0.9061, 0.7062, and 0.9061, respectively; this outcome was nevertheless deemed acceptable. The F1 score could only reach 70–75% because the imbalance ratio of the test set approached 7. In contrast, the sensitivity and specificity of the four datasets showed minimal disparity. The 75% training subset yielded the most imbalanced result, which explains the positive values in its performance loss. Eleven of the twelve performance-loss values across the three subsets were below 2.5%, and those of the 50% and 75% subsets were well below 1%. This demonstrates the robustness of the algorithm to label corruption. Furthermore, by optimizing for generalization performance, the meta-sampler effectively mitigated overfitting in the ensemble classifier.

Fig. 10
figure 10

(a) Performance of 25%, 50%, 75%, and full training sets on sensitivity, specificity, accuracy, and F1 score. (b) Performance loss of 25%, 50%, 75%, and full training sets on sensitivity, specificity, accuracy, and F1 score.

Comparisons

Table 7 summarizes seizure detection studies conducted in recent years on the TUSZ dataset, demonstrating the benefits of our study and providing comprehensive information on study years, methodologies, and performance metrics. Note that all these methodologies are segment-based cross-patient experiments. Compared with 13 distinct methodologies, our algorithm achieved the highest accuracy and F1 score, ranking second in sensitivity and third in specificity. The method of Tsiouris et al.38 demonstrated a notable sensitivity of 93.94%; however, it was designed to detect absence seizures exclusively, so only absence seizure data were used in its patient-independent assessment. Golmohammadi et al.57 employed a deep learning method with a mixed deep structure to achieve optimal specificity, but the sensitivity was only 30.83%, the most imbalanced result in the table. The five most recent methods in the table all produced imbalanced results, whereas our sensitivity and specificity differed by only 0.07%, indicating the effectiveness of our method on class-imbalanced data. In a related study, Abou-Abbas et al.5 extracted 44 features from disparate domains and applied random forest after feature selection, yielding 94.12% sensitivity and 91.55% accuracy, comparable to our findings. This suggests that, for the TUSZ dataset, manual feature extraction may prove more effective than automatic feature extraction, i.e., feeding data directly into a network.

Table 7 Comparison of existing methods for detecting epileptic seizures using scalp electrical signals in TUSZ dataset.

Table 8 presents the classification performance of this study and the four most recent algorithms on the SWEZ dataset. The average sensitivity and average specificity in this study were superior, at 98.56% and 98.82%, respectively, surpassing the second-place scores of 96.81% (SEN, MIX + HD67) and 98.78% (SPE, MIX + SVM67). Concurrently, our algorithm exhibited the most balanced performance among the five results, with a difference of only 0.26% between sensitivity and specificity. Once the optimal sampler was identified, considerable discrepancies in feature representativeness and data volume across the folds of each patient resulted in significant variation in ensemble training latency; the maximum latency of 7.14 s was therefore reported as the final outcome. The average sensitivity and specificity of the four compared algorithms were recalculated over the fifteen patients from the original publications.

Table 8 Comparison of existing epileptic seizure detection methods using iEEG signals in SWEZ dataset.

Table 8 also presents the cross-validation results for each individual patient. Our algorithm demonstrated superior performance in six patients, including patient 4, who was diagnosed with parietal lobe epilepsy56; its sensitivity and specificity exceeded 96%, outperforming the other algorithms. For patient 12, the specificity of the MIX + SVM method was 99.54% but its sensitivity only 90.5%, an unbalanced result compared with the 99.31% and 98.47% reported in this study. For patients 6 and 16, our results were lower than those of the other algorithms because the imbalance ratio of one fold was much larger than that of the others, which affected the final result. The suboptimal performance for patients 13 and 1 could be attributed to the excessive number of electrode channels, which impaired channel decision fusion, as previously discussed.

Two experiments were conducted on the TUSZ and SWEZ datasets: a cross-patient study to assess the model's performance on previously unseen patients, and a patient-specific investigation to develop a highly tailored model for individual patients. The results of both experiments were favorable, demonstrating the model's capacity for generalization and personalization. The minimal difference between sensitivity and specificity indicated that the model could accurately identify both seizure and non-seizure samples. Moreover, once the optimal meta-sampler was determined, our algorithm exhibited a total delay of 16–18 s on the extensive TUSZ dataset, which is sufficient for real-time seizure monitoring.

Yuzhe Yang et al.69 showed that a greater quantity of unlabeled data reduces label bias in a semi-supervised manner, thereby significantly enhancing the performance of the final classifier. Our future objective is to identify a task-independent meta-state suitable for semi-supervised EEG data, with the intention of significantly enhancing the final classification results. Concurrently, a recent study indicated that burst energy70, an emerging engineering characteristic, exerts a substantial influence on classification performance: by distinguishing between abnormal and normal EEG across energy levels, accurate brain activity information can be extracted from the continuous wavelet transform scalogram. This approach shows significant potential for enhancing the accuracy of epilepsy recognition. In future work, this approach and other methods for addressing imbalanced data classification, such as stratified sampling, will be incorporated to further improve seizure detection accuracy. In addition, both Baghersalim et al.64 and Sabor et al.61 introduced electrocardiogram (ECG) signals to aid EEG-based seizure detection, providing another clear direction for future research.

Conclusion

In this study, a framework coupling meta-sampling with an ensemble classifier was developed to address the class imbalance between seizure and non-seizure data in epileptic EEG datasets. This framework optimized the sampling strategy of the meta-sampler through meta-training, adaptively selected training data, and learned an effective cascaded ensemble classifier from imbalanced EEG data. The proposed method was successfully applied to both scalp and intracranial EEG datasets. On the TUSZ dataset, sensitivity, specificity, and accuracy were 92.58%, 92.51%, and 92.52%, respectively; on the SWEZ dataset, the average sensitivity, specificity, and accuracy were 98.56%, 98.82%, and 98.7%, respectively. A comparative analysis with existing methods demonstrated superior sensitivity, specificity, and accuracy on both scalp and intracranial EEG datasets. Additionally, the method demonstrated robustness to label damage, as evidenced on the TUSZ dataset with 25% label loss. In future work, deep learning models will be incorporated into this framework to automatically generate EEG features and further improve seizure detection.