Motor imagery classification using sparse representations: an exploratory study

de Menezes, José Antonio Alves; Gomes, Juliana Carneiro; de Carvalho Hazin, Vitor; Dantas, Júlio César Sousa; Rodrigues, Marcelo Cairrão Araújo; dos Santos, Wellington Pinheiro

doi:10.1038/s41598-023-42790-y

Download PDF

Article
Open access
Published: 20 September 2023

Motor imagery classification using sparse representations: an exploratory study

José Antonio Alves de Menezes^1,2^na1,
Juliana Carneiro Gomes¹^na1,
Vitor de Carvalho Hazin²,
Júlio César Sousa Dantas²,
Marcelo Cairrão Araújo Rodrigues³ &
…
Wellington Pinheiro dos Santos^1,4^na1

Scientific Reports volume 13, Article number: 15585 (2023) Cite this article

2813 Accesses
3 Citations
Metrics details

Subjects

Abstract

The non-stationary nature of the EEG signal poses challenges for the classification of motor imagery. sparse representation classification (SRC) appears as an alternative for classification of untrained conditions and, therefore, useful in motor imagery. Empirical mode decomposition (EMD) deals with signals of this nature and appears at the rear of the classification, supporting the generation of features. In this work we evaluate the combination of these methods in a multiclass classification problem, comparing them with a conventional method in order to determine if their performance is regular. For comparison with SRC we use multilayer perceptron (MLP). We also evaluated a hybrid approach for classification of sparse representations with MLP (RSMLP). For comparison with EMD we used filtering by frequency bands. Feature selection methods were used to select the most significant ones, specifically Random Forest and Particle Swarm Optimization. Finally, we used data augmentation to get a more voluminous base. Regarding the first dataset, we observed that the classifiers that use sparse representation have results equivalent to each other, but they outperform the conventional MLP model. SRC and SRMLP achieve an average accuracy of $75.95\%$ and $82.51\%$ respectively while the MLP is $72.38\%$, representing a gain between $4.93\%$ and $14\%$. The use of EMD in relation to other feature processing techniques is not superior. However, EMD does not influence negatively, there is an opportunity for improvement. Finally, the use of data augmentation proved to be important to obtain relevant results. In the second dataset, we did not observe the same results. Models based on sparse representation (SRC, SRMLP, etc.) have on average a performance close to other conventional models, but without surpassing them. The best sparse models achieve an average accuracy of $95.43\%$ among the subjects in the base, while other model reach $98.33\%$. The improvement of self-adaptive mechanisms that respond efficiently to the user’s context is a good way to achieve improvements in motor imagery applications. However, other scenarios should be investigated, since the advantage of these methods was not proven in all datasets studied. There is still room for improvement, such as optimizing the dictionary of sparse representation in the context of motor imagery. Investing efforts in synthetically increasing the training base has also proved important to reduce the costs of this group of applications.

Adaptive EEG preprocessing to mitigate electrode shift variability for robust motor imagery classification

Article Open access 19 November 2025

Motor imagery EEG signal classification using novel deep learning algorithm

Article Open access 08 July 2025

A sparrow search algorithm-optimized LSTM framework with EMD denoising for rolling element bearing remaining useful life prediction

Article Open access 02 March 2026

Introduction

The brain–computer interface (BCI) has enabled the control or communication with various devices through brain sensors¹ and opens space for the most varied applications such as touch-free text input systems, controls for: wheelchairs, cursors, prostheses and exoskeletons and virtual reality systems². The EEG plays an important role in several types of brain applications and was popularized for being non-invasive. In addition to its common use in disease diagnosis, sleep assessment and neurofeedback, the EEG has also been widely used in BCI applications³, especially using motor imagery (MI-BCI), that consists in limb movement imagination. This cognitive task is reflected in the motor cortex (central surface area of the brain) and occurs in waves of varying frequencies and can represent the imagination of the limbs: right, left hands and feet. Identification and classification can be useful for the creation of interfaces that allow the communication of people with some communication and movement disability.

Because the cortex is the brain region closest to the skull, the EEG becomes a good alternative for identifying motor imagery in a non-invasive way. However, there are challenges, as it is not simple to recognize patterns in the brain signal and classify them as motor imagery. That is why numerous computational methods are researched in order to obtain more accurate results. In this sense, we seek to identify and classify motor imagery in order to have a set of brain commands that will compose the interface. However, the nature of the brain signal picked up by the EEG is non-stationary and poses challenges. The signals are so diverse and time-varying that the usefulness of pattern recognition computational models is limited to the time close to the signal collection. Furthermore, a long recording time is required to achieve good performance for these models. Which in practice can make the user reluctant to use it⁴.

Classifications based on sparse representations of some EEG feature (SRC)^5,6 are increasingly useful to obtain good performance from untrained patterns⁷. This way, allowing a smaller volume of data to be necessary in the training process of a classification model^2,4,8 seek to improve this approach by focusing on optimizing and learning dictionaries for sparse representation^9,10,11, in turn, choose to focus on improving the features that will be represented.

Empirical mode decomposition (EMD)¹² is characterized as a good method to contribute to the generation of features extracted from non-stationary signals. Thus, several works use it in the BCI based on Motor Imagery^13,14,15,16. This method presents itself as an option to be combined with SRC and which has not yet been explored.

Taking into account that the motor imagery problem is dependent on each subject, due to neuroplasticity characteristics, requiring customized solutions. Therefore, several works investigate the optimization of windows for each individual, combining this approach with features extracted from common spatial patterns (CSP)^{17,18,19,20,21,22,23,24,25,26} Additionally, the frequency separation can also be optimized, aiming to increase the recognition accuracy of imagined movements, also employing features extracted from CSP^10,27,28,29. In this exploratory research, we seek to investigate the use of empirical mode decomposition (EMD) and sparse classifiers in the task of pattern recognition in motor imagery. Our hypothesis is that the decomposition of the EEG signal using EMD is more adequate to the non-stationary nature of these signals, and may generate competitive results when appropriate classifiers are used for the sparse representation provided by this approach.

We propose to evaluate the use of SRC of features derived from EMD, in order to verify the hypothesis that this combination is promising in the classification of Motor Imagery in a multiclass problem. We compare the SRC model with a conventional model (MLP). We also evaluated a hybrid approach for classification of sparse representations with MLP (RSMLP). For both approaches we use features derived from EMD. However, we also extract these features directly from the conventional frequency bands, bypassing the application of EMD, in order to arrive at a second comparison. To overcome the difficulty imposed by the base size limitation, we used data augmentation. Feature selection methods were used to select the most significant ones, specifically Random Forest and particle swarm optimization (PSO).

The structure of the subsequent sections is organized as follows: in “Related works” section, we comment on works related to MI-BCI and particularly related to SRC and EMD techniques; in “Materials and methods” section, we present the used materials, methods and theoretical concepts necessary for a good understanding of this work; in “Results” and “Discussion” sections we show the results and discussion. Finally, our conclusions and future works are described in “Conclusion” section.

Related works

Motor imagery based BCI

Several works have proposed methods aiming to improve motor imagery classification. ³⁰experimented different techniques to adjust CNNs, for instance. In this study, they tested a subject-independent categorization to overcome the small amount of EEG signals per participant. The main idea of the work is to train the network with data from different subjects. Then, using the data from the chosen subject, the pre-trained model is altered and adjusted. With that in mind, the researchers tested a variety of network adaptation mechanisms: classifier optimization, adaptation of convolutional layers, and multiple proportions of training (the number of layers retrained with the data of the patient). In order to test the proposed method, they used EEG signals from 54 healthy participants who imagined right and left-hand movements. Spatial and temporal filters with max-pooling, three convolutional blocks, and a fully connected layer with Softmax were employed in the CNN. Finally, the study found that, when compared to standard methods (with subject-specific training), the proposed method had a $32.50\%$ greater accuracy. In addition, the best results were achieved with adaptation of three of the four convolutional layers, as well as the Softmax layer. This means that only the first layer was properly trained with data from other patients.

As in the previous work, ³¹also proposed a subject-independent training approach. The authors idea was to train BCIs with data from numerous participants and transfer the learning to classify a new subject. Thus, the authors modified the classic CSP method and applied a bandpass filter. With this, they designed a structure with greater discriminative ability, called separate channel convolutional network (SCCN). The methodology was tested on dataset 2b from the BCI Competition IV (2008). Two measures were used to evaluate the results: accuracy and Information Transfer Rate (ITR). The proposed technique had an ITR of 0.83 and an average accuracy of $64\%$. Unfortunately, simpler classifiers, such as LDA, produced better results ($65\%$).

³²tested another novel technique: the combination of numerous CNN models with varying depths and filters to improve motor imagery categorization results. In this case, the CNN models were based on the AlexNet architecture. The features extracted by the various models are then combined and sent into an MLP (MCNN Method) or an autoencoder as an input (CCNN Method). The authors experimented four types of CNNs, ranging from one to four convolution blocks and max-pooling, as well as filters with sizes ranging from 10 to 30. The CNN models were trained and tested with two datasets: the High Gamma Dataset, which is made up of 20 volunteers’ EEG signals, and the BCI IV-2a (BCID) dataset. The BCID is made up of electroencephalogram (EEG) data from nine participants who imagined four different sorts of movements: right hand, left hand, foot, and tongue. In terms of training, the study looked at two approaches: subject-specific training and training based on signals from all volunteers. Therefore, based on the BCID, the first approach had an average accuracy of $75.7\%$. In the second approach, the CCNN outperformed the others, with an average accuracy of $55.34\%$.

In turn, ³³concentrated on two methods: modifying the CNN’s kernel dimensions, and data augmentation for each subject. The researchers looked at how kernel size differed between participants and between imaging sessions for the same subject. This analysis found a $10\%$ difference in accuracy with kernel variations. Based on these findings, they presented a CNN with a hybrid convolution scale, named HS-CNN. Furthermore, the paper also presented and tested a new way of data augmentation, in which signal windows are recombined in the time and frequency domains. The researchers used two BCI Competition IV databases to validate the method: 2a and 2b. Each dataset is made up of nine healthy people who imagined four and two types of movements, respectively. The proposed method has an average classification accuracy of $87.6\%$, with improvements of up to $23.25\%$ for dataset 2a and $19.7\%$ for dataset 2b.

In contrast, ³⁴invested in new methods of representing EEG signals for classification of motor imagery. Instead of using EEG signals in one dimension, the authors proposed a way to represent them in three dimensions. In this way of representing, the authors sought to keep the spatial distribution information of the electrodes in addition to the temporal information. For feature extraction and classification steps, they developed a CNN also in 3D (Multi-branch 3D CNN). The idea with this architecture is to combine multiple CNNs with different receptive fields. Finally, this methodology was tested with data from the BCI Competition IV 2a database, and the Multi-branch 3D CNN achieved superior results were compared with a Small, Medium and Large Receptive Field networks only.

EMD and SRC MI-BCI

Recognizing patterns from sparse representations, although it emerged in the field of image classification^35,36, is becoming increasingly useful in the classification of biological signals. ³⁷review methods of sparse representations of EEG signals for detecting epilepsy. ⁷propose a classification model based on the smallest residual produced in the reconstruction of a sparsely represented sample. The technique is combined with extreme learning machine (ELM), a random-weighted one-layer neural network^38,39,40, to classify myoelectric signals in the control of prostheses. Signals were collected from two amputees, with and without a prosthesis, in addition to eight healthy individuals. The effectiveness of the method was evaluated both in offline and online mode. In the first, the subjects perform suggested movements in random positions of their amputated arm, varying the positions in the three dimensions. The classifier is trained however using only one position while the others are used as a test. When the ELM does not sufficiently discriminate an SRC move is used to resolve the impasse between the doubtful classes. In online mode, subjects perform suggested moves in four untrained positions while receiving real-time feedback using the same classifier combination as in the previous mode. The method was compared with six other classifiers trained with three feature groups. In all groups of subjects, the proposed method was the one that obtained the lowest error rate with significant performance improvement ($p < 0.001$) for the untrained positions.

This SRC model based on the smallest residual is still little applied in MI-BCI studies. Therefore, it is of interest to this work to evaluate the technique in this new context. The sparse sample is used to reconstruct the original sample which will be classified according to the class that produced the smallest residue during its reconstruction. A better description of the method can be found in the "Feature selection" section.

^5,6appear among the first studies exploring SRC in the classification of motor imagery. Since then, sparse representation has been increasingly used in this type of problem. For us, approaches that seek to improve the features that will be sparsely represented are interesting to situate the state of the art. However, these approaches do not classify the samples themselves, but their sparse representations. This differs from the classification by the smallest⁷ residue studied in our work. But we find it appropriate to cite the pioneering studies on the combination of sparse representation techniques in the context of MI-BCI.

⁹propose a classification mechanism for the sparse representations of optimized features in space, time and frequency. The best EEG channels are selected based on relative entropy (spatial optimization), the signals from this subset of channels are decomposed into several overlapping frequency bands (frequency optimization) and time-segmented into overlapping windows (temporal optimization). Features based on the CSP are then extracted and the most significant are selected by sparse regression. Finally, a dictionary is generated and optimized to aid in sorting. The approach was evaluated in two databases and obtained an improvement of 21.57% and 14.38% compared to other methods applied in the same databases. In addition to comparison with other methods, the study assesses the impact of dictionary optimization by comparing SRC results with and without dictionary optimization, obtaining a significant improvement of 9.44%.

¹⁰use the same time-frequency feature optimization method as the previous study⁹ but uses sparse representation as an feature selector technique to train an SVM classifier^41,42. The approach was evaluated in three databases where mean accuracies of 88.5%, 83.3% and 84.3% were obtained, obtaining significant improvement ($p<$ 0.01) in relation to the other compared methods.

EMD is another technique that can be used to improve features for classifying motor imagery, but it has not yet been combined with the SRC technique ⁴³use EMD to decompose the signal into five components, called Intrinsic Mode Functions (IMF), to extract three features. The resulting feature vector is used to train an artificial neural network and obtains good results in discriminating four classes of motor imagery. The technique is compared to Discrete Wavelet Transform (DWT). EMD traits achieve an average accuracy of 90.02%, while DWT traits average an accuracy of 84.77%. ⁴⁴have a similar application, but extracts up to seven features from the produced IMFs. The proposed model consists of a three-layer hierarchical SVM classifier, which surpasses the compared model (traditional SVM). ¹²offer an analysis of the applicability of EMD in the identification of motor imagery. In addition to generating features, the technique can also be used to remove artifacts⁴⁵. ⁴⁶use EMD to compose an enhanced EEG signal, considering the average frequency of the MFIs. Three Hjorn parameters plus the absolute power are extracted from the new signal to compose the feature vector. The LDA classifier is then applied in the classification. It was observed that the technique can improve on average $10.5\%$ of classification accuracy. ⁴⁷evaluate EMD multivariate extension (MEMD) in MI-BCI. Searches for peaks and troughs in n directions (multichannels) of the input signal improving the location of frequency information. SVM is applied in classification.

Regarding EEG signals classification, successful approaches have been adopting sparse representation, since these applications are strongly dependent on frequency representation, which tends to present sparse features obtained by sequential filtering as the most appropriate candidates to represent this information. For instance, ⁴⁸proposed a method to detect seizure EEG signals by empirical mode decomposition (EMD) and sparse representation. The EMD method is used to process EEG signals for generating IMFs. Then frequency features of IMFs are extracted as the dictionary in sparse representation based classification (SRC) scheme to realize reduction of the data dimension and calculation cost. The generated dictionary is automatically optimized by a clustering algorithm, similarly to^17,18, that adopted common spatial patterns (CSP) instead of IMFs. Then, ⁴⁸demonstrated that this approach is able to detect the seizure EEG signals with a considerably high accuracy: up to 99%. According to⁴⁸, this approach is fast and practical for the treatment of epilepsy in practice. We understand that combining the two approaches EMD and SRC can help in the classification of motor imagery.

An SRC classifier depends on its dictionary matrix, and its construction is an important step for good performance. ⁵uses features based on CSP to demonstrate that the property of incoherence between signals collaborates so that a new sample can be predominantly represented by one of these signals. Examining the EMD, it is important to note that the IMFs tend to have different frequencies, which guarantees a degree of incoherence between them most of the time, since the coherence is often calculated in terms of frequency, comparing the spectral components in each frequency. Therefore, signals with different frequencies have different spectral characteristics and do not share similar information at these frequencies. However, it is a particularly difficult point to guarantee a high level of incoherence at all times. The IMF components can have a wide frequency range in some cases, due to the non-stationary nature of the signals, being impossible to determine in advance due to the empirical nature of the method. However, by observing the components, we can see that most of the time these frequencies are different. The interest in associating EMD and SRC in this work is due to the particular advantages of combining EMD with brain signals, due to its non-stationary character. Also for the important results achieved with sparse representations in the classification of these signs pointed out by many studies. Therefore, it is in our interest to explore this combination that is still little evaluated and to document our findings in this study. However, it is worth remembering that this approach has already been evaluated in the context of signs in epilepsy by⁴⁸, confirming the feasibility of the technique.

Materials and methods

Databases

Two datasets were used in this work. First database was provided by the Neurodynamics Research Group of the Federal University of Pernambuco. It consists of 21 EEG signal collections from a single healthy adult male. The collections were performed on alternate days and each one consists of 60 trials of 8 s. In each trial, in turn, the motor imagery of the right, left or feet is performed. EEG data were collected through the g.Hamp amplifier set to a sampling rate of 256Hz. In addition, 27 active channels were distributed over the premotor, motor and sensoriomotor cortex. After preparation and placement of the electrodes, the data acquisition protocol was performed as follows (Fig. 1):

1.
The individual wearing an EEG sits in front of a Monitor.
2.
In a few moments, a cross appears to alert you of an impending instruction.
3.
An arrow will randomly indicate which activity to do in that trial. E.g.: arrow to the right will indicate motor imagery of the right hand, etc.
4.
The individual performs the indicated instruction within 8 s of the trial. It may or may not receive real-time feedback on the quality of its execution.
5.
The screen is completely white and the subject rests for 4 s.
6.
The process is repeated until completing 60 trials.

The study was conducted in compliance with the guidelines of Resolution 466/2012 of the National Health Council and the Declaration of Helsinki of 1964 and approved by the Ethics Committee of the Health Sciences Center of Federal University of Pernambuco, CAAE number 79271517.2.0000.5208, Recife, Brazil. All participants signed the Informed Consent Term (ICT) prior to the start of the research. Patients were informed of the goals and procedures, as well as the risks and benefits, and could interrupt the assessments at any time, without the need for further explanation.

The second base used was the BCI Competition IV dataset 2b, which is widely used in the classification of motor imagery of the upper limbs. Dataset 2b provides signals from 3 brain sites: C3, Cz and C4, organized into several trials labeled with the imagery class. And it consists of 5 sessions of 9 subjects. Each session has several attempts for each motor imagery class that last approximately 8 s. More details about this base can be found in⁴⁹. In this work we used the 5 sessions of each subject. For each trial we used the first 3 s of imagery.

Proposed classification methodology of EEG signals by sparse classifiers and empirical mode decomposition

In this work, we propose to evaluate the effectiveness of sparse representations in classifying motor imagery from features based on EMD. For this, we use processing, extraction and selection methods of features already related to the manipulation of EEG signals. Figure 2 shows how these methods are embedded in our goal flow. The “Signal pre-processing and feature extraction” section brings them up in detail.

Signal pre-processing and feature extraction

The first step of our processing consisted of separating and organizing the trials for each collection. At this stage the signal could be filtered at 0.5 and 60 Hz and normalized. EMD (Empirical Mode Decomposition) was applied to all channels. At least 5 IMFs (Intrinsic Mode Functions) were obtained, they approximate the conventional frequency bands (delta, theta, alpha, beta and gamma) but are more suited to the non-stationary nature of the EEG signal. However, it was also an option to use a filter bank with the following bands: 4–8 Hz, 8–12 Hz, 12–16 Hz, 16–20 Hz, 20–24 Hz, 24–28 Hz, 28–32Hz . These seven bands may allow for better adaptation of a subject’s signal processing. Especially the 4–12 Hz and 12–16 Hz bands, which are conventionally called alpha and slow beta, are related to motor activity. Abordagens semelhantes podem ser encontradas em^10,27,28,29.

Once the trial was decomposed or filtered, we segmented it into 1 s windows and used only the last 5 s to extract features. We can consider that the initial seconds of a trial contain still scattered imagery information, since the individual was waiting for instructions, so we chose to discard the first 3 s of the trial. The extracted features were energy, sample entropy and absolute power of the segments.

Empirical mode decomposition—EMD

EMD is a self-adaptive method that decomposes the signal without leaving the time domain. It is useful for analyzing non-stationary and non-linear signals such as brain signals. It consists of breaking the signal into a finite number of intrinsically-mode functions (IMF)¹². For a given signal x(t) it can be decomposed into n IMF c(t) and a residue r(t).

$$\begin{aligned} x(t) = \sum _{i=1}^{n} c_i(t) + r(t) \end{aligned}$$

(1)

IMF are in this case adaptedly derived directly from the input signal. An IMF must satisfy the following requirements:

1.
The sum of the number of maximums and minimums of the IMF must be equal to the number of zero crossings or differ at most by one;
2.
The mean between the envelope defined by the local maximum and the envelope defined by the local minimum must be zero at any point in the IMF.

⁴³presents a good scheme of how the algorithm to obtain the IMFs works and can be seen in Fig. 3. At first, the search for an IMF is performed on the input signal. This search consists of determining local maxima and minima forming an empirical function. When this function meets the criteria described above, it is considered an IMF. From there it is subtracted from the original signal and the process is repeated on the resulting signal. The process ends when the resulting signal is a monotonic function. As it is empirical in nature, the amount of IMFs produced by a signal is not delimited and may vary between different signals. However, it is noticed that the first IMFs have a higher frequency while the last ones make up the linear trend of the signal, as shown in the Fig. 4.

Feature selection

In dataset 1, each trial processed carries information from 27 channels, decomposed into 5 IMFs, segmented into 5 windows of one second, where 3 functions were calculated for each segment. So our feature vector has a size of 27 $\times$ 5 $\times$ 5 $\times$ 3 = 2025 features. In dataset 2, trials has 3 channels, 5 IMFs, 3 windows of one second and 3 functions (3 $\times$ 5 $\times$ 3 $\times$ 3 = 135 features). It is therefore a high number, capable of dispersing the low amount of trials available and making the sorting activity very costly. Then we reduce the vector size using a heuristic approach. It is known by Neuroengineering that certain brain sites are more likely to concentrate motor imagery information. These sites are called C3, C4 and Cz. Both are located close to the central sulcus, in the primary motor cortex and are related to motor activity in the right hand, left hand and feet, respectively. It should be noted that these are the classes we are trying to identify. It was also possible to bet which IMFs would be more likely to contain motor imagery. We chose the 3rd and 4th IMFs because of the similarity^43,44 with beta and alpha bands, commonly related to motor activity.

So our new vector has 3 $\times$ 2 $\times$ 5 $\times$ 3 = 90 features. We thus mitigate the risk of underfitting signaled by the size of the initial vector and reduce classification costs. However, it was still of our interest to use computational methods of feature selection, motivated by the notion that we could be missing some important information for the discrimination of the three classes of imagery. Added to the complexity of the brain signal, the interconnection of brain areas does not exhaust the possibility of the information we seek to be reflected in other brain sites. Two alternatives to heuristic feature selection are Particle Swarm Optimization (PSO) and Random Forest aided feature selection. These methods are explained below.

Particle swarm optimization—PSO

Particle swarm optimization (PSO) is a widely established method that works as decision trees, where features are ranked according to accuracy. The PSO uses an objective function that seeks to minimally penalize ranking performance, keeping it as high as possible with a reduced number of features^{50,51,52,53,54,55,56,57,58}.

The PSO algorithm was inspired by a study of groups of birds looking for a place to build their nests, as well as looking for food. In this case, the movement of the entire pack is based on the overall intelligence of the group. With that in mind, the algorithm initially uses a population of randomly generated particles or individuals, each individual being a candidate for the solution of the fitness function. Each individual is associated with a position vector and a velocity vector. The flock’s trajectory is then guided by two sources of information: the position of the best particle in relation to the aptitude function and knowledge of the places previously visited by each particle. In this way, at each iteration of the algorithm, the positions and velocities of the particles are adjusted towards the best global and individual position^{59,60,61,62,63}.

Feature selection by PSO was implemented in the database using 30 particles and k-NN as objective function estimator. The individual particle (parameter $c_{1})$ and global consciousness were adopted as 0.5. The inertia factor w was 0.9 and the number of neighbors was 30.

Random forest for feature selection

In addition to PSO, we test feature selection with decision trees, which can be used in classification and regression problems. In particular, Random Forests are a combination of trees that are built with different samples from the original database. Essentially, each tree is made up of four types of nodes: root, leaf, parent and child nodes. The starting point is the root and the terminal nodes are the leaves. Thus, using such trees, the algorithm makes a decision after following a hierarchical path that starts at the root node and reaches the leaf node. Thus, after the formation of the forest, a new object that needs to be classified is placed for each of the trees to classify. At the end, each tree casts a vote that indicates its decision on the object’s class. The forest then chooses the most voted class for that⁶⁴ object.

In the case of feature selection, the measure of importance of each feature can be calculated according to the reduction of the model’s accuracy when the feature in question is not included in the tree. In other words: decision trees based on less relevant features will have a lower ranking performance⁶⁵. In our study, Random Forests were implemented using 100 trees or iterations.

Classification

The classification activity consisted of estimating which of the motor imagery classes (right hand, left hand and foot) a trial belongs to. It is an offline classification of trials (8-s activities of intense motor imagination). However, it was also part of this study to explore the classification of segments (1 s in length). This being a simulation of what would be an online application to identify motor imagery.

Figure 5 shows the different approaches to classification used in this study. Assuming that the SRC holds promise for classifying untrained data, it was necessary to compare it with another “common” classifier. We therefore chose the multilayer perceptron neural network (MLP). However, it was also possible to explore the potential of sparse representations in a classifier other than SRC. We then added to our analysis a hybrid approach that used sparse representation but used MLP for classification.

Sparse representation classification

Frequently, the use of classifiers in real situations exposes the model to untrained conditions. The SRC is an algorithm that seeks greater robustness in predicting these untrained conditions. Figure 6 illustrates the idea behind using a sparse representation for classification. While bounds-based algorithms cannot separate data from untrained conditions, in SRC these data are reconstructed from a minimal subset of training samples and the reconstruction is then classified instead of the original sample. An SRC classifies samples based on the residual resulting from the reconstruction of that same sample after it has been sparsely represented. The reconstruction with the smallest residue should determine which class the sample belongs to.

A sparse representation is a linear combination of a set of vectors, called a dictionary, to obtain a sparse vector, which has a large number of null or near-null values^17,18 The dictionary, in turn, is composed of samples from the training base. These samples already have their features extracted. The classification of a new sample is done by observing the weights of the linear combination that most contribute to the construction of the sparse vector. Members of the correct class are expected to contribute more weight.

Let D be a training dictionary, an array composed of $D_i$ partitions, with $i=1, \dots , n$, where n is the number of classes, and $D_i$ is composed of the class’s training samples i. Thus D forms a matrix partitioned by elements of each training class. For example, in a problem to distinguish upper limb motor imagery, the training dictionary can be defined as the union of 2 sets of left and right hand samples: D = [ $D_{left} \mid D_{right}$ ]. A new sample y is reconstructed from a linear combination Dx where $x = [x_1, x_2, \dots , x_L]^T$ are the weights associated with each training sample. A sparsity constraint will limit the linear combination to a minimal subset of training samples, i.e. most of the weights should be zero resulting in a sparse vector. This restriction is imposed through the optimization problem

$$\begin{aligned} x = \mathop {\textrm{argmin}}\limits _x \left\{ \frac{1}{2} ||Dx - y||_2^2 + \lambda ||x||_1\right\} \end{aligned}$$

(2)

where $\lambda > 0$ is a sparsity smoothing parameter. Since the transformation ($Dx = y$) will hardly be exact, the residual is then calculated for each class i

$$\begin{aligned} r_i = ||D_ix_i - y||_2 \end{aligned}$$

(3)

and the one with the smallest residual is the predicted class for the y sample. Following the example of the motor imagery classification of upper limbs, a new sample (y) is sparsely represented ($y_2$) and then $r_{left}$ (by $D_{left} x_{left} - y_2$) and $r_{right}$ (by $D_{right} x_{right} - y_2$), the smallest value between $r_{left}$ and $r_{right}$ will point to the partition ($D_i$) of the dictionary that most contributed to the sparse representation of the sample and therefore the probable class of the new sample.

Multilayer perceptron

In 1958, Frank Rosenblatt proposed the perceptron model as the simplest form of an artificial neural network for binary classification. The perceptron consists of a single neuron with synaptic weights and adjustable bias, capable of classifying linearly separable problems⁶⁶. Multilayer Perceptron (MLP) is a generalization of the Rosenblatt perceptron and consists of several layers: input layer, one or more hidden layers, and an output layer. Having additional hidden layers allows MLPs to solve more complex problems^66,67.

The training of an MLP is carried out in a supervised manner and aims to adjust the synaptic weights, so that the output of the network approaches what is expected. To carry out this adjustment, the most used method is the error retropropagation algorithm, consisting of two phases. In the propagation phase, an output is obtained from an input. Then, in the backpropagation phase, the error is calculated using the obtained and desired outputs. Through the error obtained, it is possible to adjust the weights and iteratively minimize the⁶⁸ error. MLPs have been commonly used in medical applications, such as in the diagnosis of cancer^{69,70,71,72,73,74,75}, diabetes⁷⁶, multiple sclerosis⁷⁷, and in the monitoring and diagnosis of Covid-19^78,79,80.

Experiments settings

The scheme shown in Fig. 2 enabled a series of options for signal processing that served as a parallel input for the classifiers shown in Fig. 5. We started with a heuristic approach first and then made variations, combining other processing options as they proved promising. Table 1 lists the combinations chosen in this work. The 10-fold cross-validation was performed three times for each configuration evaluated, totaling 30 scores per setup.

Table 1 Groups of performed experiments.

Full size table

Considering the type of sample, the trial is the default sample for most setups (setup 5 classifies the trial but using a resampler mechanism to increase the base). In setup 4 the classified sample is the 1 s segment of the signal, instead of the trial. The segment does not compose the feature vector, as in the other configurations, therefore reducing the vector size (for exemple, dataset 1 reduce to 405 features = 27 channels $\times$ 5 components $\times$ 3 features), but increasing the base sample volume. Considering the type of decomposition, setups 1–5 apply EMD and select up to five IMFs to compose the feature vector.

Setup 6 is the only one that doesn’t use EMD. It uses optimized time-frequency to determine which set of filters and windows generate the best performance for a given subject^10,27,28,29. In this approach, we use a filter bank with 7 bands of 4 Hz length each (4–8 Hz, 8–12 Hz, 12–16 Hz, 16–20 Hz, 20–24 Hz, 24–28 Hz, 28–32 Hz). We segmented the signal into 1-s windows with 50% overlap in the useful seconds of each trial. For dataset 1 this means that there are 5103 features in all (27 channels $\times$ 7 bands $\times$ 5 s separated into 9 segments $\times$ 3 functions). For dataset 2 there will be 441 features (3 channels $\times$ 7 bands $\times$ 4 s separated into 7 segments $\times$ 3 functions). Due to the high volume of features it is very important to use an efficient feature selection method, we use the selection based on RandomForest parameterized with 100 trees.

In terms of channel and feature selection, setup 1 is the heuristic approach described in “Feature selection” section. It is the approach with the smallest feature vector (= 90) and therefore the least expensive. The remaining setups apply computational mechanism for feature selection. The PSO used in setup2 was configured for 30 particles, the k-NN (k = 4) served as an estimator for the objective function and we assigned the following values to the parameters: c$_1$ = 0.5, c$_2$ = 0.5, w = 0.9, k = 30, p = 2. Random Forests in turn were used with 100 trees. We can note that Random Forest was the most frequent method of feature selection used. This is due to its speed compared to PSO and therefore it is less expensive.

The feature vector produced by each setup was then evaluated in 3 classifiers: MLP, SRC and SRMLP (Fig. 2). MLP was trained with 50 neurons in the hidden layer, while OMP was the algorithm used in approaches that included sparse representation of the feature vector.

Metrics

We consider two metrics to evaluate the performance of the experiments: the accuracy and the Kappa index. Accuracy is the ratio between the number of correct predictions and the total number of classified samples. In other words, it is the probability of the classifier correctly indicating the class imagined by the patient or user. Its mean and standard deviation were useful in analyzing groups of experiments. The accuracy was calculated using the Eq. 4 below:

$$\begin{aligned} Accuracy = \frac{TP + TN}{TP + TN + FP + FN} \end{aligned}$$

(4)

where TP indicates true positives, TN indicates true negatives, FP indicates false positives and FN indicates false negatives.

The Kappa index, in turn, is a metric capable of dealing with multi-class problems, such as the one proposed here. It assesses the level of agreement between data sets, with the maximum value being 1, where values above 0.75 suggest excellent agreement and values between 0.40 and 0.75 a median agreement^81,82. The Kappa index can be calculated using the Equation 5 below:

$$\begin{aligned} k = \frac{\rho _o - \rho _e}{1 - \rho _e}, \end{aligned}$$

(5)

where $\rho _o$ is the observed agreement, or accuracy, $\rho _e$ is the expected agreement, defined in the Equation 6.

$$\begin{aligned} \rho _e = \frac{(TP + FP)(TP + FN)+(FN + TN)(FP + TN)}{(TP + FP + FN + TN)^2}. \end{aligned}$$

(6)

Results

Database 1

Table 2 summarizes the value of the metrics studied for each setup presented in the “Experiments settings” section. In setup2 that uses PSO Search, 1250 of 2025 features were selected (27 channels $\times$ 5 IMF $\times$ 5 time windows $\times$ 3 features), in setups 3 and 5 Random Forest selected 888 of 2025 features (27 channels $\times$ 5 IMF $\times$ 5 time windows $\times$ 3 features), finally in setup6, 2106 of 5103 features were selected (27 channels $\times$ 7 frequency bands $\times$ 9 time windows $\times$ 3 features) also by Random Forest, but using frequency bands instead of IMF. We can observe a pattern of equivalence between the compared classifiers. None of them explicitly excel in any of the metrics.

Table 2 Result of the mean and standard deviation of the metrics of the experiments.

Full size table

In Fig. 7 we can observe the distribution of accuracies presented in Table 2. Each boxplot has 30 accuracy values, derived from cross validation. Some of the configurations have high interquartile range, having high variance in the results. The smallest variances are observed in the classification of segments (setup 4), since there is a greater volume of data in this base, reducing the unpredictability of the classification. The same pattern can be seen in Fig. 8. However, the results are still far from being of practical use.

The low performance of the metrics suggests that none of the approaches can generalize from the base adopted in this work. However, the averages and deviation obtained are very similar and the high variance suggests a tendency to underfitting. Our hypothesis is that the amount of sample in the base is not enough to discriminate well between classes due to the high sample variance. In this sense, we decided to synthetically increase the dataset, seeking to reduce the variance of the database by using the SMOTE method⁸³, which try to preserve the statistical behavior of the data. The Tables 3 and 4 present the results obtained by increasing the base by 30% and doubling the base size, respectively. Data augmentation was not performed in setup 2, due to the high computational complexity of the PSO for large volumes of data. SMOTE was not applied in setup 5 either, as it already performs another data augmentation method, resampling.

Table 3 Result of the mean and standard deviation of the metrics of the experiments increasing the base by 30%.

Full size table

We observe in Table 3 that SRMLP achieved the best average accuracy ($60.72\%$) and a higher percentage increase over MLP ($12.22\%$). The same can be seen with regard to Kappa. SRMLP gets the highest value (0.41) and the highest percentage increase compared to MLP ($32.26\%$). The Mann Whitney test was performed to look for statistical difference for p value < 0.05 between the MLP approach and the best approach that used sparse representation and the following p values were obtained: setup1 = 0.40, setup3 = 4.82E−07, setup4 = 2.98E−11, setup6 = 8.41E−10 . When we increased the base by 100% (Table 4) we noticed the same evolution of the SRC and SRMLP in relation to the average accuracy, obtaining the highest value ($82.51\%$), meaning an increase of $14\%$ in relation to MLP. Kappa also represents a high percentage increase ($25.42\%$). Finally, SRC outperformed all other methods in setups 3 and 4, second only to the hybrid method (SRMLP) in setup1 (heuristic approach) and setup6 (with time-frequency optimization). The Mann Whitney test was performed and the following p values were obtained: setup1 = 0.97, setup3 = 9.78E−11, setup4 = 3.00E−11, setup6 = 2.88E−11.

Table 4 Result of the mean and standard deviation of the experiments metrics increasing the base by 100%.

Full size table

When we look at the boxplots of the accuracies (Figs. 9, 10), we see a proportional increase in intra-setup approaches. It means that all approaches react positively to the increase in the training set and SMOTE does not seem to privilege a specific method. Setups 3 and 6 appear a little above the others and have in common the selection of features through Random Forest. We also noticed that the more we increased the base size, the smaller the variance of the results, the better the precision of the methods. There is a similarity in the distribution of results between the methods that use some sparse representation. Both have a certain statistical equivalence. On the other hand, the difference between an MLP and the other approaches is remarkable. The distributions of the Kappa coefficients, when we increase the base (Figs. 11, 12), behave similarly to the accuracy distributions. Setups 3 and 6 perform better than the others in almost all classifiers. The variance decreases as the volume of data increases.

We can also observe the results from the perspective of computational complexity. Since the volume of data needs to grow to achieve good results, the more expensive it will be to process. Figures 13 and 14 show the distribution of the average training time for each model. The experiments were performed on a common computer (CPU with 2 cores at 2.50 GHz and 4 threads). In general, training times are similar and should have little impact on the choice of algorithm in practice. Except when it comes to the hybrid approach in setup 4 which is highly at variance with the others. If we look at the magnitude of the training time, we note that this setup must be chosen with care, as it takes more than 20 minutes (1200 s) to train a model. It is also important to note that the same setup SRC approach has equivalent performance with a much lower computational cost.

Finally, we can assess the computational complexity of classification time. The Figs. 15 and 16 present the distribution of the average classification time of 1 partition of the cross validation. That is, it is not about the time to classify a sample, but the test set. In this sense, we noticed a trend of cost increase also for the SRC in the classification of segments (setup4). However, in practice, segment classification should be intended for online applications, whose prediction is given one sample at a time. This can mitigate the effect of computational cost.

Database 2: BCI competition IV 2b

To improve classification accuracy we generate artificial training data based on real training samples. The data augmentation method is very similar to the one presented by³³ which shows the effectiveness of the method. Trials were segmented into three crops and randomly recombined between trials of the same class and subject, generating new samples. Then the decomposition method is applied (EMD or filtering by frequency bands, according to the experiment setup) and the components are also randomly recombined to generate new samples. This method was applied to triple the size of the training base.

In addition to the classifiers used in the experiments with the previous base (MLP, SRC and SR+MLP), Ramdom Forest (RForest) and SVM were also tested. These algorithms were also used in the classification of the space representation (SRR-Forest and SRSVM). Table 5 summarizes the results obtained for each of the 9 subjects in the base (S1–S9) in 3 setups. Setup1 describes the condition in which the experiment does not perform any selection of features, since the selection of C3, Cz and C4 is not necessary, since the base already consists of using the same channels. Setups 3 (with EMD) and 6 (with frequency bands) are presented here to allow again the comparative analysis between these techniques. In setup3, 48 out of 135 features were selected (3 channels $\times$ 5 IMF $\times$ 3 time windows $\times$ 3 features) and in setup6, 114 out of 441 features were selected (3 channels $\times$ 7 frequency bands $\times$ 7 time windows $\times$ 3 features). The Mann Whitney test was performed to look for statistical difference for p value < 0.05 between the best approach with a conventional classifier (Random Forest) and the best approach that used sparse representation. Following p values were obtained: S1setup1 = 1.93E−09, S1setup3 = 0.56, S1setup6 = 9.07E−08, S2setup1 = 0.84, S2setup3 = 1.83E−10, S2setup6 = 6.80E−11, S3setup1 = 2.70E−06, S3setup3 = 0.007, S3setup6 = 1.64E−10, S4setup1 = 2.66E−11, S4setup3 = 1.29E−11, S4setup6 = 1.14E−06, S5setup1 = 0.004, S5setup3 = 5.43E−11, S5setup6 = 2.04E−05, S6setup1 = 0,80, S6setup3 = 1.39E−06, S6setup6 = 7.86E−07, S7setup1 = 0.0004, S7setup3 = 0.0002, S7setup6 = 3.23E−06, S8setup1 = 0.04, S8setup3 = 2.65E−11, S8setup6 = 5.77E−08, S9setup1 = 3.13E−05, S9setup3 = 0,002, S9setup6 = 3.87E−07. Finally, Fig. 17 presents the distribution of accuracies for subject 3. The same behavior can be observed in the other subjects and can be deduced from the standard deviation values of Table 5.

Table 5 BCI CIV 2b—Result of the mean and standard deviation of the accuracies of the experiments.

Full size table

Table 6 considers the most accurate models obtained by the previous experiment and presents the inter-subject average of each setup. The best sparse model (Setup6 SRSVM) achieve an average of $95.43\%$ of average accuracy among the subjects in the base. The best conventional model (Setup6 R.Forest) reaches up to $98.33\%$ accuracy, a very close result between the approaches.

Table 6 BCI CIV 2b—Accuracy of the best models of each evaluated configuration.

Full size table

Discussion

Database 1

The dataset 1, although voluminous, is still not sufficiently representative for the problem of distinguishing three classes of motor imagery. The performance improvement obtained with SMOTE corroborates this thesis. However, the user cannot be expected to have so much time to obtain a significant volume of data. To have an estimate, each collection has 60 trials of 8 s, which is equivalent to 8 minutes per collection. The original base, without data augmentation, has 21 collections, a total of 168 minutes. However, we only saw a noticeable improvement in the data when we doubled the size of this database (Figs. 10, 12). It is unfeasible for a person to take that long to provide a massive amount of data. In this sense, the improvement of approaches that use the data augmentation mechanism is essential if we want to reduce the user’s reluctance to use BCI. However, the problem is not new. It is possible to observe in the literature the use of data augmentation to increase the size of the training base in EEG signals³³.

About the performance obtained in each setup. We can observe that in addition to adaptive methods (EMD and SRC), the selection of features greatly impacts the result. The heuristic approach (setup 1) is far from being the most promising, as it grew the least as the base increased (Figs. 9, 10, 11, 12). This signals that the intentional choice of brain regions (C3, C4 and Cz channels) and frequencies (IMF 3 and 4) is not the best choice. It is possible to affirm that motor imagery information may be better distributed in other regions of the motor cortex and in the frequency spectrum. This reinforces the need to use automatic and adaptive mechanisms for feature selection.

Setup 2 uses PSO to select features among all channels and frequencies. It proved to be very costly for feature selection, being deprecated in favor of selection by Random Forest. Setup 3 replaces PSO by selecting Random Forest. It was one of the settings that grew the most as the base increased (Figs. 10, 12). It also presented one of the best performances of the SRC and SRMLP, surpassing the conventional technique. We observe with it the first indication of the effectiveness of sparse representations for classifying motor imagery.

Setup 4 is similar to setup 3, but classifies segments (1s) instead of trials (8s). It is as close to an online rating approach as the user expects real-time feedback (equal to or less than 1 s). It also corroborates the superiority of the SRC and SRMLP in relation to the MLP, presenting a more notable difference. However, when looking at training and testing time, we must consider it carefully, as the larger the training base, the SR approaches become more costly. For this case, it is possible to think about optimizing the training dictionary to reduce its size, since the SR dictionary is the set of all training samples in the base. It is also an option to merge the prediction between the pure (faster) MLP model and an SR model. From the degree of certainty that the MLP classifies its samples, we can resort to an SR model to help classify the more difficult samples, as suggested in⁷. Setup 5 has a similar setup to setup 3, but increasing by 30% the base with resampling. It did not stand out in relation to the other methods (Figs. 7, 8). When it comes to data augmentation, we find it more promising to use SMOTE as it generates new samples close to the group of existing samples.

Setup 6 has a configuration similar to setup 3, but instead of using EMD, it decomposes the signal through a bank of frequency filters and overlap the windows in the time domain, thus seeking to detect the best frequency band and window of time for a subject. Similar to setup 3, the SRMLP classifier is superior to the MLP, but the performance of the SRC does not repeat this behavior. For this dataset features based on EMD perform better when associated with the SRC. However, other classifiers have a behavior very similar to that observed in setup 3, improving their performance in the same proportion as the database increases (Figs. 9, 10, 11, 12). In this sense, the use of EMD does not seem to have an impact on most results. It is possible that the similarity of the results is due to the similarity between the IMF and the filtered signal. Figure 18 illustrates a comparison between the models. It is possible to notice the similarity in the frequency of the waves between the two approaches. We can assume this is because brain signals do not have very high frequencies and oscillate in a similar way. EMD has little room to generate IMFs of varying frequencies.

We observed that the classifiers that use sparse representation have equivalent results among themselves, but outperform the conventional MLP model. The SRC and SRMLP reach an average accuracy of $75.95\%$ and $82.51\%$ respectively, while the MLP is $72.38\%$, representing a gain of between $4.93\%$ and $14\%$. Between themselves, the two models that use sparse representation have very similar accuracies, however when time-frequency optimization is used (setup 6) the difference is notable, as the SRC drops in performance. This does not make setup 6 superior to other approaches. It is important to note that the time-frequency optimization does not stand out among the others for this dataset. On the other hand, the use of EMD is also not superior in most cases. However, EMD does not influence negatively, there is room for improvement. Finally, the use of data augmentation proved to be important for obtaining relevant results.

Database 2: BCI competition IV 2b

The results obtained in the BCI Competition dataset 2b present a different behavior from the first dataset of this study. When comparing the approaches observing the feature processing techniques, it is possible to perceive a superiority of the filtering by frequency bands in relation to the features based on EMD. The average accuracy of the setup 6 models generally exceeds the accuracies of setups 1 and 3 (Table 5). Regarding the classification algorithm, we can also notice that the models that use sparse representation perform slightly inferior to the others in most subjects. Therefore, the model proposed by⁷ in the classification of EMG signals does not seem to be the best when working with EEG data. However, dictionary optimization techniques can help to improve the results, better adapting this approach to the context of motor imagery classification.

When we look at Table 6, the previous observations are even more evident, however this table still allows us to observe which of the tried configurations is the best in the general average. The Random Forest classification of features based on frequency bands (AVG setup6, column 3) is the best configuration among the others, with $98.33\%$ accuracy. It is worth noting that in setup6, the selection of features is also carried out by random forest, which explains the reason for the good performance of this combination.

We can then compare with the cited works that use the same data set. Table 7 presents the performance between the approaches⁹ uses SRC with and without dictionary optimization. However, it only experiences the 3rd collection of each subject, so the training and test data are similar, as they were collected on the same day and time. Therefore, the adaptive factor, where the imagery could vary on different days, cannot be evaluated in these results. The same is in²⁹, but it also uses time window and frequency band optimization. The other works, including this one, use the complete base³¹ gives only the average result of all subjects. Our study thus obtains a performance similar to the state of the art. We can feature this performance to the lack of optimization of the sparse dictionary, considering that its optimization proved to be efficient in increasing the accuracy⁹. The synthetic increase in the training base contributed to increase the accuracy of our study. It is interesting to note that it is the same data augmentation technique used by the study that had the best performance, but that uses another³³ classification method.

Table 7 Comparison between the accuracies of different studies for the BCI Competition IV 2b base.

Full size table

Conclusion

In this work, we proposed to evaluate the combination of two adaptive methods to the subject’s context, EMD and SRC, in the classification of three-limb motor imagery. Observing the results, it is possible to affirm that SRC is effective and promising in the detection of motor imagery and can be very useful in BCI applications, generating gains of approximately 14% compared to conventional classifier algorithms. The use of EMD in the extraction of features from EEG signals allowed obtaining results competitive with the state-of-the-art works^{12,13,14,15,16,43,44,45}, with the potential advantage of using an approach that does not fix the separation by frequency band: the IMFs are obtained dynamically, through adaptive filtering that does not fix cutoff frequencies, which is more suitable for non-stationary signals such as EEG signals. Additionally, the evaluation of other datasets can also be useful to better understand the impact of the studied techniques. It is noteworthy here that the problem brought up is multiclass and in the domain of motor imagery it is not a trivial problem, with great variations between databases.

The possibility of combining the methods with other techniques is presented as an option for future work. For example, using Wavelet Transform Decomposition in pre-processing could make the method more noise-resilient⁸⁴. Finally, if we want to decrease the cost and effort to use the BCI and still guarantee good performance we need: (1) to invest efforts in generating new samples to synthetically increase the training base; (2) enhance self-adaptive mechanisms that respond efficiently to the user’s context and serve as an enhancement for joint use with the SRC. It is important to realize that although the proposed method was shown to be comparable with other approaches in data set 2, we cannot rule out the evidence found in the first dataset. If, on the one hand, the BCI Competition dataset 2b contains more subjects (9 subjects vs 1 subject), on the other hand the first dataset has a greater amount of subject data (21 collections vs 5). This only suggests that it is necessary to evaluate approaches on a variety of bases in order to observe prevailing behavior and to evidence in which contexts one technique may be superior to another.

Data availibility

The first dataset analysed during the current study is available in the Biomedical-Computing-UFPE/Motor-Imagery GitHub repository, at the following URL: https://github.com/Biomedical-Computing-UFPE/Motor-Imagery. The second dataset analysed during this study, BCI Competition IV dataset 2b, is included in this published article and its supplementary information files:⁴⁹.

References

Wolpaw, J. R. & Wolpaw, E. W. Brain–computer interfaces: Something new under the sun. Brain–Comput. Interfaces: Principles Pract. 14, 3–12 (2012).
Google Scholar
Sreeja, S. R. & Himanshu Samanta, D. Distance-based weighted sparse representation to classify motor imagery EEG signals for BCI applications. Multim. Tools Appl. 79(19), 13775–13793 (2020).
Google Scholar
Fouad, M. M., Amin, K. M., El-Bendary, N. & Hassanien, A. E. Brain computer interface: A review. Brain–Comput. Interfaces 74, 3–30 (2015).
Google Scholar
Jiao, Y. et al. Sparse group representation model for motor imagery EEG classification. IEEE J. Biomed. Health Inform. 23(2), 631–641 (2018).
PubMed Google Scholar
Shin, Y., Lee, S., Lee, J. & Lee, H.-N. Sparse representation-based classification scheme for motor imagery-based brain–computer interface systems. J. Neural Eng. 9(5), 056002 (2012).
ADS PubMed Google Scholar
Ameri, R., Pouyan, A. & Abolghasemi, V. Projective dictionary pair learning for EEG signal classification in brain computer interface applications. Neurocomputing 218, 382–389 (2016).
Google Scholar
Betthauser, J. L. et al. Limb position tolerant pattern recognition for myoelectric prosthesis control with adaptive sparse representations from extreme learning. IEEE Trans. Biomed. Eng. 65(4), 770–778 (2017).
PubMed PubMed Central Google Scholar
Meng, M. et al. Sparse representation-based classification with two-dimensional dictionary optimization for motor imagery EEG pattern recognition. J. Neurosci. Methods 361, 109274 (2021).
PubMed Google Scholar
Miao, M., Wang, A. & Liu, F. A spatial-frequency-temporal optimized feature sparse representation-based classification method for motor imagery eeg pattern recognition. Med. Biol. Eng. Comput. 55(9), 1589–1603 (2017).
PubMed Google Scholar
Zhang, Y. et al. Temporally constrained sparse group spatial patterns for motor imagery BCI. IEEE Trans. Cybern. 49(9), 3322–3332 (2018).
PubMed Google Scholar
Xu, C. et al. Two-level multi-domain feature extraction on sparse representation for motor imagery classification. Biomed. Signal Process. Control 62, 102160 (2020).
Google Scholar
Alam, M. E. & Samanta, B. Empirical mode decomposition of EEG signals for brain computer interface. in SoutheastCon 2017 1–6 (IEEE, 2017).
Bashar, S. K. & Bhuiyan, M. I. H. Classification of motor imagery movements using multivariate empirical mode decomposition and short time Fourier transform based hybrid method. Eng. Sci. Technol. Int. J. 19(3), 1457–1464 (2016).
Google Scholar
Chen, W. & You, Y. Masking empirical mode decomposition-based hybrid features for recognition of motor imagery in EEG. in 2017 3rd IEEE International Conference on Control Science and Systems Engineering (ICCSSE) 548–551 (IEEE, 2017).
Davies, S. R. & James, C. J. Using empirical mode decomposition with spatio-temporal dynamics to classify single-trial motor imagery in BCI. in 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society 4631–4634 (IEEE, 2014).
Saha, S. K. & Ali, M. S. Data adaptive filtering approach to improve the classification accuracy of motor imagery for BCI. in 2016 9th International Conference on Electrical and Computer Engineering (ICECE) 247–250 (IEEE, 2016).
Oikonomou, V. P., Nikolopoulos, S. & Kompatsiaris, I. Motor imagery classification via clustered-group sparse representation. in 2019 IEEE 19th International Conference on Bioinformatics and Bioengineering (BIBE) 321–325 (IEEE, 2019).
Oikonomou, V. P., Nikolopoulos, S. & Kompatsiaris, I. Robust motor imagery classification using sparse representations and grouping structures. IEEE Access 8, 98572–98583 (2020).
Google Scholar
Peng, H., Lin, W., Cai, G., Huang, S., Pei, Y., & Ma, T. DW-FBCSP: EEG emotion recognition algorithm based on scale distance weighted optimization. in 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC) 430–433 (IEEE, 2021).
Ang, K. K., Chin, Z. Y., Zhang, H. & Guan, C.: Filter bank common spatial pattern (FBCSP) in brain–computer interface. in 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence) 2390–2397 (IEEE, 2008).
An, Y., Lam, H. K. & Ling, S. H. Multi-classification for EEG motor imagery signals using data evaluation-based auto-selected regularized FBCSP and convolutional neural network. Neural Comput. Appl. 1–27 (2023)
Schirrmeister, R. T. et al. Deep learning with convolutional neural networks for EEG decoding and visualization. Hum. Brain Mapp. 38(11), 5391–5420 (2017).
PubMed PubMed Central Google Scholar
Bentlemsan, M., Zemouri, E.-T., Bouchaffra, D., Yahya-Zoubir, B. & Ferroudji, K. Random forest and filter bank common spatial patterns for EEG-based motor imagery classification. in 2014 5th International Conference on Intelligent Systems, Modelling and Simulation 235–238 (IEEE, 2014).
Korik, A., Sosnik, R., Siddique, N. & Coyle, D. Decoding imagined 3d arm movement trajectories from EEG to control two virtual arms-a pilot study. Front. Neurorobot. 13, 94 (2019).
PubMed PubMed Central Google Scholar
Yang, B., Tang, J., Guan, C. & Li, B.: Motor imagery EEG recognition based on FBCSP and PCA. in Advances in Brain Inspired Cognitive Systems: 9th International Conference, BICS 2018, Xi’an, China, July 7-8, 2018, Proceedings 9 195–205 (Springer, 2018).
Bhatti, M. H. et al. Soft computing-based EEG classification by optimal feature selection and neural networks. IEEE Trans. Ind. Inf. 15(10), 5747–5754 (2019).
Google Scholar
Feng, J. et al. Towards correlation-based time window selection method for motor imagery BCIS. Neural Netw. 102, 87–95 (2018).
PubMed Google Scholar
Miao, Y. et al. Learning common time-frequency-spatial patterns for motor imagery classification. IEEE Trans. Neural Syst. Rehabil. Eng. 29, 699–707 (2021).
PubMed Google Scholar
Malan, N. S. & Sharma, S. Time window and frequency band optimization using regularized neighbourhood component analysis for multi-view motor imagery EEG classification. Biomed. Signal Process. Control 67, 102550 (2021).
Google Scholar
Zhang, K., Robinson, N., Lee, S.-W. & Guan, C. Adaptive transfer learning for EEG motor imagery classification with deep convolutional neural network. Neural Netw. 136, 1–10 (2021).
PubMed Google Scholar
Zhu, X. et al. Separated channel convolutional neural network to realize the training free motor imagery BCI systems. Biomed. Signal Process. Control 49, 396–403 (2019).
Google Scholar
Amin, S. U., Alsulaiman, M., Muhammad, G., Mekhtiche, M. A. & Hossain, M. S. Deep learning for EEG motor imagery classification based on multi-layer CNNS feature fusion. Future Gener. Comput. Syst. 101, 542–554 (2019).
Google Scholar
Dai, G., Zhou, J., Huang, J. & Wang, N. HS-CNN: a CNN with hybrid convolution scale for EEG motor imagery classification. J. Neural Eng. 17(1), 016025 (2020).
ADS PubMed Google Scholar
Zhao, X. et al. A multi-branch 3D convolutional neural network for EEG-based motor imagery classification. IEEE Trans. Neural Syst. Rehabil. Eng. 27(10), 2164–2177 (2019).
PubMed Google Scholar
Wright, J., Yang, A. Y., Ganesh, A., Sastry, S. S. & Ma, Y. Robust face recognition via sparse representation. IEEE Trans. Pattern Anal. Mach. Intell. 31(2), 210–227 (2008).
Google Scholar
Elad, M. Sparse and Redundant Representations: From Theory to Applications in Signal and Image Processing (Springer, 2010).
Wen, D., Jia, P., Lian, Q., Zhou, Y. & Lu, C. Review of sparse representation-based classification methods on EEG signal processing for epilepsy detection, brain–computer interface and cognitive impairment. Front. Aging Neurosci. 8, 172 (2016).
PubMed PubMed Central Google Scholar
Huang, G.-B., Zhu, Q.-Y. & Siew, C.-K. Extreme learning machine: Theory and applications. Neurocomputing 70(1–3), 489–501 (2006).
Google Scholar
Huang, G.-B., Wang, D. H. & Lan, Y. Extreme learning machines: A survey. Int. J. Mach. Learn. Cybern. 2(2), 107–122 (2011).
Google Scholar
Ding, S., Xu, X. & Nie, R. Extreme learning machine and its applications. Neural Comput. Appl. 25(3), 549–556 (2014).
ADS Google Scholar
Noble, W. S. What is a support vector machine?. Nat. Biotechnol. 24(12), 1565–1567 (2006).
PubMed CAS Google Scholar
Suthaharan, S. Support vector machine. in Machine Learning Models and Algorithms for Big Data Classification 207–235 (Springer, 2016).
Yusoff, M. Z. et al. Discrimination of four class simple limb motor imagery movements for brain–computer interface. Biomed. Signal Process. Control 44, 181–190 (2018).
Google Scholar
Alazrai, R., Aburub, S., Fallouh, F. & Daoud, M.I.: EEG-based BCI system for classifying motor imagery tasks of the same hand using empirical mode decomposition. in 2017 10th International Conference on Electrical and Electronics Engineering (ELECO) 615–619 (IEEE, 2017).
Alam, M. E. & Samanta, B.: Performance evaluation of empirical mode decomposition for EEG artifact removal. in ASME International Mechanical Engineering Congress and Exposition, vol. 58387, 04–05024 (American Society of Mechanical Engineers, 2017).
Gaur, P., Pachori, R. B., Wang, H. & Prasad, G. An empirical mode decomposition based filtering method for classification of motor-imagery EEG signals for enhancing brain–computer interface. in 2015 International Joint Conference on Neural Networks (IJCNN), 1–7 (IEEE, 2015).
Park, C., Looney, D., ur Rehman, N., Ahrabian, A. & Mandic, D. P. Classification of motor imagery BCI using multivariate empirical mode decomposition. IEEE Trans. Neural Syst. Rehabil. Eng. 21(1), 10–22 (2012).
PubMed Google Scholar
Meng, Q., Du, L., Chen, S. & Zhang, H.: Epileptic detection based on EMD and sparse representation in clinic EEG. in Advances in Neural Networks–ISNN 2018: 15th International Symposium on Neural Networks, ISNN 2018, Minsk, Belarus, June 25–28, 2018, Proceedings 15 842–849 (Springer, 2018).
Leeb, R., Brunner, C., Müller-Putz, G., Schlögl, A. & Pfurtscheller, G. BCI competition 2008–Graz data set B 1–6 (Graz University of Technology, 2008).
Kennedy, J. & Eberhart, R. Particle swarm optimization. in Proceedings of ICNN’95-International Conference on Neural Networks, Vol. 4, 1942–1948 (IEEE, 1995).
Eberhart, Y. S.: Particle swarm optimization: Developments, applications and resources. in Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546), Vol. 1, 81–861 (2001).
Poli, R., Kennedy, J. & Blackwell, T. Particle swarm optimization. Swarm Intell. 1(1), 33–57 (2007).
Wang, X., Yang, J., Teng, X., Xia, W. & Jensen, R. Feature selection based on rough sets and particle swarm optimization. Pattern Recogn. Lett. 28(4), 459–471 (2007).
ADS CAS Google Scholar
Shi, Y. & Eberhart, R. C. Empirical study of particle swarm optimization. in Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406), Vol. 3, 1945–19503 (1999).
van den Bergh, F. & Engelbrecht, A. P. A study of particle swarm optimization particle trajectories. Inf. Sci. 176(8), 937–971. https://doi.org/10.1016/j.ins.2005.02.003 (2006).
Article MathSciNet MATH Google Scholar
Jóhannesson, G. H. et al. Combined electronic structure and evolutionary search approach to materials design. Phys. Rev. Lett. 88, 255506. https://doi.org/10.1103/PhysRevLett.88.255506 (2002).
Article ADS PubMed CAS Google Scholar
Zhu, Q., Samanta, A., Li, B., Rudd, R. E. & Frolov, T. Predicting phase behavior of grain boundaries with evolutionary search and machine learning. Nat. Commun.https://doi.org/10.1038/s41467-018-02937-2 (2018).
Article PubMed PubMed Central Google Scholar
Feng, L., Ong, Y., Jiang, S. & Gupta, A. Autoencoding evolutionary search with learning across heterogeneous problems. IEEE Trans. Evol. Comput. 21(5), 760–772 (2017).
Google Scholar
Gomes, J. C., de Freitas Barbosa, V. A., de Santana, M. A., de Lima, C. L., Calado, R. B., Junior, C. R. B., de Almeida Albuquerque, J. E., de Souza, R. G., de Araujo, R. J. E., Moreno, G. M. M., et al. Rapid protocols to support covid-19 clinical diagnosis based on hematological parameters. medRxiv (2021).
Barbosa, V. A. et al. Reconstruction of electrical impedance tomography using fish school search, non-blind search, and genetic algorithm. Int. J. Swarm Intell. Res. (IJSIR) 8(2), 17–33 (2017).
Google Scholar
Ribeiro, R. R., Feitosa, A. R., de Souza, R. E. & dos Santos, W. P. Reconstruction of electrical impedance tomography images using genetic algorithms and non-blind search. in 2014 IEEE 11th International Symposium on Biomedical Imaging (ISBI), 153–156 (IEEE, 2014).
Feitosa, A. R., Ribeiro, R. R., Barbosa, V. A., de Souza, R. E. & dos Santos W. P.: Reconstruction of electrical impedance tomography images using particle swarm optimization, genetic algorithms and non-blind search. in 5th ISSNIP-IEEE Biosignals and Biorobotics Conference (2014): Biosignals and Robotics for Better and Safer Living (BRC) 1–6 (IEEE, 2014).
Sakri, S. B., Rashid, N. B. A. & Zain, Z. M. Particle swarm optimization feature selection for breast cancer recurrence prediction. IEEE Access 6, 29637–29647 (2018).
Google Scholar
Hasan, M. A. M., Nasser, M., Ahmad, S. & Molla, K. I. Feature selection for intrusion detection using random forest. J. Inf. Secur. 7(3), 129–140 (2016).
Google Scholar
Sylvester, E. V. et al. Applications of random forest feature selection for fine-scale genetic population assignment. Evol. Appl. 11(2), 153–165 (2018).
PubMed CAS Google Scholar
Haykin, S. Neural networks: Principles and practice. Bookman 11, 900 (2001).
Google Scholar
Ramchoun, H., Idrissi, M. A. J., Ghanou, Y. & Ettaouil, M. Multilayer perceptron: Architecture optimization and training. Int. J. Int. Multim. Artif. Intell. 4(1), 26–30 (2016).
Google Scholar
Kumar, K. S., Sasank, V., Praveen, K. R. & Rao, Y. K. Multilayer perceptron back propagation algorithm for predicting breast cancer. in Intelligent System Design 41–53 (Springer, 2021).
de Vasconcelos, J., dos Santos, W. & de Lima, R. Analysis of methods of classification of breast thermographic images to determine their viability in the early breast cancer detection. IEEE Lat. Am. Trans. 16(6), 1631–1637 (2018).
Google Scholar
Pereira, J. M. S., Santana, M. A., Lima, R. C. F. & Santos, W. P. Lesion detection in breast thermography using machine learning algorithms without previous segmentation. in Understanding a Cancer Diagnosis 1st edn (eds dos Santos, W. P. et al.) 81–94 (Nova Science, 2020).
Santana, M. A., Pereira, J. M. S., Lima, R. C. F. & Santos, W. P. Breast lesions classification in frontal thermographic images using intelligent systems and moments of haralick and zernike. in Understanding a Cancer Diagnosis 1st edn (eds dos Santos, W. P. et al.) 65–80 (Nova Science, 2020).
Pereira, J. M. S., Santana, M. A., Lima, R. C. F., Lima, S. M. L. & Santos, W. P. Method for classification of breast lesions in thermographic images using elm classifiers. in Understanding a Cancer Diagnosis 1st edn (eds dos Santos, W. P. et al.) 117–132 (Nova Science, 2020).
Pereira, J. M. S. et al. Dialectical optimization method as a feature selection tool for breast cancer diagnosis using thermographic images. in Understanding a Cancer Diagnosis 1st edn (eds dos Santos, W. P. et al.) 95–118 (Nova Science, 2020).
Santana, M. A. d., Pereira, J. M. S., Silva, F. L. d., Lima, N. M. d., Sousa, F. N. d., Arruda, G. M. S. d., Lima, R. d. C. F. d., Silva, W. W. A. d. & Santos, W. P. d. Breast cancer diagnosis based on mammary thermography and extreme learning machines. Res. Biomed. Eng. 34, 45–53 (2018).
Rodrigues, A. L. et al. Identification of mammary lesions in thermographic images: Feature selection study using genetic algorithms and particle swarm optimization. Res. Biomed. Eng. 35(3), 213–222 (2019).
Google Scholar
Mishra, S., Tripathy, H. K., Mallick, P. K., Bhoi, A. K. & Barsocchi, P. EAGA-MLP—An enhanced and adaptive hybrid classification model for diabetes diagnosis. Sensors 20(14), 4036 (2020).
ADS PubMed PubMed Central Google Scholar
Commowick, O. et al. Objective evaluation of multiple sclerosis lesion segmentation using a data management and processing infrastructure. Sci. Rep. 8(1), 1–17 (2018).
CAS Google Scholar
Ahsan, M. M., Alam, E. T., Trafalis, T. & Huebner, P. Deep MLP-CNN model using mixed-data to distinguish between Covid-19 and non-Covid-19 patients. Symmetry 12(9), 1526 (2020).
ADS CAS Google Scholar
Ibrahim, S., Kamaruddin, S. A., Mangshor, N. N. A. & Fadzil, A. F. A. Performance evaluation of multi-layer perceptron (MLP) and radial basis function (RBF): Covid-19 spread and death contributing factors. Int. J. Adv. Trends Comput. Sci. Eng. 9(1.4 Special Issue), 625–631 (2020).
Google Scholar
Borghi, P. H., Zakordonets, O. & Teixeira, J. P. A Covid-19 time series forecasting model based on MLP ANN. Procedia Comput. Sci. 181, 940–947 (2021).
PubMed PubMed Central Google Scholar
Perroca, M. G. & Gaidzinski, R. R. Avaliando a confiabilidade interavaliadores de um instrumento para classificação de pacientes: Coeficiente kappa. Rev. Esc. Enferm. U.S.P. 37, 72–80 (2003).
Google Scholar
De Mast, J. Agreement and kappa-type indices. Am. Stat. 61(2), 148–153 (2007).
MathSciNet Google Scholar
Chawla, N. V., Bowyer, K. W., Hall, L. O. & Kegelmeyer, W. P. Smote: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002).
MATH Google Scholar
Bajaj, N. Wavelets for EEG analysis. in Wavelet Theory (IntechOpen, 2020).

Download references

Acknowledgements

The authors are grateful to the Brazilian research agencies CAPES, FACEPE and CNPq, for the partial financial support of this research. The authors are also grateful to the Research Group on Biomedical Computing at UFPE, for the technological support; to the Research Group on Neurodynamics at UFPE, for providing the original dataset; and to Neurobots, for supporting this study.

Funding

This study was funded by the Brazilian research agencies CAPES, FACEPE and CNPq. The study was conducted in compliance with the guidelines of Resolution 466/2012 of the National Health Council and the Declaration of Helsinki of 1964 and approved by the Ethics Committee of the Health Sciences Center of Federal University of Pernambuco, CAAE number 79271517.2.0000.5208, Recife, Brazil. All participants signed the Informed Consent Term (ICT) prior to the start of the research. Patients were informed of the goals and procedures, as well as the risks and benefits, and could interrupt the assessments at any time, without the need for further explanation.

Author information

These authors contributed equally: José Antonio Alves de Menezes, Juliana Carneiro Gomes and Wellington Pinheiro dos Santos.

Authors and Affiliations

Escola Politécnica da Universidade de Pernambuco, Recife, Brazil
José Antonio Alves de Menezes, Juliana Carneiro Gomes & Wellington Pinheiro dos Santos
Neurobots Research and Development Ltd, Recife, Brazil
José Antonio Alves de Menezes, Vitor de Carvalho Hazin & Júlio César Sousa Dantas
Departamento de Fisiologia e Farmacologia, Universidade Federal de Pernambuco, Recife, Brazil
Marcelo Cairrão Araújo Rodrigues
Departamento de Engenharia Biomédica, Universidade Federal de Pernambuco, Recife, Brazil
Wellington Pinheiro dos Santos

Authors

José Antonio Alves de Menezes
View author publications
Search author on:PubMed Google Scholar
Juliana Carneiro Gomes
View author publications
Search author on:PubMed Google Scholar
Vitor de Carvalho Hazin
View author publications
Search author on:PubMed Google Scholar
Júlio César Sousa Dantas
View author publications
Search author on:PubMed Google Scholar
Marcelo Cairrão Araújo Rodrigues
View author publications
Search author on:PubMed Google Scholar
Wellington Pinheiro dos Santos
View author publications
Search author on:PubMed Google Scholar

Contributions

These authors contributed equally: J.A.A.d.M., J.C.G., V.d.C.H., J.C.S.D. These authors jointly supervised this work: M.C.A.R. and W.P.d.S.

Corresponding author

Correspondence to Wellington Pinheiro dos Santos.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

de Menezes, J.A.A., Gomes, J.C., de Carvalho Hazin, V. et al. Motor imagery classification using sparse representations: an exploratory study. Sci Rep 13, 15585 (2023). https://doi.org/10.1038/s41598-023-42790-y

Download citation

Received: 16 June 2022
Accepted: 14 September 2023
Published: 20 September 2023
Version of record: 20 September 2023
DOI: https://doi.org/10.1038/s41598-023-42790-y

This article is cited by

Early prediction of sepsis in intensive care units: A comparative analysis based on optimization techniques and committees
- André Luiz Vale de Araújo
- Flavio Secco Fonseca
- Wellington Pinheiros dos Santos
Research on Biomedical Engineering (2025)

Subjects

Abstract

Similar content being viewed by others

Adaptive EEG preprocessing to mitigate electrode shift variability for robust motor imagery classification

Motor imagery EEG signal classification using novel deep learning algorithm

A sparrow search algorithm-optimized LSTM framework with EMD denoising for rolling element bearing remaining useful life prediction

Introduction

Related works

Motor imagery based BCI

EMD and SRC MI-BCI

Materials and methods

Databases

Proposed classification methodology of EEG signals by sparse classifiers and empirical mode decomposition

Signal pre-processing and feature extraction

Empirical mode decomposition—EMD

Feature selection

Particle swarm optimization—PSO

Random forest for feature selection

Classification

Sparse representation classification

Multilayer perceptron

Experiments settings

Metrics

Results

Database 1

Database 2: BCI competition IV 2b

Discussion

Database 1

Database 2: BCI competition IV 2b

Conclusion

Data availibility

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Early prediction of sepsis in intensive care units: A comparative analysis based on optimization techniques and committees

Search

Quick links