Segmentation-enhanced approach for emotion detection from EEG signals using the fuzzy C-mean and SVM

Mahmood, Mahmood A.; Alsalem, Khalaf; Elbashir, Murtada K.; El-Ghany, Sameh Abd; El-Aziz, A. A. Abd

doi:10.1038/s41598-025-17220-w

Download PDF

Article
Open access
Published: 30 August 2025

Segmentation-enhanced approach for emotion detection from EEG signals using the fuzzy C-mean and SVM

Mahmood A. Mahmood¹,
Khalaf Alsalem¹,
Murtada K. Elbashir¹,
Sameh Abd El-Ghany¹ &
…
A. A. Abd El-Aziz¹

Scientific Reports volume 15, Article number: 31956 (2025) Cite this article

1750 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

The analysis of EEG signals for determining emotion is one of the most important topics in the field of artificial intelligence. It can be applied in a wide variety of areas, such as emotional health care and the man/machine interface. The purpose of the paper is the demonstration that emotions may be identified using EEG recordings in the hybrid approach based on the differentiated support vector machine (SVM) models with various types of kernel functions, as well as fuzzy C-means. The EEG signal of two subjects was recorded with the help of the Muse headband; the signal data was described as positive, neutral, or negative emotions. A Gauss kernel was the second-best outcome (95.78%), and a linear kernel was the best outcome (97.66%). Precision, recall, and F1-scores were used in establishing the performance of the SVM technique in emotion classification in conjunction with the fuzzy C-means classification approach. Besides covering the discussion on the importance of kernel choice in achieving good performance in SVM-based models, the analysis also showed that there was a potential to use EEG-based emotion detection. Moreover, one-way ANOVA statistical analysis has expressed that the linear kernel did perform significantly better as compared to other kernels (p < 0.05). To confirm that the proposed system would be rather robust, other deep learning models (CNN-LSTM hybrids) were designed and tested, the results of which proved that they had similar performance and at the same time less accurate results than the linear SVM. These results indicate the efficacy of SVM and the optimization of kernel parameters along with the integration of fuzzy logic in recognizing emotions based on EEG records.

Hybrid deep models for parallel feature extraction and enhanced emotion state classification

Article Open access 23 October 2024

Detecting emotions through EEG signals based on modified convolutional fuzzy neural network

Article Open access 06 May 2024

CNN-XGBoost fusion-based affective state recognition using EEG spectrogram image analysis

Article Open access 19 August 2022

Introduction

Emotional processes are thus a critically important part of how we achieve rationality in situations that demand inspection, decision-making, and execution. They are affiliated with many aspects of human feelings, sensing, and moving. Therefore, the development of paradigms for using emotional signals for identifying emotions has become a moderately large research topic that has also been instrumental in enhancing BCI applications for social and clinical purposes¹. Scientists have provided two major models for emotions: dimensional and categorical. Whereas fear, joy, or sadness can be easily described semantically, such representations frequently fail to support the expression of more complex emotions in other languages. Theoretical frameworks such as dimensional models, on the other hand, propose that emotions should be described with reference to several dimensions, of which the arousal-valence space is probably the most familiar. Engl-emotions. This model categorizes emotions according to their valence from negative to positive and according to their arousal level from low to high².

One of the freshest trends in artificial intelligence is still the field of ‘affective computing’, which addresses the creation of systems recognizing and reacting to people’s emotions. Evaluative feelings and appraisals underlie the commonplace experience, including its behavioral, communicative, and even cognitive aspects. Affect recognition, which is a subfield of affective computing, has significant importance, especially in HCIs. A practical application of emotion recognition is neuromarketing, which involves examining or predicting the emotional tendencies of consumers to enhance advertisements, such as appraising responses to music³.

The present development of artificial intelligence and emotion identification has enriched people’s understanding of communication, decision-making, and intelligent systems. The research presented in Khare et al.⁴ provides a comprehensive and structured review of the approaches to emotion recognition published over the last 10 years of research in the field of physiological and physical signals. Gestures and voice are categorized under physical signals, while physiological signals include EEG, ECG, and eye tracking.

In Domínguez-Jiménez et al.⁵, the authors proposed a model with physiological data gathered from the wearable devices of participants to identify three emotions—neutral, sad, and humor—when the participants were watching emotion-inducing clips. Facial videos were recorded from 37 participants along with galvanic skin response (GSR) signals to introduce optical imaging and implement a support vector machine (SVM) model with 100% accuracy. Both time and frequency analyses were performed to evaluate the signal quality. Along the same line of thinking³, developed an emotion identification system using a valence/arousal model on an EEG signal that was preprocessed using DWT and ofter into the gamma, beta, alpha, and theta bands. Spectral features were taken from each of the bands and then normalized, reduced by PCA, and introduced to ANNs, KNNs, and SVM classifiers.

Further information focusing on the analysis of the EEG signal was given in Jafari et al.⁶, where works outlining the difficulties of emotion recognition using EEG data were considered. For recommendation, the study proposed that deep learning methods can be applied even more in terms of emotion recognition. For example, in Li et al.⁷, a multimodal classification framework was proposed for issues concerning emotion identification for EEG and electromyography (EMG) biosignals. Based on these signals, differential entropy features were introduced, and the classification was improved with the help of a multimodal long-short-term memory (LSTM) network that incorporated spatial and temporal profiles.

In Ahmad and Khan⁸, the authors proposed a fast and robust multimodal emotion recognition system that included signal heterogeneity and interpersonal variability by using spatial and temporal characteristic EEG and MEG signals. The users proposed their neural network model that took differential entropy from such signals and improved the recognition accuracy compared to single-modal signals, providing 95.89% and 94.99% arousal and valence detection, respectively, using the DEAP database.

The recent advances in artificial intelligence, especially deep learning, have had a great impact on medical diagnostics and on affective computing, respectively. Neural networks like Convolutional Neural Networks (CNNs), Long Short-Term Memory networks (LSTMs), and transformer models provide an effective method of deriving high-level representations out of raw EEG signals and do not need hand-crafted features. These models are so ideal in showing spatial patterns and the temporal patterns in biomedical signals. As a case in point, CNN-LSTM hybrids have now achieved impressive results in the presence of common neurological conditions, such as the detection of Parkinson disease in EEG signals, returning high accuracy and interpretability via explainable AI systems⁹. On a similar note, transformer models have also shown meaningful potential in early detection of schizophrenia through the use of attention mechanisms to identify meaningful parts of the EEG time series and thereby perform better than basic classifiers^10,11. Deep learning highly accurate models might need massive datasets and significant computational resources, even though they can discover complex arrangements. Alternatively, methods of machine learning such as SVMs have greater computational efficiency and explainability but lack the power of high-dimensional EEG features unless they are preprocessed with either dimensionality reduction or feature selection. Combining any of the above paradigms, such as deep feature extraction and robust/interpretable classification, by an ensemble or a hybrid strategy as suggested in our work tends to utilize the best of the two worlds on emotion recognition based on EEG signals¹².

In this study, we developed the following major research questions: (1) Is it possible to utilize a hybrid framework consisting of fuzzy C-means (FCM) clustering techniques coupled with support vector machines (SVMs) to enhance the performance of EEG-based emotion recognition considerably better than standalone classifiers? (2) What is the role of kernel selection of the SVM framework as it relates to the effect of classification accuracy across the various emotional states pronounced by the EEG signals? Is a linear kernel applied with fuzzy preprocessing more valid than more nonlinear types of kernels like circular, polygonal, or sigmoid in defining emotional states with noisy, low-sample EEG data? The theoretical as well as practical usages of affective computing in relation to educational practice can be unveiled with the help of these questions, which support our methodological design and evaluation of the performance.

The main contributions of this study are that we used top-notch machine learning techniques to decode EEG data and examine how different emotions affect EEG signals relative to different stimuli. By analyzing these alterations, we seek to enhance the accuracy of recognizing and forecasting emotional states. To this end, we develop and study a computationally superior classification model for FCM ensembles integrated with support vector machines (SVMs). This approach continues to accelerate the final fine-tuning of the feature extraction process; that is, emotion recognition is executed with the highest possible precision.

Moreover, Fuzzy C-Means (FCM) clustering was selected in this study because of the very features of this clustering method to deal with uncertainty and overlapping boundaries in emotion-labeled EEG data, an asset that is especially appropriate in the imprecise and nonlinear behavior of brain signals. In contrast to other bio-inspired algorithms, e.g., genetic algorithms, particle swarm optimization, and ant colony optimization, which are commonly computationally demanding and are only useful in global optimization problems, FCM directly and interpretably offers a soft clustering process. Instead of solid labels, it awards membership values and is therefore more appropriate to track the in-between states and multi-dimensional complexion prevalent in emotional reactions. Further, FCM can be very effectively applied in the hybrid model pipeline employed in this study, which is noted to have efficient convergence and great compatibility with subsequent learning, such as support vector machines (SVMs). Although other algorithms might be investigated in the future, FCM is a decent balance of the cost of computation and the ease of fuzzy emotional boundary adaptation and compatibility with the classification methods framework.

The initial hypothesis of this study was that a hybrid model combining fuzzy C-means clustering in a hybrid model of support vector machine classifiers will help to produce better and more significant accuracy and robustness of the emotion classification using EEG signals over the traditional models. The hypothesis that, based on the principle of reducing ambiguity that brainwave activity contains, fuzzy C-means offers a more detailed account of the varying states of emotions, whereas SVM, most notably, using linear and Gaussian kernels, would be appropriately able to produce adequately defined decision boundaries to classify such a complex signal. The associated hypothesis comes down to the notion that the recognition of emotion entails overlapping patterns in terms of EEG signals that can be better addressed by the corresponding models that can address fuzzy membership and nonlinear separation. Hence, a combination of these approaches was likely to deliver better performance, generalization, and clinical relevance in emotion recognition activities.

The remainder of the paper is structured as follows: “Literature review” section provides the literature review, “Materials and methods” section provides the method and material of the proposed approach, and “Results and discussion” section provides the implementation/analysis of the proposed approach. “Limitations of the study” section discusses the findings and implications of the model analysis, while “Conclusion” section presents the conclusion.

Literature review

In Samal and Hashmi¹³, the author proposed the idea of identifying emotion from multichannel EEG signals using MEEMD on the DEAP dataset, including time, frequency, and nonlinear features, in the calculation while using an ensemble tree classifier. In Hamzah and Abdalla¹⁴, the authors focused on deep learning networks in the area of EEG signal classification, paying attention to their potential for the automatic extraction of rich hierarchical representations from raw EEG signal data. This research will also serve as a guide for modern deep learning for different preprocessing techniques, signal representations, and network model architecture. Furthermore, the paper covers limitations detected during experiments, including variability of brain structure, electrode positioning and device positioning, all of which hinder modeling across devices and time sessions. In Xiaohu et al.¹⁵, the authors provide a review of recent related studies that use deep learning for EEG-based emotion recognition and demonstrated that this approach is capable of feature learning and classification. This paper addresses deep learning paradigms and datasets used in affective computing; issues regarding EEG-based emotion recognition; and new research ideas.

The authors in Hamzah and Abdalla¹⁶ focused on how emotions are detected and experienced in a virtual environment using EEG signals while bearing in mind that a real-time response system is fundamental. This paper focuses on the computational rate and user engagement in virtual environments and ways of dealing with awareness of emotions. The Tetromino feature generation function-based game was proposed in Tuncer et al.¹⁷ and is an idea of a new emotion classification system. The system extracts EEG channel features, and then, using their mutual relevance to the rest of the features, the most relevant characteristics are selected by the mRMR method for the classification of emotions. Then, a linear SVM is used for the final emotion classification, where the classification is made by a majority vote. In Xu et al.¹⁸, the authors found and compared seven approaches to channel selection for emotion recognition based on the DEAP dataset. EEG data are further partitioned into gamma, beta, alpha, and theta bands using the discrete wavelet transform (DWT) technique, and entropy and energy features are computed for each of these bands. Three approaches for channel selection, direct selection, mRMR, and experiential approaches, are compared to ELM for classification into seven emotions.

To our knowledge, multichannel EEG-based emotion classification using the TQWT and HCRNN was introduced in Zhong et al.¹⁹ with a spatiotemporal analysis of the proposed approach. The TQWT expands the EEG signal and obtains different subbands, and from these subbands, the mean absolute value and differential entropy features are extracted and converted to TFBS and subsequently used in deep model training. HCRNN, the combination of CNN and LSTM, learns both spatial and temporal features from TFBS for the classification of positive, neutral, and negative emotions, with impressive performance on the SEED dataset for emotions. A meta-analysis conducted in Yu et al.²⁰ A showed that N2 and P3 amplitudes are valid indices of inhibitory control abilities in IGD patients. Such findings are useful for better understanding the underlying neural substrates for behavioral inhibition disorders in IGD patients and for clinical application in early diagnosis and intervention.

In Fernandes et al.²¹, the authors compare simple classifiers with those of deep learning techniques and perform an analysis of emotions via EEG data. This work also presents a novel contribution by presenting a detailed comparative analysis of a wide range of deep learning and machine learning (DL/ML) methods in one study while also extending the literature on emotion recognition from EEG data using graph convolutional neural networks (GCNNs). These findings provide significant direction for constructing the sector of emotional neuroscience because they help elucidate the connection between affective states and neural activity. In Lim and Teo²², using EEG data, Lim and Teo formulated a game-induced emotion recognition method that utilizes an interpretable ruleset-based classifier. This method is innovative and exhaustively overcomes flaws in previous studies concerning emotion detection during video game interactions and indicates a very high level of accuracy in identifying participants’ emotions, including gender differences. Even though ruleset-based classifiers are slower during training, their advantages for modeling real-world applications can be quite substantial since they assist physicians in tracking changes in patients’ emotions with the help of parameters indicating different stages of such an emotion as well as shedding light on the connection between EEG parameters and emotions themselves.

Table 1 presents several studies have brought forward new deep learning architectures that were used to identify emotions based on EEG. Jinfeng et al.²³ proposed Fourier Adjacency Transformer (FAT) that first presented a + 6.5 profit over the then-state-of-the-art methods on the DEAP and SEED datasets. Teng et al. used 2D CNN-LSTM with the accuracy of 91.92 stability and 92.31 arousal on the matrices of different entropy values/differential entropy matrices. Caifeng et al.²⁵ integrated Transformer models with CNNs to achieve a high value of the structural similarity (SSIM) index of 0.98 in three datasets. Yue et al.²⁷ came up with a multi-scale residual BiLSTM with accuracies of 97.88% and 96.85% on DEAP, binary, and quadrantal classification. Liu et al.²⁷ presented a Transformer-based explainable ERTNet that impressed 73.31% and 80.99% in valence and arousal. The Model Echo State Network (MESN) presented by Yang et al.²⁸ attained accuracy of 65.3%, 62.5%, and 70% on valence, arousal, and stress/calm state, respectively. The critical point is that Shen et al. developed DAEST, an attention-based dynamic model that showed satisfactory results on SEED-V, 8SEED, and FACED datasets with a maximum of 88.1% accuracy in 3-class classification. Pan et al.³⁰ introduced a Dual Attentive Transformer model that generalized between the publicly released and privately released datasets with 64.43–85.27% accuracy between various types of class settings. Feng et al.³¹ used a CNN-Bi-LSTM with attention models on the Weibo COV V2 dataset to obtain a binary classification accuracy of 89.14%. Finally, there was Bagherzadeh et al.¹³, who presented an ensemble model with their best result on DEAP (98.76%) and MAHNOB-HCI (98.86%), demonstrating the effectiveness of ensemble learning in emotion recognition using EEG.

Table 1 Summarize state-of-the-arts in 2024–2025.

Subjects

Abstract

Similar content being viewed by others

Hybrid deep models for parallel feature extraction and enhanced emotion state classification

Detecting emotions through EEG signals based on modified convolutional fuzzy neural network

CNN-XGBoost fusion-based affective state recognition using EEG spectrogram image analysis

Introduction

Literature review

Materials and methods

Fuzzy C-means

Support vector machine

Linear kernel

Polynomial kernel

Gaussian kernel

Sigmoid kernel

Random Forest

Long short-term memory

Convolutional neural networks

Proposed approach

Preprocessing layer

Segmentation layer

Classification layer

Evaluation layer

Hyperparameters values of hybrid model

Results and discussion

Dataset description

Results

Experiemnt 1: hybrid CNN-LSTM model

Experiment 2: CNN-LSTM with Random Forest

Experiment 3: hybrid CNN-LSTM model with fuzzy C-means

Experiment 4: linear SVM with fuzzy C-means

Experiment 5: polynomial SVM with fuzzy C-means

Experiment 6: Gaussian SVM with fuzzy C-means

Experiment 7: sigmoid SVM with fuzzy C-means

Discussion

Limitations of the study

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links