Sentiment analysis for deepfake X posts using novel transfer learning based word embedding and hybrid LGR approach

Khalid, Madiha; Mushtaq, Muhammad Faheem; Akram, Urooj; Safran, Mejdl; Alfarhood, Sultan; Ashraf, Imran

doi:10.1038/s41598-025-10661-3

Article
Open access
Published: 03 August 2025

Scientific Reports volume 15, Article number: 28305 (2025)

Abstract

With the growth of social media, people are sharing more content than ever, including X posts that reflect a variety of emotions and opinions. AI-generated synthetic text, known as deepfake text, imitates human writing to disseminate misleading information and fake news. As deepfake technology continues to advance, it becomes harder to accurately understand people’s opinions on deepfake posts. Existing sentiment analysis algorithms frequently fail to capture the domain-specific, misleading, and context-sensitive characteristics of deepfake-related content. This study proposes a hybrid deep learning (DL) approach and a novel transfer learning (TL)-based feature extraction approach for sentiment analysis of deepfake posts. The TL-based approach draws on the strengths of the hybrid DL technique to capture both global and local contextual information. For validation, we compare the proposed approach with a range of machine learning (ML) algorithms as well as DL techniques. Different feature extraction techniques, such as bag of words (BOW), term frequency-inverse document frequency (TF-IDF), word embedding features, and novel TL features that combine the LSTM and DT, are used to build the models. The ML models undergo extensive hyperparameter tuning to enhance performance and efficiency. The sentiment analysis performance of each applied method is validated using k-fold cross-validation. The experimental results indicate that the proposed LGR (LSTM+GRU+RNN) approach with the novel TL features achieves 99% accuracy. The proposed approach helps detect and prevent the spread of deepfake content, keeping people and organizations safe from its negative effects. This study addresses a crucial gap in evaluating deepfake-specific social media sentiment by providing a comprehensive, scalable mechanism for monitoring and reducing the effect of fake content online.

Introduction

The digital era has introduced innovations that have changed how information is generated, distributed, and understood. Deepfake technology can realistically alter or synthesize audiovisual content, which has raised concerns about the spread of misinformation, the manipulation of public opinion, and the erosion of trust in digital media. Social media platforms bring people together, making it easy to share thoughts and opinions through photos, videos, audio, and text¹. With the rise of deepfake content on X (Twitter), it is important to understand how the public feels about these manipulations. Social media bots manage accounts and interact with content, including sharing and reacting to posts that may be real or fake². Various software tools can tailor data to user requirements, for example by editing videos with selected faces, swapping voices, and generating deepfake text. Such manipulations can cause serious problems on social media, such as financial losses and stress³.

Sentiment analysis is a technique that examines how people feel about a topic on X (Twitter) by analyzing the emotions in their posts. It focuses on identifying the opinions, evaluations, and attitudes people express through their posts⁴. These opinions significantly influence people’s perceptions of specific things, products, ideas, and personalities⁵. As social media continues to grow, people can freely share their thoughts and opinions on topics like deepfake technology. Deepfake text is used with malicious intent to spread false information, and its accuracy is difficult to verify before it spreads. Sentiment, whether positive or negative, strongly shapes how people’s opinions on X (Twitter) are perceived⁶. Positive sentiment in deepfake text increases the probability that the text is shared and believed, whereas negative sentiment may cause the text to spread with doubts attached. Furthermore, when deepfakes are used to manipulate people’s sentiments to spread misinformation, the negative impact may be strong.

Recent studies have focused on integrating deepfake technology, sentiment analysis, and the effects of social media. Bhukya et al.⁷ propose a deep learning (DL) model for detecting sarcasm, which faces limitations related to data diversity, model generalizability, and the inherent complexity of accurately identifying sarcasm across varied contexts. Sentiment analysis of ChatGPT tweets using Wolfram Mathematica is performed in⁸. That work is limited by dataset bias, the difficulty of reading sentiment in social media content, and the limited generalizability of transfer learning (TL) approaches. The study⁹ presents a TL fusion approach to improve sentiment analysis, especially when limited labeled data is available. The authors highlighted issues such as computational complexity, overfitting, and knowledge transfer across domains. Hate speech detection in low-resource languages using DL is addressed in¹⁰.

The authors encountered challenges such as a lack of data, biases in annotations, and the limited ability of models to work well across different languages and cultures. Despite major advances in sentiment analysis and deepfake detection, existing studies still face major challenges¹¹. Many rely on domain-specific datasets and classical models that do not accurately capture the rapidly changing nature of deepfake content. Many researchers work on transformer-based analysis using bidirectional encoder representations from transformers (BERT) and the robustly optimized BERT approach (RoBERTa), which can effectively capture the sentiments present in text. The study¹² introduced MisROBÆRTa, a transformer-based model for detecting misinformation. This suggests that hybrid transformer-based models can handle the complexity of fake content, but at increased computational cost. The study¹³ performs a comparative analysis of different machine learning and transformer-based approaches for text classification.

Audiovisual fake content is detected in¹⁴ using a Swiss transformer-based network. The results are tested on five different datasets, demonstrating better performance. While these investigations represent substantial progress, they also expose crucial limitations in existing research, such as static datasets that fail to capture public sentiment. Current models frequently struggle with the contextual and semantic complexities of social media language. These limitations highlight the need for an effective and flexible sentiment analysis framework tailored to the distinct language and semantic patterns of deepfake-related social media posts.

This study aims to create a new sentiment analysis framework that overcomes these limitations by utilizing advanced TL-based word embedding and a hybrid model that integrates long short-term memory (LSTM), gated recurrent unit (GRU), and recurrent neural network (RNN) architectures. Several machine learning (ML) and deep learning (DL) algorithms, including decision trees (DT), support vector machines (SVM), K nearest neighbor classifiers (KNC), logistic regression (LR), LSTM, GRU, RNN, and TL, are applied to perform sentiment analysis on deepfake posts. The proposed hybrid DL model, combined with a unique word embedding-based TL technique, aims to improve the classification of opinions expressed in tweets about deepfake technology into positive, negative, and neutral classes. By using dynamic data-gathering methods and continuously adapting the model to incorporate new information, this research aims to improve the accuracy and depth of our understanding of public opinion towards deepfakes on social media. This research offers the following key contributions:

This research presents a novel X (Twitter) dataset scraped using different keywords over the last six years via the Python library SNScrape. The dataset is preprocessed to perform experiments, while exploratory data analysis is conducted to gain deeper insights into deepfakes. The dataset is labeled using the TextBlob lexicon-based sentiment analysis technique, which assigns multiclass labels.
A novel word embedding-based TL technique is presented that integrates LSTM with DT, which effectively identifies people’s opinions on deepfake technology.
This research introduces a novel LGR technique to analyze people’s sentiments toward deepfake technology. The approach combines DL models like LSTM, GRU, and RNN. Additionally, various ML algorithms, including DT, SVM, KNC, and LR, along with DL models such as LSTM, GRU, and RNN, are used for evaluation.
The evaluation uses accuracy, precision, recall, F1-score, geometric mean, Cohen’s kappa score, receiver operating characteristic (ROC) accuracy score, and Brier score. The proposed approach is compared with many other state-of-the-art techniques. All models used in the study undergo hyperparameter tuning to enhance their performance and effectiveness. K-fold cross-validation is used to verify how well the models perform.
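
As an illustration of three of the less familiar metrics listed above, the following is a minimal sketch of Cohen's kappa, the geometric mean of per-class recall, and the multiclass Brier score, using only the Python standard library. The label names and toy data are hypothetical, and the exact metric variants used in the study are not stated in this section, so the definitions below are common textbook forms rather than the authors' implementation.

```python
import math
from collections import Counter

def cohen_kappa(y_true, y_pred):
    """Agreement between labels and predictions, corrected for chance agreement."""
    n = len(y_true)
    observed = sum(t == p for t, p in zip(y_true, y_pred)) / n
    true_counts, pred_counts = Counter(y_true), Counter(y_pred)
    expected = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    return (observed - expected) / (1 - expected)

def geometric_mean(y_true, y_pred):
    """Geometric mean of per-class recall (one common multiclass definition)."""
    recalls = []
    for c in sorted(set(y_true)):
        support = sum(t == c for t in y_true)
        hits = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        recalls.append(hits / support)
    return math.prod(recalls) ** (1 / len(recalls))

def brier_score(y_true, probs, classes):
    """Mean squared distance between each predicted probability vector
    and the one-hot encoding of the true label (summed over classes)."""
    total = 0.0
    for t, p in zip(y_true, probs):
        total += sum((p[i] - (1.0 if c == t else 0.0)) ** 2
                     for i, c in enumerate(classes))
    return total / len(y_true)

# Hypothetical toy labels for six posts.
y_true = ["positive", "positive", "negative", "neutral", "negative", "neutral"]
y_pred = ["positive", "negative", "negative", "neutral", "negative", "positive"]
kappa = cohen_kappa(y_true, y_pred)
gmean = geometric_mean(y_true, y_pred)
```

Lower Brier scores indicate better-calibrated probabilities, while kappa and the geometric mean are both sensitive to class imbalance in ways that plain accuracy is not.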

The novelty of the proposed approach lies in the structured integration of three models in a specific sequence that enhances temporal learning and improves accuracy. In the proposed architecture, the sequence first passes through an LSTM layer that handles long-term dependencies. The output is then processed by a GRU layer which improves efficiency, and finally, the RNN layer captures short-term dependencies and local sequential patterns. This layering mechanism is designed to maximize performance on complex, real-world text data and is not commonly employed in existing hybrid models. Additionally, the proposed architecture is applied to the specific domain of deepfake-related tweet sentiment analysis, which remains underexplored.
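
The layer ordering described above (LSTM, then GRU, then a simple RNN) can be sketched as a forward pass in plain Python. This is only an illustrative sketch under stated assumptions: the embedding size, hidden size, random untrained weights, three-class softmax head, and all function names are hypothetical, and the GRU update follows one of the two common sign conventions. It is not the authors' implementation.

```python
import math
import random

def matvec(W, x):
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def add(*vectors):
    return [sum(vals) for vals in zip(*vectors)]

def mul(a, b):
    return [x * y for x, y in zip(a, b)]

def sigmoid(v):
    return [1.0 / (1.0 + math.exp(-a)) for a in v]

def tanh(v):
    return [math.tanh(a) for a in v]

def init_gates(rng, n_gates, n_in, n_hid):
    """One (W, U, b) triple per gate, with small random weights."""
    def mat(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]
    return [(mat(n_hid, n_in), mat(n_hid, n_hid), [0.0] * n_hid) for _ in range(n_gates)]

def pre_act(gate, x, h):
    W, U, b = gate
    return add(matvec(W, x), matvec(U, h), b)

def lstm_layer(gates, xs, n_hid):
    h, c, outputs = [0.0] * n_hid, [0.0] * n_hid, []
    for x in xs:
        i = sigmoid(pre_act(gates[0], x, h))  # input gate
        f = sigmoid(pre_act(gates[1], x, h))  # forget gate
        o = sigmoid(pre_act(gates[2], x, h))  # output gate
        g = tanh(pre_act(gates[3], x, h))     # candidate cell state
        c = add(mul(f, c), mul(i, g))         # long-term memory update
        h = mul(o, tanh(c))
        outputs.append(h)
    return outputs                            # full sequence for the next layer

def gru_layer(gates, xs, n_hid):
    h, outputs = [0.0] * n_hid, []
    for x in xs:
        z = sigmoid(pre_act(gates[0], x, h))          # update gate
        r = sigmoid(pre_act(gates[1], x, h))          # reset gate
        cand = tanh(pre_act(gates[2], x, mul(r, h)))  # candidate state
        h = add(mul([1.0 - a for a in z], h), mul(z, cand))
        outputs.append(h)
    return outputs

def rnn_layer(gates, xs, n_hid):
    h = [0.0] * n_hid
    for x in xs:
        h = tanh(pre_act(gates[0], x, h))
    return h                                  # final state only

def softmax(v):
    m = max(v)
    exps = [math.exp(a - m) for a in v]
    total = sum(exps)
    return [e / total for e in exps]

def init_lgr(embed_dim=8, hidden=6, n_classes=3, seed=0):
    rng = random.Random(seed)
    W_out = [[rng.uniform(-0.1, 0.1) for _ in range(hidden)] for _ in range(n_classes)]
    return {"hidden": hidden,
            "lstm": init_gates(rng, 4, embed_dim, hidden),
            "gru": init_gates(rng, 3, hidden, hidden),
            "rnn": init_gates(rng, 1, hidden, hidden),
            "out": (W_out, [0.0] * n_classes)}

def lgr_forward(xs, params):
    seq = lstm_layer(params["lstm"], xs, params["hidden"])
    seq = gru_layer(params["gru"], seq, params["hidden"])
    last = rnn_layer(params["rnn"], seq, params["hidden"])
    W_out, b_out = params["out"]
    return softmax(add(matvec(W_out, last), b_out))

# A hypothetical post of 5 tokens, each an 8-dimensional embedding.
rng = random.Random(1)
tweet = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(5)]
probs = lgr_forward(tweet, init_lgr())
```

Here `probs` holds one probability per sentiment class. The first two layers return the whole sequence so that the next layer can process every time step, while the final RNN layer returns only its last state for classification.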

Section “Literature review” reviews the literature on deepfake tweet sentiment analysis. Section “Methodology” describes the methodology, including the dataset, data noise removal methods, ML, DL, and embedding-based transfer learning methods. Section “Results” presents the results and discussion. Section “Conclusion and future work” includes the conclusion and future research directions.

Literature review

In recent years, the advancement of deepfake technology has introduced many changes and challenges in the realm of sentiment analysis¹⁵. A comparative analysis of the challenges and constraints encountered in this domain is presented in Table 1. Public opinion on deepfake tweets is analyzed in¹⁶ using a dataset from X (Twitter). The authors employed BOW and TF-IDF for feature extraction, together with ML and DL models including extra tree classifier (ETC), gradient boosting machine (GBM), SVM, Gaussian Naive Bayes (GNB), adaptive boosting algorithm (ADA), LSTM, GRU, bidirectional LSTM (BiLSTM), and convolutional neural network (CNN) + LSTM. The BiLSTM model outperformed the others with an accuracy of 92%, highlighting the growing use of these techniques in sentiment analysis. Another study⁶ used the TweepFake dataset, which includes posts from both bots and humans. Features were extracted using BOW and TF-IDF techniques. It applied various DL techniques, including LSTM, RNN, and GPT-2, with the RoBERTa-based technique outperforming the others at 90% accuracy.
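
The BOW and TF-IDF features mentioned above can be illustrated with a small standard-library sketch. The example posts, the whitespace tokenizer, and the function names are hypothetical, and the weighting shown is the plain textbook form (term frequency times the log of inverse document frequency). Libraries typically add smoothing and normalization, so their values will differ.

```python
import math
from collections import Counter

def tokenize(text):
    # Naive lowercase whitespace tokenizer, assumed here for simplicity.
    return text.lower().split()

def bag_of_words(docs):
    """Return the sorted vocabulary and one raw count vector per document."""
    vocab = sorted({tok for doc in docs for tok in tokenize(doc)})
    rows = []
    for doc in docs:
        counts = Counter(tokenize(doc))
        rows.append([counts[t] for t in vocab])
    return vocab, rows

def tf_idf(docs):
    """Weight each count by how rare the term is across the collection."""
    vocab, counts = bag_of_words(docs)
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = [sum(1 for row in counts if row[j] > 0) for j in range(len(vocab))]
    idf = [math.log(n_docs / d) for d in df]
    rows = []
    for row in counts:
        total = sum(row) or 1  # guard against empty documents
        rows.append([(c / total) * w for c, w in zip(row, idf)])
    return vocab, rows

# Hypothetical example posts.
docs = [
    "deepfake video looks real",
    "deepfake text spreads fake news",
    "this video is fake",
]
vocab, bow = bag_of_words(docs)
_, weights = tf_idf(docs)
```

In this toy collection, a term that appears in only one post (such as "looks") receives a higher TF-IDF weight than a term shared across posts (such as "deepfake"), whereas BOW assigns both the same count.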

Fake news is detected on the Fake-NewsNet dataset, where SVM achieves the highest performance with 93% accuracy, while Naive Bayes (NB) and LSTM did not perform well¹⁷. Thuseethan et al.¹⁸ utilized DL methods to evaluate attitudes from text and image data collected from the web, adopting a multimodal strategy to improve the precision and comprehensiveness of the analysis. Deepfake post analysis is performed on Russia-Ukraine war posts on X (Twitter)¹⁹. Another study assesses various ML methods for distinguishing human-written text from text generated by ChatGPT, achieving 79% accuracy with a transformer-based model on datasets built from human writing and ChatGPT queries. The efficacy of its model is limited in identifying the emotions present in the text²⁰.

DL approaches are crucial for detecting deepfake content²¹. A publicly available image dataset from Kaggle is used to perform classification²¹. The proposed hybrid technique, VGG16 CNN, performs well, achieving a 94% accuracy score. Sentiment analysis of COVID-19 Arabic tweets, using a dataset from the cities of Riyadh, Jeddah, and Dammam in Saudi Arabia, is performed in²². DL techniques including BiLSTM and CNN are compared, and CNN performs best with 93% accuracy for sentiment analysis of Arabic tweets. GANBOT, a generative adversarial network framework for detecting social bots, is proposed in²³. The GANBOT-based technique achieves 95% accuracy, outperforming the previous contextual LSTM technique.

Understanding the differences between DL and ML techniques is key to accurately identifying fake and real content. How close ChatGPT comes to human experts is examined in²⁴. That work proposes the Human ChatGPT Comparison Corpus (HC3) dataset, analyzes the gaps between ChatGPT responses and those of human experts, and discusses future directions for large language models (LLMs). The results show that RoBERTa with the LR model gives the most promising results, with a 94% accuracy score. English posts from the PAN competition dataset are used to distinguish humans from bots²⁵. The BERT model achieves an 83% accuracy score among the applied techniques.

Fake news is spread by social bots on social media platforms. A set of 30,000 posts from the PAN-20 dataset is analyzed in²⁶, where the BiLSTM technique outperforms all other applied techniques. DL and lexicon-based methods are employed in²⁷ to analyze sentiments in COVID-19 tweets. That work covers data collection and model validation and addresses accuracy problems. It points out challenges in deciphering ambiguous information and potential biases in the models caused by the dataset’s peculiarities. The GRU technique achieves 93% accuracy on the COVID-19 dataset. Six publicly available datasets are used for sentiment analysis of tweets²⁸. The BERT model is combined with DL techniques such as LSTM, RNN, and CNN. This combination achieves 93% accuracy and also performs well on the other metrics.

Lexicon-based analysis for web spam detection uses two datasets: one of news articles obtained with a web scraper and the other from Kaggle²⁹. Different ML techniques, such as NB and random forest (RF), are used. The hybrid RCNN performs well, achieving 96% and 86% on the web-scraped and Kaggle datasets, respectively. Sentiment analysis of various product reviews is performed, where the LeBERT technique achieves an 88% score³⁰. Detection of deepfake content on social media using GPT-2 and Amazon reviews is also carried out. The generative model GPT-2 creates lengthy fake stories, which creates uncertainty in real-world scenarios involving multiple generative architectures³¹.

A 3D convolutional LSTM model is proposed by³² for anomaly detection in surveillance. Coarse-level feature fusion techniques are used to obtain better features, increase generalization, and avoid vanishing gradients. The model uses depth-wise feature stacking to reduce computational cost compared to typical CNN designs. It includes micro autoencoder blocks for downsampling and feature concatenation blocks for temporal consistency during upsampling. The authors of³³ proposed a light attention layered sequence model for detecting abnormalities in surveillance video. Another study³⁴ applies a 2D convolutional layer to each video frame and passes the result to an LSTM sequence model. An active learning approach is proposed in³⁵ to improve data annotation. The approach is based on deep learning and can learn better representations from smaller datasets. An improved sample selection approach helps the model train well using fewer samples. Experiments on various datasets show improved classification results.

The study³⁶ integrates emotion-cognitive reasoning with BERT to analyze sentiments in online opinions on emergencies. The model aims to supply auxiliary knowledge that enhances the performance of BERT by combining an emotion model with deep learning. To this end, the Ortony, Clore, and Collins model is used to build rules governing emotion-cognition. Experimental results indicate a best improvement of 1.74% over the BERT model. Danyal et al. adopted a hybrid model for sentiment analysis in³⁷. The model combines BERT and XLNet and is evaluated on the IMDB dataset. Results show improved accuracy over traditional models.

Existing research on sentiment analysis of deepfake posts includes different methods, such as the feature-based method employed in³⁸ and the graph-based method used in³⁹. The survey conducted by⁴⁰ delves into the construction and detection of deepfakes and points out the current limitations in this field. However, these studies concentrated on long news stories, raising concerns about their relevance to short social media communications. To address the challenges in detecting people’s opinions about deepfakes on social media, this study, conducted on a deepfake tweets dataset, aids research in identifying sentiments related to diverse instances of deepfake post text.

Table 1 Comparison of previous studies.
