Introduction

Aspect-Based Sentiment Analysis (ABSA) represents a granular approach to parsing sentiment in text, focusing on the specific aspects or features discussed and the sentiment directed towards them1,2,3,4. It begins with Aspect Term Extraction (ATE), which identifies the nouns or phrases that represent the focal points of sentiment within the text5,6,7. Opinion Term Extraction (OTE) then locates the adjectives or adverbs that express feelings or attitudes towards these aspects8,9,10. Moving beyond identification, Aspect-Level Sentiment Classification (ALSC) categorizes the sentiment towards each aspect as positive, negative, or neutral11,12,13. Aspect-Oriented Opinion Extraction (AOE) then associates these sentiments with the corresponding aspects14,15, while Aspect Extraction and Sentiment Classification (AESC) combines the processes of ATE and ALSC for efficiency16. Aspect-Opinion Pair Extraction (AOPE) pairs each aspect with its qualifying opinion, and the most integrative task, Aspect Sentiment Triplet Extraction (ASTE), combines aspects, opinions, and sentiments into a comprehensive triplet for each aspect mentioned17,18,19,20. Figure 1 presents an example sentence annotated with universal dependencies and part-of-speech tags, while Table 1 displays the outcomes of the various sub-tasks of sentiment analysis for this particular review.

Figure 1

Universal dependency and part-of-speech tagging for the given example.

ABSA intricately mines text data to identify sentiments toward the specific aspects mentioned within21,22. In evaluating a smartphone review such as “The camera delivers stunning images, but the battery life is quite disappointing,” ABSA performs a series of sophisticated sub-tasks: ATE identifies the features “camera” and “battery life” under scrutiny; OTE captures the corresponding evaluative terms “stunning” and “disappointing”; ALSC assigns sentiments, labeling the camera’s as positive and the battery’s as negative. AOE links these sentiments to their respective aspects, crafting a direct association between “stunning” and “camera” and between “disappointing” and “battery life”. AESC streamlines the process by tagging “camera” with a positive sentiment and “battery life” with a negative sentiment in one step. AOPE then pairs aspects with their opinions, forming the pairs (“camera”, “stunning”) and (“battery life”, “disappointing”), which is critical for pinpointing precise consumer attitudes. Finally, ASTE integrates these elements, producing a comprehensive sentiment overview with the triplets (“camera”, “stunning”, positive) and (“battery life”, “disappointing”, negative), offering granular insight into the multifaceted nature of consumer feedback22,23,24,25,26.

Table 1 ABSA sub-tasks and their outputs for a given example.

The implementation of ABSA is fraught with challenges that stem from the complexity and nuances of human language27,28. One significant hurdle is the inherent ambiguity in sentiment expression, where the same term can convey different sentiments in different contexts. Moreover, sarcasm and irony pose additional difficulties, as they often invert the literal sentiment of terms, requiring sophisticated detection techniques to interpret correctly29. Another challenge is co-reference resolution, where pronouns and other referring expressions must be accurately linked to the correct aspects to maintain sentiment coherence30,31. Additionally, the detection of implicit aspects, where sentiments are expressed without explicitly mentioning the aspect, necessitates a deep understanding of implied meanings within the text. Furthermore, multilingual and cross-domain ABSA require models that can transfer knowledge and adapt to various languages and domains, given that sentiment indicators and aspect expressions can vary significantly across cultural and topical boundaries32,33,34,35. The continuous evolution of language, especially with the advent of internet slang and new lexicons in online communication, calls for adaptive models that can learn and evolve with language use over time. These challenges necessitate ongoing research and development of more sophisticated ABSA models that can navigate the intricacies of sentiment analysis with greater accuracy and contextual sensitivity.

To effectively navigate the complex landscape of ABSA, the field has increasingly relied on the advanced capabilities of deep learning. Neural sequential models like Long Short-Term Memory (LSTM) and Gated Recurrent Units (GRU) have set the stage by adeptly capturing the semantics of textual reviews36,37,38. These models contextualize the sequence of words, identifying the sentiment-bearing elements within. The Transformer architecture, with its innovative self-attention mechanisms, along with Embeddings from Language Models (ELMo), has further refined the semantic interpretation of texts39,40,41. These advancements have provided richer, more nuanced semantic insights that significantly enhance sentiment analysis. Nevertheless, challenges remain when dealing with the complex syntactic relationships inherent in language: the connections between aspect terms, opinion expressions, and sentiment polarities42,43,44. To bridge this gap, tree-structured models such as Tree-LSTM and Graph Convolutional Networks (GCN) have emerged, integrating syntactic tree structures into their learning frameworks45,46. This incorporation has led to a more granular analysis that combines semantic depth with syntactic precision, allowing for more accurate sentiment interpretation in complex sentence constructions. Furthermore, integrating external syntactic knowledge into these models has been shown to add another layer of understanding, enhancing model performance and leading to a more sophisticated sentiment analysis process.

In our approach to ABSA, we introduce an advanced model that incorporates a biaffine attention mechanism to determine the relationship probabilities among words within sentences. This mechanism generates a multi-dimensional vector where each dimension corresponds to a specific type of relationship, effectively forming a relation adjacency tensor for the sentence. To accurately capture the intricate connections within the text, our model converts sentences into a multi-channel graph. This graph treats words as nodes and the elements of the relation adjacency tensor as edges, thereby mapping the complex network of word relationships. Our model stands out by integrating a wide array of linguistic features. These include lexical and syntactic information such as part-of-speech tags, types of syntactic dependencies, tree-based distances, and relative positions between pairs of words. Each set of features is transformed into edges within the multi-channel graph, substantially enriching the model’s linguistic comprehension. This comprehensive integration of linguistic features is novel in the context of the ABSA task, particularly in the ASTE task, where such an approach has seldom been applied. Additionally, we implement a refining strategy that utilizes the outcomes of aspect and opinion extractions to enhance the representation of word pairs. This strategy allows for a more precise determination of whether word pairs correspond to aspect-opinion relationships within the context of the sentence. Overall, our model is adept at navigating all seven sub-tasks of ABSA, showcasing its versatility and depth in understanding and analyzing sentiment at a granular level.

  • We present an advanced neural network architecture that addresses every sub-task associated with Aspect-Based Sentiment Analysis (ABSA). This model establishes a new benchmark for the integration of syntactic and semantic data. It markedly improves the accuracy in detecting aspect and opinion terms along with their corresponding sentiment classifications.

  • Our research integrates an extensive range of linguistic features, such as syntactic dependencies and part-of-speech patterns, into the ABSA framework. This integration substantially enhances the model’s ability to capture the nuances of language, leading to improved sentiment analysis accuracy.

  • We have crafted a novel refining strategy that utilizes the initial results of aspect and opinion extractions. This strategy refines the representation of word pairs, sharpening the alignment between aspects and their corresponding opinions. This step is vital for the precise detection of sentiment orientations and intensities, which lie at the heart of ABSA.

The remainder of this paper is organized as follows: Sect. Related work discusses the relevant literature and prior work in the domain. Sect. Proposed framework details our proposed methodology, encompassing the techniques and algorithms we employed. Sect. Experiments showcases the experimental setup and results, and Sect. Model analysis presents the evaluation, including an ablation study. Finally, Sect. Conclusion concludes the paper, summarizing our contributions and suggesting potential avenues for future research.

Related work

In this section, we explore the landscape of Aspect-Based Sentiment Analysis research, covering both individual tasks and integrated sub-tasks. We begin with early research that highlights the application of graph neural network models in ABSA. This is followed by an examination of studies that leverage attention mechanisms and pre-trained language models, showcasing their impact and evolution in the field.

Aspect based sentiment analysis and its subtasks

The field of ABSA has garnered significant attention over the past ten years, paralleling the rise of e-commerce platforms. Xue and Li present a streamlined convolutional neural network model with gating mechanisms for ABSA, offering improved accuracy and efficiency over traditional LSTM and attention-based methods, particularly in aspect-category and aspect-term sentiment analysis47. Ma et al. enhance ABSA by integrating commonsense knowledge into an LSTM with a hierarchical attention mechanism, leading to a novel ’Sentic LSTM’ that outperforms existing models in targeted sentiment tasks48. Yu et al. propose a multi-task learning framework, the Multiplex Interaction Network (MIN), for ABSA, emphasizing the importance of ATE and OTE. Their approach, which adeptly handles interactions among subtasks, showcases flexibility and robustness, especially in scenarios where certain subtasks are missing, and their model’s proficiency in both ATE and OTE stands out in extensive benchmark testing49. Dai et al. demonstrate that fine-tuned RoBERTa (FT-RoBERTa) models, with their intrinsic understanding of sentiment-word relationships, can enhance ABSA and achieve state-of-the-art results across multiple languages50. Chen et al. propose a Hierarchical Interactive Network (HI-ASA) for joint aspect-sentiment analysis, which excels in capturing the interplay between aspect extraction and sentiment classification. This method, integrating a cross-stitch mechanism for feature blending and mutual information for output constraint, showcases the effectiveness of interactive tasks, particularly in Aspect Extraction and Sentiment Classification (AESC)51. Zhao et al. address the challenge of extracting aspect-opinion pairs in ABSA by introducing an end-to-end Pair-wise Aspect and Opinion Terms Extraction (PAOTE) method. This approach diverges from traditional sequence tagging by considering the task through the lens of joint term and relation extraction, utilizing a multi-task learning framework that supervises term extraction via span boundaries while concurrently identifying pair-wise relations. Their extensive testing indicates that this model sets a new benchmark, surpassing previous state-of-the-art methods52,53.

Innovations in ABSA have introduced models that outpace traditional methods in efficiency and accuracy. New techniques integrating commonsense knowledge into advanced LSTM frameworks have improved targeted sentiment analysis54. Multi-task learning models now effectively juggle multiple ABSA subtasks, showing resilience when certain data aspects are absent. Pre-trained models like RoBERTa have been adapted to better capture sentiment-related syntactic nuances across languages. Interactive networks bridge aspect extraction with sentiment classification, offering more complex sentiment insights. Additionally, novel end-to-end methods for pairing aspect and opinion terms have moved beyond sequence tagging to refine ABSA further. These strides are streamlining sentiment analysis and deepening our comprehension of sentiment expression in text55,56,57,58,59.

Innovative approaches to sentiment analysis leveraging attention mechanisms

Attention mechanisms have gained traction in deep learning (DL) models addressing ABSA sub-components, recognized for their effectiveness in semantically linking aspects with contextual words. In addressing aspect-based sentiment classification, Liu et al. identified a gap in current neural attention models, which tend to highlight sentiment words without adequately linking them to the relevant aspects within a sentence. This shortcoming becomes particularly evident in sentences with multiple aspects and complex structures. They introduced a novel attention-based model that incorporates dual mechanisms: a sentence-level attention for global aspect relevance, and a context-level attention that accounts for the sequence and interrelation of words. Their empirical results showed that this dual-mechanism approach significantly improves performance over existing models60. Lin et al. advanced the interpretability of sentence embeddings by leveraging a self-attention mechanism. Their novel approach represents embeddings as 2-D matrices, allowing each row to focus on distinct segments of a sentence. This not only enhances the model’s performance on tasks such as author profiling, sentiment classification, and textual entailment, but also provides an intuitive method for visualizing the parts of the sentence that contribute to the embedding’s formation61. Chen et al. explored the integration of Graph Convolutional Networks (GCN) with co-attention mechanisms to enhance ABSA. Their model effectively utilizes both semantic and syntactic information to filter out irrelevant context, demonstrating significant improvements in identifying the sentiment polarity of specific aspects within sentences62. Wang et al. targeted the challenge of discerning sentiment polarity towards specific aspects in text, a task complicated by the subtleties of language and the presence of multiple aspects within a single sentence. Their solution involves a novel encoding of syntactic information into a unified aspect-oriented dependency tree structure. By deploying a relational graph attention network (R-GAT) that operates on this refined tree structure, their method more accurately identifies connections between aspects and opinion words, leading to notable improvements in sentiment analysis performance on prominent datasets63.

Attention mechanisms have revolutionized ABSA, enabling models to home in on text segments critical for discerning sentiment toward specific aspects64. These models excel in complex sentences with multiple aspects, adjusting focus to relevant segments and improving sentiment predictions. Their interpretability and enhanced performance across various ABSA tasks underscore their significance in the field65,66,67.

Syntax-driven approaches to aspect-level sentiment analysis

Zhang and Qian’s model improves aspect-level sentiment analysis by using hierarchical syntactic and lexical graphs to capture word co-occurrences and differentiate dependency types, outperforming existing methods on benchmarks68. In the field of ALSC, Zheng et al. have highlighted the importance of syntactic structures for understanding sentiments related to specific aspects. Their novel neural network model, RepWalk, leverages replicated random walks on syntax graphs to better capture the informative contextual words crucial for sentiment analysis. This method has shown superior performance over existing models on multiple benchmark datasets, underscoring the value of incorporating syntactic structure into sentiment classification representations69. Zhang and Li’s research advances aspect-level sentiment classification by introducing a proximity-weighted convolution network that captures syntactic relationships between aspects and context words. Their model enhances LSTM-derived contexts with syntax-aware weights, effectively distinguishing sentiment for multiple aspects and improving the overall accuracy of sentiment predictions70. Huang and Li’s work enhances aspect-level sentiment classification by integrating syntactic structure and pre-trained language model knowledge. Employing a graph attention network on dependency trees alongside BERT’s subword features, their approach achieves refined context-aspect interactions, leading to more precise sentiment polarity determinations in complex sentences71. Xu, Pang, Wu, Cai, and Peng’s research focuses on leveraging comprehensive syntactic structures to improve aspect-level sentiment analysis. They introduce “Scope” as a novel concept to outline structural text regions pertinent to specific targets. Their hybrid graph convolutional network (HGCN) merges insights from both constituency and dependency tree analyses, enhancing sentiment-relation modeling and effectively sifting through noisy opinion words72. Xiao et al. enhance aspect-based sentiment classification by introducing a graph neural network model that leverages a part-of-speech guided syntactic dependency graph and a syntactic distance attention layer, significantly outperforming traditional methods on public datasets73. Incorporating syntax-aware techniques, the Enhanced Multi-Channel Graph Convolutional Network (EMC-GCN) for ASTE stands out by effectively leveraging word relational graphs and syntactic structures. Its use of biaffine attention to construct relation-aware representations, combined with a unique refining strategy for syntactically-informed word-pair representations, results in significant improvements over existing methods as evidenced by benchmark dataset performances19.

The integration of syntactic structures into ABSA has significantly improved the precision of sentiment attribution to relevant aspects in complex sentences74,75. Syntax-aware models excel in handling sentences with multiple aspects, leveraging grammatical relationships to enhance sentiment discernment. These models not only deliver superior performance but also offer better interpretability, making them invaluable for applications requiring clear rationale. The adoption of syntax in ABSA underscores the progression toward more human-like language processing in artificial intelligence76,77,78.

While existing literature lays a solid groundwork for Aspect-Based Sentiment Analysis, our model addresses critical limitations by advancing detection and classification capabilities in complex linguistic contexts. Our Multi-Layered Enhanced Graph Convolutional Network (MLEGCN) integrates a biaffine attention mechanism and a sophisticated graph-based approach to enhance nuanced text interpretation. This model effectively handles multiple sentiments within a single context and dynamically adapts to various ABSA sub-tasks, improving both theoretical and practical applications of sentiment analysis. This not only overcomes the simplifications seen in prior models but also broadens ABSA’s applicability to diverse real-world datasets, setting new standards for accuracy and adaptability in the field.

Proposed framework

In this section, we introduce the formal definitions pertinent to the sub-tasks of ABSA. Figure 3 presents the overall architecture of our fine-grained, comprehensive model for aspect-based analysis. Following these definitions, we formally state the problem in terms of them.

Given an input sentence \(S = \{w_1, w_2, \ldots , w_n\}\) comprising \(n\) words, our model excels in performing seven subtasks of ABSA. It identifies \(a\) as an aspect term and \(o\) as an opinion term, while \(s\) represents the sentiment polarity associated with the aspect. This sentiment polarity is classified within a label set \(X = \{\text {POS}, \text {NEU}, \text {NEG}\}\), encompassing three sentiment polarities: positive, neutral, and negative. The model processes the sentence to discern and interpret these specific elements.

1.

    Aspect Term Extraction (ATE): Extracts all aspect terms from the given sentence \(S\).

    • ATE: \(A = \{a_i | a_i \in S\}\)

2.

    Opinion Term Extraction (OTE): Identifies all opinion terms within the sentence \(S\).

    • OTE: \(O = \{o_j | o_j \in S\}\)

3.

    Aspect Level Sentiment Classification (ALSC): Predicts the sentiment polarity of each aspect term in \(S\), with polarities defined in \(X\).

    • ALSC: \(S_A = \{s(a_i) | a_i \in A, s(a_i) \in X\}\)

4.

Aspect-Oriented Opinion Extraction (AOE): Extracts, for each aspect term, the corresponding opinion terms from \(S\).

    • AOE: \(AO = \{(a_i, o_j) | a_i \in A, o_j \in O\}\)

5.

Aspect Extraction and Sentiment Classification (AESC): Simultaneously identifies aspect terms and their sentiments from \(S\).

    • AESC: \(AS = \{(a_i, s(a_i)) | a_i \in A, s(a_i) \in X\}\)

6.

    Aspect-Opinion Pairing (AOP): Finds pairs of aspect and opinion terms that are related within \(S\).

    • AOP: \(AOM = \{(a_i, o_j) | a_i \in A, o_j \in O, \text {related}\}\)

7.

    Aspect-Sentiment-Triplet Extraction (ASTE): Forms triplets from \(S\) that consist of an aspect term, opinion term, and sentiment polarity.

    • ASTE: \(T = \{(a_i, o_j, s_k) | a_i \in A, o_j \in O, s_k \in X\}\)
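To make these definitions concrete, the following sketch (illustrative Python, not part of the model) lists the expected outputs of each sub-task for the smartphone review discussed in the introduction, mirroring the set notation above.

```python
# Expected sub-task outputs for the review from the introduction:
# "The camera delivers stunning images, but the battery life is quite disappointing."
expected_outputs = {
    "ATE":  ["camera", "battery life"],                    # aspect terms A
    "OTE":  ["stunning", "disappointing"],                 # opinion terms O
    "ALSC": {"camera": "POS", "battery life": "NEG"},      # s(a_i) in X
    "AOE":  [("camera", "stunning"),
             ("battery life", "disappointing")],           # opinions per aspect
    "AESC": [("camera", "POS"), ("battery life", "NEG")],  # aspect + sentiment
    "AOP":  [("camera", "stunning"),
             ("battery life", "disappointing")],           # related pairs
    "ASTE": [("camera", "stunning", "POS"),
             ("battery life", "disappointing", "NEG")],    # full triplets
}
```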

Relation definition and table filling

Figure 2

Table filling for ABSA in a sentence is illustrated. Each cell denotes a word pair with a relation or label.

The study categorizes word relationships within sentences into ten distinct types, following the methodology introduced by Chen et al.19. Four specific labels, {B-A, I-A, B-O, I-O}, are applied to accurately identify terms that represent aspects and opinions. This refined strategy enhances boundary definition within the model, offering improvements over the GTS approach previously outlined by Wu et al.79. The ‘B’ and ‘I’ labels signify the beginning and interior of a term, respectively, while the suffixes -A and -O categorize a term as either an aspect or an opinion. The A and O relations assist in determining whether two distinct words pertain to the same aspect or opinion term. Moreover, the sentiment relations {POS, NEU, NEG} serve a dual purpose: they confirm whether word pairs match as aspect-opinion pairs and ascertain the sentiment polarity linked with those pairs. By implementing the table-filling technique, as detailed by Miwa & Sasaki80 and Gupta et al.81, a relation table is constructed for each annotated sentence. This process is exemplified in Figure 2, which illustrates the word pairs along with their designated relations, each table cell denoting a specific word-to-word relationship (the overall architecture is shown in Figure 3).
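As a concrete illustration of the table-filling scheme, the sketch below fills a relation table for a simplified, five-word rendering of the running example; the reduced token list is an assumption made for brevity, while the labels follow the ten-relation inventory described above.

```python
import numpy as np

# Simplified token list for the running example (content words only, hypothetical)
words = ["camera", "stunning", "battery", "life", "disappointing"]
n = len(words)
table = np.full((n, n), "N", dtype=object)    # "N": no relation

# Diagonal cells carry the BIO-style term labels
table[0, 0] = "B-A"                           # "camera": single-word aspect
table[1, 1] = "B-O"                           # "stunning": single-word opinion
table[2, 2], table[3, 3] = "B-A", "I-A"       # "battery life": two-word aspect
table[4, 4] = "B-O"                           # "disappointing": opinion

# A/O cells mark word pairs belonging to the same aspect/opinion term
table[2, 3] = table[3, 2] = "A"               # "battery" and "life" share one aspect

# Sentiment cells both pair aspects with opinions and carry the polarity
table[0, 1] = table[1, 0] = "POS"             # (camera, stunning) -> positive
for i in (2, 3):                              # (battery life, disappointing) -> negative
    table[i, 4] = table[4, i] = "NEG"
```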

Algorithm 1

Comprehensive ABSA Algorithm with Classified Subtasks.

Model layers and formation

Input layer

BERT, short for Bidirectional Encoder Representations from Transformers, was introduced by Devlin et al. in 2019. This model has been widely recognized for its outstanding performance on various natural language processing tasks. BERT utilizes a deep learning technique known as the Transformer, which employs attention mechanisms to capture contextual information from all words in a sentence, irrespective of their positions82. When BERT processes an input sentence \(S\), which consists of a sequence of tokens \(S = \{w_1, w_2, \ldots , w_n\}\) where \(n\) is the number of tokens, it generates a corresponding sequence of hidden states \(H = \{h_1, h_2, \ldots , h_n\}\). These hidden states are derived from the last layer of the Transformer block within BERT, capturing the nuanced contextual relationships between the input tokens. This representation power of BERT enables it to serve as an effective sentence encoder for various downstream tasks, providing enriched feature representations that can significantly enhance the performance of natural language understanding systems.
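The snippet below is a minimal sketch of this encoding step using the Hugging Face transformers library; note that BERT tokenizes into subwords, so \(n\) here counts wordpieces rather than words.

```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
encoder = AutoModel.from_pretrained("bert-base-uncased")

sentence = "The camera delivers stunning images, but the battery life is quite disappointing."
inputs = tokenizer(sentence, return_tensors="pt")
with torch.no_grad():
    outputs = encoder(**inputs)

H = outputs.last_hidden_state  # (1, n, 768): contextual hidden states h_1..h_n
```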

Figure 3

The overall architecture of the fine-grained sentiments comprehensive model for aspect-based analysis.

Attention module

In our model, we employ a biaffine attention module to determine the relational probability distribution between word pairs in a sentence. The effectiveness of biaffine attention in syntactic dependency parsing is well-documented83. The biaffine attention mechanism is defined by several key equations, as outlined below:

  • Equation (1) defines the transformation of hidden state \(h_i\) through the attention module:

    $$\begin{aligned} h^a_i = \text {MLP}_a(h_i) \end{aligned}$$
    (1)
  • Equation (2) similarly transforms hidden state \(h_j\):

    $$\begin{aligned} h^o_j = \text {MLP}_o(h_j) \end{aligned}$$
    (2)
  • Equation (3) calculates the interaction score \(g_{i,j}\) for word pairs:

$$\begin{aligned} g_{i,j} = (h^a_i)^T U^1 h^o_j + U^2 (h^a_i \oplus h^o_j) + b \end{aligned}$$
    (3)
  • Equation (4) normalizes these scores to determine the relation probability \(r_{i,j,k}\) for each relation type:

    $$\begin{aligned} r_{i,j,k} = \frac{\exp (g_{i,j,k})}{\sum _{l=1}^m \exp (g_{i,j,l})} \end{aligned}$$
    (4)
  • Finally, Equation (5) applies the biaffine attention to obtain the adjacency tensor \(R\):

    $$\begin{aligned} R = \text {Biaffine}(\text {MLP}_a(H), \text {MLP}_o(H)) \end{aligned}$$
    (5)

These equations collectively model the relations between words in a sentence, where \(m\) represents the number of relation types, and each relation type corresponds to a channel in the adjacency tensor \(R\). The trainable parameters \(U^1\), \(U^2\), and \(b\), along with the concatenation operation \(\oplus\), are integral to this process.
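The following PyTorch sketch implements Eqs. (1) to (5); the MLP depth, activation, and initialization are our assumptions, since the equations fix only the overall biaffine form.

```python
import torch
import torch.nn as nn

class BiaffineAttention(nn.Module):
    """Biaffine scoring of word pairs, producing the n x n x m adjacency tensor R."""
    def __init__(self, hidden_dim: int, mlp_dim: int, num_relations: int):
        super().__init__()
        self.mlp_a = nn.Sequential(nn.Linear(hidden_dim, mlp_dim), nn.ReLU())  # Eq. (1)
        self.mlp_o = nn.Sequential(nn.Linear(hidden_dim, mlp_dim), nn.ReLU())  # Eq. (2)
        self.U1 = nn.Parameter(torch.empty(num_relations, mlp_dim, mlp_dim))
        nn.init.xavier_uniform_(self.U1)
        self.U2 = nn.Linear(2 * mlp_dim, num_relations)  # supplies U^2 and the bias b

    def forward(self, H: torch.Tensor) -> torch.Tensor:
        n = H.size(0)
        h_a, h_o = self.mlp_a(H), self.mlp_o(H)
        # Eq. (3): bilinear term (h_i^a)^T U^1 h_j^o for every pair and relation type
        bilinear = torch.einsum("ip,kpq,jq->ijk", h_a, self.U1, h_o)
        # Eq. (3): linear term U^2 (h_i^a (+) h_j^o) + b over the concatenated pair
        pairs = torch.cat([h_a.unsqueeze(1).expand(n, n, -1),
                           h_o.unsqueeze(0).expand(n, n, -1)], dim=-1)
        g = bilinear + self.U2(pairs)
        return torch.softmax(g, dim=-1)  # Eq. (4): probabilities r_{i,j,k} over m types

# e.g. R = BiaffineAttention(768, 300, 10)(torch.randn(12, 768))  # R: (12, 12, 10)
```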

Multi-layered enhanced graph convolutional network (MLEGCN)

The MLEGCN represents a significant development over traditional Graph Convolutional Networks (GCN), designed to process graph-structured data more effectively in natural language processing tasks. Originating from the adaptation of Convolutional Neural Networks (CNNs) to graph data84,85, the MLEGCN enhances this model by introducing mechanisms that capture complex relational dynamics within sentences.

In the MLEGCN framework, each node in the graph corresponds to a word, while edges reflect the syntactic dependencies between these words. This setup facilitates in-depth modeling of sentence structures. The connections between nodes are represented using an adjacency matrix \(A \in \mathbb {R}^{n \times n}\), where \(n\) is the number of words in a sentence. In this matrix, \(A_{ij} = 1\) indicates a direct syntactic link between the words corresponding to nodes \(i\) and \(j\), and \(A_{ij} = 0\) otherwise.

A significant enhancement in MLEGCN is the integration of soft edges, which express the probabilistic strengths of connections between node pairs. This concept is inspired by advancements in attention mechanisms86, allowing the network to adjust the influence of each connection dynamically. The model incorporates a multi-channel adjacency tensor \(R^{ba} \in \mathbb {R}^{n \times n \times m}\), where each channel \(m\) corresponds to a unique type of relational dynamic, modulated through a biaffine attention module.

The computational operations in MLEGCN are detailed as follows:

$$\begin{aligned} H^{ba}_{k} = \sigma (R^{ba}_{:,:,k} H W_k + b_k) \quad \end{aligned}$$
(6)
$$\begin{aligned} {\hat{H}}^{ba} = f(H^{ba}_1, H^{ba}_2, \ldots , H^{ba}_m) \quad \end{aligned}$$
(7)

In Equation 6, \(R^{ba}_{:,:,k}\) represents the \(k\)-th relational channel within \(R^{ba}\). \(W_k\) and \(b_k\) denote the weight and bias specific to that channel. The function \(\sigma\) is an activation function, such as ReLU, used to introduce non-linearity into the network. The function \(f(\cdot )\), a pooling operation, combines the hidden representations from all channels to produce a unified node representation.

Through its channel-specific convolutions, MLEGCN is able to differentiate and analyze various types of word relationships. This capability allows for a more nuanced understanding of language, making MLEGCN particularly effective for tasks like sentiment analysis, entity recognition, and syntactic parsing. The consolidated output \({\hat{H}}^{ba}\), derived by pooling across channels (Equation 7), provides a holistic view of the word relationships, crucial for performing complex downstream tasks.
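The sketch below shows one such layer, assuming ReLU for \(\sigma\) and mean pooling for \(f(\cdot )\), consistent with Eqs. (6) and (7); batching and layer stacking are omitted for brevity.

```python
import torch
import torch.nn as nn

class MLEGCNLayer(nn.Module):
    """One multi-channel graph convolution: Eq. (6) per channel, Eq. (7) pooling."""
    def __init__(self, in_dim: int, out_dim: int, num_channels: int):
        super().__init__()
        self.W = nn.Parameter(torch.empty(num_channels, in_dim, out_dim))  # W_k
        self.b = nn.Parameter(torch.zeros(num_channels, out_dim))          # b_k
        nn.init.xavier_uniform_(self.W)

    def forward(self, H: torch.Tensor, R: torch.Tensor) -> torch.Tensor:
        # H: (n, in_dim) node states; R: (n, n, m) multi-channel adjacency tensor
        outs = [torch.relu(R[:, :, k] @ H @ self.W[k] + self.b[k])  # Eq. (6)
                for k in range(self.W.size(0))]
        return torch.stack(outs).mean(dim=0)                        # Eq. (7): f = mean

# e.g. H_hat = MLEGCNLayer(768, 300, 10)(torch.randn(12, 768), torch.rand(12, 12, 10))
```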

Enhanced understanding of syntactic features

The innovative framework of Chen et al.19 employs a comprehensive suite of linguistic features that critically examine the interrelations between word pairs within sentences. These features, which include combinations of part-of-speech tags, varieties of syntactic dependencies, tree-based hierarchical distances, and relative positioning within the sentence, contribute to a detailed understanding of language structure.

In practical terms, the model begins by examining each word pair \((w_i, w_j)\) and assigning a self-dependency feature that signifies the inherent syntactic role associated with the words. This is operationalized through the initialization of four adjacency tensors: \(R_{\text {psc}}\) for part-of-speech combinations, \(R_{\text {dep}}\) for syntactic dependencies, \(R_{\text {tbd}}\) for tree-based distances, and \(R_{\text {rpd}}\) for relative positions, each offering a different perspective on sentence structure.

Focusing on the syntactic dependency dimension as an illustrative case, when there is a recognized dependency type such as ’nsubj’ (nominal subject) between \(w_i\) and \(w_j\), the corresponding location in the tensor \(R_{\text {dep}}\) is embedded with a vector representation of ’nsubj’. This embedding is retrieved from a dynamically learned table, encapsulating the relationship’s essence. Conversely, the absence of a dependency connection is indicated by a zero vector at the respective tensor indices.
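The sketch below illustrates how the dependency channel \(R_{\text {dep}}\) might be initialized: a learned embedding table maps each dependency label to a vector, a padding row yields the zero vector for unconnected pairs, and the diagonal receives a self-dependency embedding. The label inventory and embedding size are illustrative assumptions.

```python
import torch
import torch.nn as nn

# Illustrative label inventory; in practice it comes from the parser's tagset
DEP_LABELS = ["<none>", "<self>", "nsubj", "obj", "amod", "advmod", "det"]
dep_vocab = {label: i for i, label in enumerate(DEP_LABELS)}
# padding_idx=0 keeps the "<none>" row at the zero vector, as required
dep_embed = nn.Embedding(len(DEP_LABELS), 50, padding_idx=0)

def build_dep_channel(n, arcs):
    """Build R_dep (n x n x 50) from dependency arcs (head, dependent, label)."""
    ids = torch.zeros(n, n, dtype=torch.long)   # default: "<none>" -> zero vector
    ids.fill_diagonal_(dep_vocab["<self>"])     # self-dependency feature
    for head, dep, label in arcs:
        idx = dep_vocab.get(label, 0)
        ids[head, dep] = ids[dep, head] = idx   # symmetric word-pair feature
    return dep_embed(ids)                       # embedding lookup per cell

# e.g. R_dep = build_dep_channel(3, [(1, 0, "det"), (1, 2, "amod")])  # (3, 3, 50)
```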

The tensors undergo a process of graph convolutions, refining the raw node representations into enriched forms \({\hat{H}}_{\text {psc}}, {\hat{H}}_{\text {dep}}, {\hat{H}}_{\text {tbd}},\) and \({\hat{H}}_{\text {rpd}}\). Through techniques such as average pooling and concatenation, these representations are synthesized into holistic node and edge descriptors for the sentence as given by:

$$\begin{aligned} H = f({\hat{H}}_{\text {ba}}, {\hat{H}}_{\text {psc}}, {\hat{H}}_{\text {dep}}, {\hat{H}}_{\text {tbd}}, {\hat{H}}_{\text {rpd}}) \end{aligned}$$
(8)
$$\begin{aligned} R = R_{\text {ba}} \oplus R_{\text {psc}} \oplus R_{\text {dep}} \oplus R_{\text {tbd}} \oplus R_{\text {rpd}} \end{aligned}$$
(9)

Here, \(H\) encapsulates the ensemble of node representations \(\{h_1, h_2, \ldots , h_n\}\), while \(R\) aggregates the edge representations \(\{r_{1,1}, r_{1,2}, \ldots , r_{n,n}\}\) which collectively enhance the model’s proficiency in recognizing and interpreting complex linguistic constructs, thereby substantially improving its applicability in diverse NLP tasks.

Figure 4 illustrates the matrices corresponding to the syntactic features utilized by the model. The Part-of-Speech Combinations and Dependency Relations matrices reveal the frequency and types of grammatical constructs present in a sample sentence. Similarly, the Tree-based Distances and Relative Position Distance matrices display numerical representations of word proximities and their respective hierarchical connections within the same sentence. These visualizations underscore the framework’s capacity to capture and quantify the syntactic essence of language.

Figure 4

Matrices depicting the syntactic features leveraged by the framework for analyzing word pair relationships in a sentence, illustrating part-of-speech combinations, dependency relations, tree-based distances, and relative positions.

Correlation constraints

To ensure the precise delineation of word relationships within a sentence, the model enforces a constraint on the adjacency tensor, which originates from the biaffine attention framework. This constraint is quantified by the following expression in equation 10:

$$\begin{aligned} L_{ba} = -\sum _{i=1}^{n} \sum _{j=1}^{n} \sum _{c \in C} I(y_{ij} = c) \log (r_{ij|c}) \end{aligned}$$
(10)

where:

  • \(I(\cdot )\) stands for the indicator function.

  • \(y_{ij}\) denotes the verified relationship type between the word pair \((w_i, w_j)\).

  • \(C\) represents the comprehensive set of all potential relationship types.

  • \(r_{ij|c}\) is the model’s forecasted probability score for the relationship type \(c\) between the word pair \((w_i, w_j)\).

In addition, this relational constraint is similarly applied to four other adjacency tensors, each linked to distinct linguistic features. These tensors are labeled as \(L_{psc}\), \(L_{dep}\), \(L_{tbd}\), and \(L_{rpd}\), correlating with individual linguistic feature sets.

Systematic refinement and prediction module

The predictive capabilities of our model are heavily reliant on accurately determining the sentiment relationship between word pairs \((w_i, w_j)\). This process begins with the combination of the individual node representations \(h_i\) and \(h_j\) with the edge representation \(r_{ij}\), as illustrated in Equation (11)87. To enhance this initial representation, we introduce a systematic refinement strategy that utilizes additional self-referential edge representations \(r_{ii}\) and \(r_{jj}\). These are crucial in contexts where words may have self-related sentiment implications that affect their interaction with other words in the sentence.

Refinement Strategy Rationale and Mechanism:

  • Enhanced Contextual Understanding: The inclusion of \(r_{ii}\) and \(r_{jj}\) allows our model to incorporate not only direct relational dynamics between \(w_i\) and \(w_j\) but also each word’s relationship with itself. This dual consideration is critical, especially in complex sentences where aspects and opinions can be nuanced.

  • Aspect and Opinion Extraction Influences: When \(w_i\) is an aspect and \(w_j\) an opinion, the combined representation \(s_{ij}\) is enriched by this systematic refinement approach. We leverage outcomes from aspect and opinion extractions to better assess and predict the potential sentiment (positive, neutral, or negative) based on empirical observations that aspects and opinions typically generate strong sentiment indicators.

Figure 5

Illustration of systematic refinement.

$$\begin{aligned} s_{ij} = h_i \oplus h_j \oplus r_{ij} \oplus r_{ii} \oplus r_{jj} \end{aligned}$$
(11)

This refined representation \(s_{ij}\) is processed through a linear layer followed by a softmax activation to calculate the probabilities of the sentiment label distribution also depicted in Figure 5:

$$\begin{aligned} p_{ij} = \text {softmax}(W_p s_{ij} + b_p) \end{aligned}$$
(12)

Impact of Refinement on Prediction Accuracy: To validate the effectiveness of our refinement strategy, we conducted error analysis comparing model outputs with and without the inclusion of self-referential edges. Our findings reveal that models incorporating \(r_{ii}\) and \(r_{jj}\) consistently perform better in scenarios involving implicit sentiment relations and complex aspect-opinion structures. Specifically, error rates decrease significantly in cases involving subtle sentiment expressions, underscoring the importance of our systematic refinement strategy. Equation 11 refines the representation of word pairs by integrating additional context that enhances the model’s sensitivity to nuanced linguistic features. Equation 12 then leverages this refined representation to predict the most likely sentiment label for each word pair, demonstrating a tangible improvement in the model’s ability to discern and classify sentiment relationships accurately. This enhancement is crucial for robust performance across diverse datasets and is supported by quantitative improvements in prediction accuracy in our experimental results section.
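A compact sketch of Eqs. (11) and (12) follows; the tensor shapes and the diagonal extraction of the self-referential edges \(r_{ii}\) and \(r_{jj}\) are our implementation assumptions.

```python
import torch
import torch.nn as nn

def refine_and_predict(h: torch.Tensor, r: torch.Tensor, W_p: nn.Linear) -> torch.Tensor:
    """h: (n, d) node states; r: (n, n, e) edge representations.
    Returns (n, n, num_labels) label probabilities p_ij."""
    n = h.size(0)
    h_i = h.unsqueeze(1).expand(n, n, -1)             # h_i for every pair (i, j)
    h_j = h.unsqueeze(0).expand(n, n, -1)             # h_j for every pair (i, j)
    diag = r.diagonal(dim1=0, dim2=1).permute(1, 0)   # (n, e): self-edges r_11..r_nn
    r_ii = diag.unsqueeze(1).expand(n, n, -1)
    r_jj = diag.unsqueeze(0).expand(n, n, -1)
    s = torch.cat([h_i, h_j, r, r_ii, r_jj], dim=-1)  # Eq. (11)
    return torch.softmax(W_p(s), dim=-1)              # Eq. (12)

# e.g. with d=300, e=50, and ten relation labels:
# p = refine_and_predict(torch.randn(8, 300), torch.randn(8, 8, 50),
#                        nn.Linear(2 * 300 + 3 * 50, 10))
```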

Loss function

The loss function we aim to minimize is given by:

$$\begin{aligned} L = L_p + \alpha L_{ba} + \beta (L_{psc} + L_{dep} + L_{tbd} + L_{rpd}) \end{aligned}$$
(13)

where:

  • \(L_p\) is the standard cross-entropy loss for the ASTE task, defined as:

    $$\begin{aligned} L_p = -\sum _{i=1}^n \sum _{j=1}^n \sum _{c \in C} I(y_{ij} = c) \log (p_{i,j|c}) \end{aligned}$$
    (14)
  • \(\alpha\) and \(\beta\) are coefficients that balance the influence of the different components of the loss function.

  • \(L_{ba}\), \(L_{psc}\), \(L_{dep}\), \(L_{tbd}\), and \(L_{rpd}\) represent additional loss components, addressing specific constraints and aspects of the task.

The structure of \(L\) combines the primary task-specific loss with additional terms that incorporate constraints and auxiliary objectives, each weighted by their respective coefficients.
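A sketch of the combined objective is given below; the per-pair cross-entropy helper matches the form shared by Eqs. (10) and (14), and the default coefficients follow the values reported in the implementation details.

```python
import torch
import torch.nn.functional as F

def pairwise_ce(logits: torch.Tensor, gold: torch.Tensor) -> torch.Tensor:
    """Cross-entropy over all word pairs (the form of Eqs. 10 and 14).
    logits: (n, n, |C|) relation scores; gold: (n, n) gold relation indices."""
    return F.cross_entropy(logits.reshape(-1, logits.size(-1)), gold.reshape(-1))

def total_loss(L_p, L_ba, L_psc, L_dep, L_tbd, L_rpd, alpha=0.1, beta=0.01):
    # Eq. (13): primary ASTE loss plus weighted relation constraints
    return L_p + alpha * L_ba + beta * (L_psc + L_dep + L_tbd + L_rpd)
```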

Experiments

Datasets

The study presents a detailed examination of the method's efficacy when applied to two distinct benchmark datasets within the field of ABSA. These datasets are associated with the Semantic Evaluation (SemEval) challenges held over three consecutive years, 2014 through 201688,89.

The first of these datasets, referred to herein as Dataset 1 (D1), was introduced by Wu et al. (2020). The second dataset, known as Dataset 2 (D2), was annotated by Xu et al. (2020); it is an enhanced and corrected version of an earlier dataset put forth by Peng et al. (2020), rectifying previous inaccuracies79,90,91.

Comprehensive metrics and statistical breakdowns of these two datasets are compiled in Table 2, including the volume of data points, the distribution of sentiment classes, and the variety of aspects covered, all of which are essential context for judging the strength and effectiveness of the ABSA methodology under review.

Additional resources and tools relevant to this study can be found at the following GitHub repositories: (https://github.com/xuuuluuu/SemEval-Triplet-data/tree/master/ASTE-Data-V2-EMNLP2020, https://github.com/huggingface/transformers, https://github.com/NJUNLP/GTS).

Table 2 Combined statistics for datasets D1 and D2.

Implementation details

In our research, we implement the BERT-base-uncased model as the core sentence encoder. To optimize this encoder, we employ the AdamW optimizer, as proposed by Loshchilov and Hutter (2018)92. This optimizer is configured with a learning rate of \(2 \times 10^{-5}\), a setting tailored for fine-tuning the BERT component. For the other trainable parts of our model, a distinct learning rate of \(10^{-3}\) is utilized. This bifurcation in learning rates is a strategic decision, ensuring that while the BERT model is fine-tuned with precision, other model components are trained more aggressively. Additionally, we set the dropout rate to 0.5 to mitigate the risk of overfitting, a common concern in deep learning models.

The architecture of our model is built with a keen eye on dimensionality: the hidden state sizes for BERT and the Graph Convolutional Network (GCN) are set to 768 and 300, respectively. This difference reflects the varied complexity and nature of the data each component handles. Our model, termed MLEGCN, diverges from the traditional EMC-GCN framework. It is trained for 100 epochs with a batch size of 16, a combination chosen to balance computational efficiency with effective learning. To manage the influence of relation constraints within our model, we tune two hyperparameters: \(\alpha\) is set to 0.1 and \(\beta\) to 0.01. This fine-tuning is crucial for balancing the relation dynamics in the model. Notably, the number of channels in our model equals the predefined number of relations, a design choice dictated by the fixed nature of these relation constraints.
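In practice, the two learning rates translate into two AdamW parameter groups, as sketched below; the `bert.` prefix used to identify encoder parameters is an assumed naming convention.

```python
import torch

# `model` is the assembled MLEGCN network (defined elsewhere).
# Split parameters: the BERT encoder is fine-tuned gently (2e-5), while all
# other modules are trained more aggressively (1e-3), as described above.
bert_params, other_params = [], []
for name, param in model.named_parameters():
    (bert_params if name.startswith("bert.") else other_params).append(param)

optimizer = torch.optim.AdamW([
    {"params": bert_params, "lr": 2e-5},
    {"params": other_params, "lr": 1e-3},
])
```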

For parsing and preparing the input sentences, we employ the Stanza tool, developed by Qi et al. (2020). Stanza is renowned for its robust parsing capabilities, which is critical for preparing the textual data for processing by our model. We ensure that the model parameters are saved based on the optimal performance observed in the development set, a practice aimed at maximizing the efficacy of the model in real-world applications93. Furthermore, to present a comprehensive and reliable analysis of our model’s performance, we average the results from five distinct runs, each initialized with a different random seed. This method provides a more holistic view of the model’s capabilities, accounting for variability and ensuring the robustness of the reported results.

Baselines

We evaluate the proposed method against a diverse set of baseline models, as detailed in Table 3. While many baseline models focus solely on specific subsets of the tasks associated with Aspect-Based Sentiment Analysis (ABSA), only a few provide comprehensive solutions for all associated sub-tasks.

  • OTE-MTL94 conceptualizes ABSA as a process of extracting opinion triplets and utilizes a multi-task learning approach with distinct detection heads along with a sentiment dependency parser.

  • Li-Unified+95 introduces a unified model for target-based sentiment analysis, employing dual RNNs to predict unified tags and determine target boundaries.

  • RINANTE+96 uses rules derived from dependency parsing outputs to extract aspect and opinion terms, applying these rules on auxiliary data and refining the approach through a neural model.

  • TS97 addresses the extraction of aspect sentiment triplets, advocating a two-step methodology for the prediction and association of aspects, opinions, and sentiments.

  • CMLA+98 offers a comprehensive solution for the simultaneous extraction of aspect and opinion terms using a multi-layer attention mechanism.

  • EMC-GCN19 incorporates word relationships within a multi-channel graph structure, representing these relationships as nodes and edges for extracting aspect sentiment triplets.

  • SPAN-ASTE99 explores the interaction between complete spans of aspects and opinions to predict sentiment relationships essential for triplet extraction.

  • IMN-BERT100 learns multiple tasks associated with ABSA at both token and document levels simultaneously, using a multi-task network approach.

  • JET-BERT101 employs an end-to-end model for triplet extraction with a position-aware tagging scheme to capture complex interactions among triplets.

DMRC102 tackles all ABSA tasks in a unified framework, jointly training two BERT-based machine reading comprehension (MRC) models with shared parameters.

  • BMRC103 conceptualizes ASTE as a multi-turn MRC problem, deploying a bidirectional MRC architecture to identify sentiment triplets.

  • BART-ABSA104 converts ABSA tasks into a generative model framework using BART for an integrated approach.

  • SE-GCN105 presents a ’Syntax-Enhanced Graph Convolutional Network’, which integrates semantic and syntactic insights through graph convolution and attention mechanisms, thereby improving performance across various benchmarks.

Table 3 The summary of baseline competencies in the experiments is presented as follows: a check mark (\(\checkmark\)) indicates the baseline’s ability to perform the sub-task, while a cross mark (\(\times\)) indicates the baseline’s inability to perform the sub-task.
Table 4 Empirical outcomes for the tasks of Opinion Term Extraction (OTE), Aspect Extraction and Sentiment Classification (AESC), Aspect-Opinion Pairing (AOP), and Aspect Sentiment Triplet Extraction (ASTE) on dataset D1.

Performance evaluation and comparative analysis

Our experimental evaluation on the D1 dataset presented in Table 4 included a variety of models handling tasks such as OTE, AESC, AOP, and ASTE. These models were assessed on their precision, recall, and F1-score metrics, providing a comprehensive view of their performance in Aspect Based Sentiment Analysis.

The “Ours” model showcased consistent high performance across all tasks, especially notable in its F1-scores. This indicates a well-balanced approach to precision and recall, crucial for nuanced tasks in natural language processing. SE-GCN also emerged as a top performer, particularly excelling in F1-scores, which suggests its efficiency in dealing with the complex challenges of sentiment analysis.

In the specific task of OTE, models like SE-GCN, BMRC, and “Ours” achieved high F1-scores, indicating their effectiveness in accurately identifying opinion terms within texts. For AESC, “Ours” and SE-GCN performed exceptionally well, demonstrating their ability to effectively extract and analyze aspects and sentiments in tandem.

In the Aspect-Opinion Pairing task, “Ours” and SE-GCN showed remarkable proficiency, suggesting their adeptness at correctly pairing aspects with corresponding opinions. Additionally, in the ASTE task, our model demonstrated superior performance, underlining its capability in intricately extracting linked aspect-sentiment entities.

When comparing our model to traditional models like Li-Unified+ and RINANTE+, it is evident that “Ours” outperforms them in almost all metrics. This superiority could be attributed to more advanced or specialized methodologies employed in our model. RACL-BERT also showed significant performance in certain tasks, likely benefiting from the advanced contextual understanding provided by BERT embeddings. The TS model, while not topping any category, showed consistent performance across tasks, suggesting its robustness.

An interesting observation from the results is the trade-off between precision and recall in several models. This indicates potential areas for improvement in future research. The selection of a model for practical applications should consider specific needs, such as the importance of precision over recall or vice versa.

These results indicate that there is room for enhancement in the field, particularly in balancing precision and recall. Future research could explore integrating context-aware embeddings and sophisticated neural network architectures to enhance performance in Aspect Based Sentiment Analysis.

In conclusion, our model demonstrates excellent performance across various tasks in ABSA on the D1 dataset, suggesting its potential for comprehensive and nuanced sentiment analysis in natural language processing. However, the choice of the model for specific applications should be aligned with the unique requirements of the task, considering the inherent trade-offs in precision, recall, and the complexities of natural language understanding. This study opens avenues for further research to enhance the accuracy and effectiveness of sentiment analysis models.

Table 5 Empirical findings for aspect sentiment triplet extraction (ASTE) on the D2 dataset.

In Table 5, we observe a detailed comparison of various models for ASTE across four datasets: Lap14, Res14, Res16, and Res15. The evaluation metrics, precision (P), recall (R), and F1-score (F1), provide a comprehensive view of each model's performance in complex sentiment analysis tasks. Notably, SE-GCN stands out on the Lap14 dataset, achieving the highest F1-score (59.72), which reflects its effective handling of sentiment relationships. However, our model demonstrates exceptional consistency across all datasets, either closely matching or surpassing SE-GCN in terms of F1-scores. This is particularly evident on the Res14 and Res15 datasets, where our model records the highest F1-scores, showcasing its precision and robustness in sentiment analysis.

While other models like SPAN-ASTE and BART-ABSA show competitive performances, they are slightly outperformed by the leading models. In the Res16 dataset, our model continues its dominance with the highest F1-score (71.49), further establishing its efficacy in ASTE tasks. This performance indicates a refined balance in identifying and linking aspects and sentiments, a critical aspect of effective sentiment analysis. In contrast, models such as RINANTE+ and TS, despite their contributions, show room for improvement, especially in achieving a better balance between precision and recall.

The results presented in Table 5 emphasize the varying efficacy of models across different datasets. Each dataset’s unique characteristics, including the complexity of language and the nature of expressed aspects and sentiments, significantly impact model performance. The consistent top-tier performance of our model across diverse datasets highlights its adaptability and nuanced understanding of sentiment dynamics. Such adaptability is crucial in real-world scenarios, where data variability is a common challenge. Overall, these findings from Table 5 underscore the significance of developing versatile and robust models for Aspect Based Sentiment Analysis, capable of adeptly handling a variety of linguistic and contextual complexities.

Model analysis

Ablation study

The ablation study results reveal several important insights about the contributions of various components to the performance of our model. Firstly, it is evident that the complete model configuration, comprising the refinement process, syntactic features, and the integrated MLEGCN and attention modules, consistently yields the highest F1 scores across both the Res14 and Lap14 datasets. This underscores the synergy between the components, suggesting that each plays a crucial role in the model's ability to effectively process and analyze linguistic data. In particular, removing the refinement process results in a uniform, albeit relatively slight, decrease in performance across all model variations and datasets. This suggests that while the refinement process enhances the model's accuracy, its contribution is subtle, improving the final stages of the model's predictions by refining and fine-tuning the representations.

Table 6 F1 scores for the ablation study.

More pronounced are the effects observed in Table 6 from the removal of syntactic features and of the MLEGCN and attention mechanisms. The exclusion of syntactic features leads to varied impacts on performance, with more significant declines in tasks that require a deeper understanding of linguistic structures, such as AESC, AOPE, and ASTE. This indicates that syntactic features are integral to the model's ability to parse complex syntactic relationships effectively. Even more critical is the role of the MLEGCN and attention mechanisms, whose removal results in the most substantial decreases in F1 scores across nearly all tasks and both datasets. This substantial performance drop highlights their pivotal role in enhancing the model's capacity to focus on and interpret intricate relational dynamics within the data. The attention mechanisms, in particular, are crucial for weighting the importance of different elements within the input data, suggesting that their ability to direct the model's focus is essential for tasks requiring nuanced understanding and interpretation.

These observations from the ablation study not only validate the design choices made in constructing the model but also highlight areas for further refinement and exploration. The consistent performance degradation observed upon the removal of these components confirms their necessity and opens up avenues for further enhancing these aspects of the model. Future work could explore more sophisticated or varied attention mechanisms and delve deeper into optimizing syntactic feature extraction and integration to boost the model’s performance, particularly in tasks that heavily rely on these components.

Syntactic features qualitative analysis

The visualizations in Figure 6 serve as a qualitative analysis of the model's syntactic feature representations. The observable patterns in the embedding spaces provide insights into the model's capacity to encode the syntactic roles, dependencies, and relationships inherent in the linguistic data. For instance, the discernible clusters in the POS embeddings suggest that the model has learned distinct representations for different grammatical categories, which is crucial for tasks reliant on POS tagging. Moreover, the spread and arrangement of points in the dependency embeddings indicate the model's ability to capture a variety of syntactic dependencies, a key aspect for parsing and related NLP tasks. Such qualitative observations complement our quantitative findings, together forming a comprehensive evaluation of the model's performance.

Figure 6

Comprehensive visualization of the embeddings for four key syntactic features.

Case study

The case study offers a meticulous examination of our model's ABSA capabilities against established baselines such as BART-ABSA and BMRC, as presented in Table 7. Across a diverse array of product reviews, our model consistently demonstrates superior accuracy in deciphering complex aspect-sentiment relationships. For example, in Review 3, our model accurately captures the positive sentiment ’superb’ associated with ’noise cancellation’ and the negative sentiment ’short’ tied to ’battery life’, aligning perfectly with the ground truth. This precision is attributed to our model's advanced linguistic feature extraction and refined sentiment contextualization, which outperform the competing models, particularly where the sentiment is subtle or the aspect term is compound. Moreover, the case study exposes the baselines' error patterns: BART-ABSA occasionally falters in associating sentiments with the correct aspects, and BMRC sometimes misinterprets complex sentiment expressions. In contrast, our model exhibits a robust understanding of intricate linguistic cues, leading to its enhanced performance. These insights not only reaffirm our model's adeptness at tackling the multifaceted nature of sentiment analysis but also highlight its potential as a formidable tool for understanding and quantifying nuanced customer feedback across product domains.

Table 7 Different Models outputs for given Examples. Wrong predictions are indicated by the marker \(\times\).

Conclusion

This research presents a pioneering framework for ABSA, significantly advancing the field. The model uniquely combines a biaffine attention mechanism with an MLEGCN, adeptly handling the complexities of syntactic and semantic structures in textual data. This approach allows for precise extraction and interpretation of aspects, opinions, and sentiments. The model's proficiency in addressing all ABSA sub-tasks, including the challenging ASTE, is demonstrated through its integration of extensive linguistic features. The systematic refinement strategy further enhances its ability to align aspects with corresponding opinions, ensuring accurate sentiment analysis. Overall, this work sets a new standard in sentiment analysis, offering potential for applications such as market analysis and automated feedback systems. It paves the way for future research into combining linguistic insights with deep learning for more sophisticated language understanding.