Table 3 Evaluation (Precision, Recall, and F-1 score) of five popular libraries and a few heuristics based on human perspective. The best and second-best results are highlighted in bold and italics.

From: A bimodal longitudinal investigation on changes in sentiments over social media interactions owing to COVID-19 pandemic

Library/Heuristic

Precision

Recall

F-1 Score

Accuracy (%)

Negative

Neutral

Positive

Negative

Neutral

Positive

Negative

Neutral

Positive

L-1: VADER

0.73

0.57

0.73

0.76

0.79

0.26

0.74

0.66

0.39

66

L-2: Afinn

0.71

0.64

0.50

0.79

0.54

0.48

0.75

0.59

0.49

64

L-3: TwitterSentiment

0.55

0.50

0.25

0.76

0.01

0.41

0.63

0.01

0.31

44

L-4: Transformer

0.58

0.83

0.52

0.96

0.03

0.54

0.72

0.06

0.53

57

L-5: TextBlob

0.90

0.40

0.44

0.43

0.59

0.59

0.58

0.48

0.5

52

H-1: Majority voting on all five (L-1 to L-5)

0.70

0.60

0.56

0.81

0.52

0.51

0.75

0.56

0.53

65

H-2: Majority voting on L-1, L-2, and L-4. In case of a tie, decide neutral based on only L-4; otherwise, decide based on only L-1

0.69

0.68

0.68

0.64

0.59

0.64

0.76

0.56

0.54

64

H-3: Majority voting on L-1, L-2, and L-4. In case of a tie, decide neutral based on only L-4; otherwise, decide based on only L-2

0.70

0.63

0.67

0.83

0.61

0.46

0.76

0.62

0.54

68