Table 5 Pairwise model comparison using McNemar’s test for binary classification.
From: Trade-offs between machine learning and deep learning for mental illness detection on social media
| Model 1 | Model 2 | Statistic | p-value | Winner | Corrected p-value | Significant |
|---|---|---|---|---|---|---|
| SVM | lightGBM | 6.75 | 0.009 | SVM | 0.037 | Yes |
| SVM | RandomForest | 9.73 | 0.001 | SVM | 0.009 | Yes |
| SVM | LogisticRegression | 13.27 | <0.001 | SVM | <0.001 | Yes |
| SVM | ALBERT | 107.57 | <0.001 | ALBERT | <0.001 | Yes |
| SVM | GRU | 21.04 | <0.001 | GRU | <0.001 | Yes |
| lightGBM | RandomForest | 0.44 | 0.508 | lightGBM | 1.000 | No |
| lightGBM | LogisticRegression | 0.00 | 0.964 | Tie | 1.000 | No |
| lightGBM | ALBERT | 159.51 | <0.001 | ALBERT | <0.001 | Yes |
| lightGBM | GRU | 49.60 | <0.001 | GRU | <0.001 | Yes |
| RandomForest | LogisticRegression | 0.26 | 0.672 | LogisticRegression | 1.000 | No |
| RandomForest | ALBERT | 158.77 | <0.001 | ALBERT | <0.001 | Yes |
| RandomForest | GRU | 54.86 | <0.001 | GRU | <0.001 | Yes |
| LogisticRegression | ALBERT | 184.25 | <0.001 | ALBERT | <0.001 | Yes |
| LogisticRegression | GRU | 43.95 | <0.001 | GRU | <0.001 | Yes |
| ALBERT | GRU | 42.05 | <0.001 | ALBERT | <0.001 | Yes |
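
For readers who want to run a comparison of this kind on their own predictions, the following is a minimal sketch of one row of such a table, not the authors' code. It assumes the chi-square form of McNemar's test with one degree of freedom, and it assumes a Holm step-down adjustment for the corrected p-values, since this excerpt does not name the correction method. The function names `mcnemar_test` and `holm_adjust`, the `continuity` flag, and the toy data are all hypothetical.

```python
import math


def mcnemar_test(y_true, pred_a, pred_b, continuity=False):
    """Chi-square form of McNemar's test for two classifiers on the same samples.

    Returns (statistic, p_value, b, c), where b counts samples that only
    model A classified correctly and c counts samples that only model B
    classified correctly. Whether to apply the continuity correction is an
    assumption left to the caller.
    """
    b = sum(1 for t, a, m in zip(y_true, pred_a, pred_b) if a == t and m != t)
    c = sum(1 for t, a, m in zip(y_true, pred_a, pred_b) if a != t and m == t)
    if b + c == 0:
        # No discordant pairs: the two models are indistinguishable here.
        return 0.0, 1.0, b, c
    diff = abs(b - c)
    if continuity:
        diff = max(diff - 1, 0)
    stat = diff ** 2 / (b + c)
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value, b, c


def holm_adjust(p_values):
    """Holm step-down adjustment (assumed here; the method is not stated)."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        candidate = min(1.0, (m - rank) * p_values[idx])
        # Enforce monotonicity: an adjusted p-value never drops below an earlier one.
        running_max = max(running_max, candidate)
        adjusted[idx] = running_max
    return adjusted


if __name__ == "__main__":
    # Toy data: model A is uniquely correct on 10 samples, model B on 2.
    y_true = [1] * 14
    pred_a = [1] * 10 + [0] * 2 + [1] * 2
    pred_b = [0] * 10 + [1] * 2 + [1] * 2
    stat, p, b, c = mcnemar_test(y_true, pred_a, pred_b)
    winner = "A" if b > c else "B" if c > b else "Tie"
    print(f"statistic={stat:.2f} p={p:.4f} winner={winner}")
    print(holm_adjust([p, 0.5, 0.2]))
```

In this sketch the "Winner" is simply the model with more discordant samples in its favour, and a pair is "Significant" when its adjusted p-value falls below the chosen threshold (0.05 would be a conventional choice).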