Table 3 Results of different vision transformers as a classifiers against the proposed model. Here, Diet and PVT: stand for Data efficient Image Transformer and, Pyramid Vision Transformer respectively.
Model | ACC(%) | Pre(%) | Rec(%) | F1-score(%) | MCC |
|---|---|---|---|---|---|
DeiT | 88.64 | 86.15 | 84.56 | 83.94 | 0.79 |
PVT | 92.34 | 90.92 | 89.06 | 88.57 | 0.84 |
Swin Transformer | 93.71 | 90.20 | 91.00 | 93.14 | 0.81 |
The proposed method | 97.80 | 98.71 | 96.33 | 96.81 | 0.95 |