Table 2 Confusion matrices assessing the performance of the AI models, including the realistic performance of different models in the test set.
| Â | YOLOv4 | Adaptive light-weight YOLOv4 | SSD | |||
|---|---|---|---|---|---|---|
Predicted class | Predicted class | Predicted class | ||||
Adenoma | Others | Adenoma | Others | Adenoma | Others | |
True class | ||||||
 Adenoma | 145 | 28 | 165 | 39 | 151 | 47 |
 Others | 26 | 133 | 20 | 112 | 32 | 119 |
Accuracy (%) | 92.95% | 95.00% | 92.36% | 77.68% | 88.43% | 78.41% |
FPR (%) | – | 16.35% | – | 15.15% | – | 21.19% |
| Â | Ensemble model | Proposed YOLOv3 | Proposed YOLOv4 | |||
|---|---|---|---|---|---|---|
Predicted class | Predicted class | Predicted class | ||||
Adenoma | Others | Adenoma | Others | Adenoma | Others | |
True class | Â | |||||
 Adenoma | 178 | 12 | 124 | 32 | 130 | 27 |
 Others | 9 | 101 | 26 | 138 | 20 | 102 |
Accuracy (%) | 92.70% | 81.45% | 88.54% | 78.23% | 65.69% | 53.43% |
FPR (%) | – | 8.18% | – | 15.85% | – | 16.39% |
| Â | YOLOv7 | YOLOv8 | ||||
|---|---|---|---|---|---|---|
Predicted class | Predicted class | |||||
Adenoma | Others | Adenoma | Others | |||
True class | ||||||
 Adenoma | 65 | 55 | 67 | 62 | ||
 Others | 62 | 43 | 70 | 53 | ||
Accuracy (%) | 76.47% | 68.25% | 78.82% | 84.12% | ||
FPR (%) | – | 59.05% | – | 56.91% | ||