Table 1 Overview of the dataset composition.
Class | ||||||
---|---|---|---|---|---|---|
Source | Cancer | High-grade dysplasia | Low-grade dysplasia | Hyperplastic polyp | Normal | Total images |
Training dataset: automatic weak labels (SKET) | ||||||
Catania | 422 | 464 | 630 | 251 | 462 | 1704 |
Radboudumc | 189 | 119 | 434 | 493 | 1000 | 2065 |
Total | 611 | 583 | 1064 | 744 | 1462 | 3769 |
Training dataset: manual weak labels (ground truth) | ||||||
Catania | 379 | 454 | 529 | 181 | 438 | 1704 |
Radboudumc | 188 | 94 | 453 | 428 | 1048 | 2065 |
Total | 567 | 548 | 982 | 609 | 1486 | 3769 |
Private testing datasets | ||||||
Catania | 52 | 44 | 54 | 23 | 79 | 227 |
Radboudumc | 50 | 23 | 92 | 62 | 219 | 423 |
Total | 102 | 67 | 146 | 85 | 298 | 650 |
Public testing datasets | ||||||
GlaS36 | 91 | 0 | 0 | 42 | 133 | |
CRC37 | 69 | 0 | 0 | 71 | 140 | |
0 | 1370 | 5804 | 545 | 950 | 8669 | |
0 | 46 | 184 | 41 | 21 | 292 | |
TCGA-COAD33 | 50 | 0 | 0 | 0 | 0 | 50 |
Xu38 | 355 | 0 | 0 | 0 | 362 | 717 |
AIDA34 | 31 | 4 | 1 | 65 | 101 | |
IMP-CRC35 | 268 | 547 | 271 | 1086 | ||
Total | 11888 |