Table 4. Pairwise Sørensen-Dice coefficient and Fleiss’ kappa per dataset (mean ± std) for annotators A, B, and C. The metrics were calculated over the 3D tumor volume.
| Dataset | Dice, A vs. B | Dice, A vs. C | Dice, B vs. C | Fleiss’ kappa |
|---|---|---|---|---|
| Dataset 1 | 0.908 ± 0.035 | 0.911 ± 0.031 | 0.915 ± 0.028 | 0.911 ± 0.024 |
| Dataset 2 | 0.874 ± 0.055 | 0.891 ± 0.038 | 0.881 ± 0.048 | 0.882 ± 0.040 |
| Dataset 3 | 0.870 ± 0.056 | 0.884 ± 0.053 | 0.889 ± 0.038 | 0.881 ± 0.042 |
| Dataset 4 | 0.887 ± 0.049 | 0.895 ± 0.050 | 0.900 ± 0.043 | 0.894 ± 0.042 |
| Dataset 5 | 0.931 ± 0.053 | 0.933 ± 0.042 | 0.928 ± 0.044 | 0.931 ± 0.041 |
| Dataset 6 | 0.841 ± 0.047 | 0.853 ± 0.048 | 0.872 ± 0.048 | 0.855 ± 0.041 |
| Dataset 7 | 0.938 ± 0.029 | 0.938 ± 0.024 | 0.941 ± 0.031 | 0.939 ± 0.026 |
| Dataset 8 | 0.934 ± 0.031 | 0.932 ± 0.031 | 0.921 ± 0.036 | 0.929 ± 0.029 |
| Dataset 9 | 0.956 ± 0.019 | 0.958 ± 0.010 | 0.946 ± 0.023 | 0.953 ± 0.015 |
| Dataset 10 | 0.868 ± 0.046 | 0.883 ± 0.042 | 0.858 ± 0.049 | 0.869 ± 0.040 |
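
For readers who want to reproduce agreement metrics of this kind, the following is a minimal sketch of how a pairwise Sørensen-Dice coefficient and a voxel-wise Fleiss’ kappa could be computed for one case from three binary 3D masks. It is not the code used to produce Table 4. The function names `dice` and `fleiss_kappa_binary` are hypothetical, and several choices are assumptions: masks are treated as binary (tumor vs. background), every voxel of the supplied array counts as one rated item, and two empty masks are given a Dice of 1. Because Fleiss’ kappa depends on how many background voxels are included, the region over which it is evaluated would need to match the original study to obtain comparable values.

```python
import numpy as np


def dice(a, b):
    """Sørensen-Dice coefficient between two binary masks of equal shape."""
    a = np.asarray(a, dtype=bool)
    b = np.asarray(b, dtype=bool)
    denom = a.sum() + b.sum()
    if denom == 0:
        # Assumed convention: two empty masks agree perfectly.
        return 1.0
    return 2.0 * np.logical_and(a, b).sum() / denom


def fleiss_kappa_binary(masks):
    """Voxel-wise Fleiss' kappa for a sequence of binary masks (one per rater)."""
    # Shape (n_raters, n_voxels): each voxel is one rated item.
    stack = np.stack([np.asarray(m, dtype=bool).ravel() for m in masks])
    n_raters, n_voxels = stack.shape

    # Per-voxel counts of raters choosing background (col 0) or tumor (col 1).
    fg = stack.sum(axis=0)
    counts = np.stack([n_raters - fg, fg], axis=1).astype(float)

    # Observed agreement per voxel, then averaged over voxels.
    p_i = (np.square(counts).sum(axis=1) - n_raters) / (n_raters * (n_raters - 1))
    p_bar = p_i.mean()

    # Chance agreement from the overall category proportions.
    p_j = counts.sum(axis=0) / (n_voxels * n_raters)
    p_e = np.square(p_j).sum()

    if p_e == 1.0:
        # Assumed convention: all raters used a single category everywhere.
        return 1.0
    return (p_bar - p_e) / (1.0 - p_e)
```

Dataset-level values such as those in the table would then follow from applying these functions to each case and taking `np.mean` and `np.std` over the per-case results, again assuming that is how the aggregation was done.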