Table 4 Sørensen-Dice coefficient and Fleiss’ Kappa at the dataset level (mean ± std) for annotators A, B and C. The metrics were calculated over the 3D tumor volume.

From: 3D whole body preclinical micro-CT database of subcutaneous tumors in mice with annotations from 3 annotators

| Dataset | A vs. B (Dice) | A vs. C (Dice) | B vs. C (Dice) | Fleiss’ Kappa |
|---|---|---|---|---|
| Dataset 1 | 0.908 ± 0.035 | 0.911 ± 0.031 | 0.915 ± 0.028 | 0.911 ± 0.024 |
| Dataset 2 | 0.874 ± 0.055 | 0.891 ± 0.038 | 0.881 ± 0.048 | 0.882 ± 0.040 |
| Dataset 3 | 0.870 ± 0.056 | 0.884 ± 0.053 | 0.889 ± 0.038 | 0.881 ± 0.042 |
| Dataset 4 | 0.887 ± 0.049 | 0.895 ± 0.050 | 0.900 ± 0.043 | 0.894 ± 0.042 |
| Dataset 5 | 0.931 ± 0.053 | 0.933 ± 0.042 | 0.928 ± 0.044 | 0.931 ± 0.041 |
| Dataset 6 | 0.841 ± 0.047 | 0.853 ± 0.048 | 0.872 ± 0.048 | 0.855 ± 0.041 |
| Dataset 7 | 0.938 ± 0.029 | 0.938 ± 0.024 | 0.941 ± 0.031 | 0.939 ± 0.026 |
| Dataset 8 | 0.934 ± 0.031 | 0.932 ± 0.031 | 0.921 ± 0.036 | 0.929 ± 0.029 |
| Dataset 9 | 0.956 ± 0.019 | 0.958 ± 0.010 | 0.946 ± 0.023 | 0.953 ± 0.015 |
| Dataset 10 | 0.868 ± 0.046 | 0.883 ± 0.042 | 0.858 ± 0.049 | 0.869 ± 0.040 |
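
For readers who want to reproduce metrics of this kind, the following is a minimal sketch of how a pairwise Sørensen-Dice coefficient and a Fleiss' kappa could be computed from three binary tumor masks. It is not the authors' implementation. Several points are assumptions made for illustration: each 3D mask is flattened to a sequence of 0/1 voxel labels, Fleiss' kappa is computed voxel-wise with two categories (tumor and background), and the dataset-level summary uses the sample standard deviation. The function names `dice`, `fleiss_kappa` and `summarize` are hypothetical.

```python
from statistics import mean, stdev
from typing import Sequence, Tuple


def dice(a: Sequence[int], b: Sequence[int]) -> float:
    """Sørensen-Dice coefficient between two flattened binary masks."""
    if len(a) != len(b):
        raise ValueError("masks must contain the same number of voxels")
    intersection = sum(1 for x, y in zip(a, b) if x and y)
    total = sum(1 for x in a if x) + sum(1 for y in b if y)
    # Assumed convention: two empty masks count as perfect agreement.
    return 1.0 if total == 0 else 2.0 * intersection / total


def fleiss_kappa(masks: Sequence[Sequence[int]]) -> float:
    """Voxel-wise Fleiss' kappa for n raters and two categories (assumption)."""
    n = len(masks)
    if n < 2:
        raise ValueError("at least two raters are required")
    n_voxels = len(masks[0])
    if n_voxels == 0 or any(len(m) != n_voxels for m in masks):
        raise ValueError("masks must be non-empty and of equal length")

    agreement_sum = 0.0   # sum over voxels of the per-voxel agreement P_i
    foreground_votes = 0  # total number of 'tumor' votes over all voxels
    for votes in zip(*masks):
        fg = sum(1 for v in votes if v)
        bg = n - fg
        agreement_sum += (fg * fg + bg * bg - n) / (n * (n - 1))
        foreground_votes += fg

    p_bar = agreement_sum / n_voxels           # observed agreement
    p_fg = foreground_votes / (n_voxels * n)   # overall share of 'tumor' votes
    p_e = p_fg ** 2 + (1.0 - p_fg) ** 2        # chance agreement
    if p_e == 1.0:
        # Every rater used the same single label everywhere (assumed convention).
        return 1.0
    return (p_bar - p_e) / (1.0 - p_e)


def summarize(values: Sequence[float]) -> Tuple[float, float]:
    """Mean and sample standard deviation (assumption) over per-scan values."""
    return mean(values), stdev(values)


# Toy example: a 2 x 2 x 2 volume flattened to 8 voxels, one mask per annotator.
mask_a = [1, 1, 1, 0, 0, 0, 0, 0]
mask_b = [1, 1, 0, 0, 0, 0, 0, 0]
mask_c = [1, 1, 1, 1, 0, 0, 0, 0]

print(f"Dice A vs. B: {dice(mask_a, mask_b):.3f}")
print(f"Dice A vs. C: {dice(mask_a, mask_c):.3f}")
print(f"Dice B vs. C: {dice(mask_b, mask_c):.3f}")
print(f"Fleiss' kappa: {fleiss_kappa([mask_a, mask_b, mask_c]):.3f}")
```

Under these assumptions, each row of the table would correspond to calling `dice` and `fleiss_kappa` once per scan in a dataset and then passing the per-scan values to `summarize`.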