Table 2 Agreement measures for each pair of raters on the testing data set.
Raters | Cohen’s kappa | β (p-value) | α (p-value) |
---|---|---|---|
E1, E2 | 0.88 ± 0.05 | 1.13 (p = 7.99e−10*) | −2.30 (p = 0.141) |
E1, E3 | 0.87 ± 0.07 | 1.45 (p = 1.11e−06*) | −8.26 (p = 0.107) |
E2, E3 | 0.89 ± 0.07 | 1.29 (p = 1.15e−07*) | −5.60 (p = 0.126) |
E1, M | 0.84 ± 0.06 | 1.05 (p = 6.12e−08*) | −1.24 (p = 0.600) |
E2, M | 0.82 ± 0.08 | 0.93 (p = 4.97e−08*) | 1.01 (p = 0.648) |
E3, M | 0.82 ± 0.08 | 0.69 (p = 1.31e−05*) | 6.36 (p = 0.142) |
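
For readers unfamiliar with the first column of measures, the following is a minimal sketch of how Cohen's kappa can be computed for one pair of raters: observed agreement corrected for the agreement expected by chance from each rater's label frequencies. The function name `cohens_kappa` and the example label lists are hypothetical and are not taken from the study's data; the sketch does not reproduce the ± values or the β and α estimates reported in the table.

```python
from collections import Counter


def cohens_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa for two raters labelling the same items."""
    if len(ratings_a) != len(ratings_b) or len(ratings_a) == 0:
        raise ValueError("both raters must label the same non-empty set of items")

    n = len(ratings_a)

    # Observed agreement: fraction of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # Chance agreement: for each label, the product of the two raters'
    # marginal frequencies, summed over labels.
    count_a = Counter(ratings_a)
    count_b = Counter(ratings_b)
    expected = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)

    if expected == 1.0:
        # Degenerate case: both raters used one identical label throughout.
        return 1.0

    return (observed - expected) / (1.0 - expected)


# Hypothetical labels for illustration only (not the study's ratings).
rater_1 = [0, 0, 1, 1, 2, 2, 0, 1]
rater_2 = [0, 0, 1, 1, 2, 1, 0, 1]
print(round(cohens_kappa(rater_1, rater_2), 3))
```

A value of 1 indicates perfect agreement, 0 indicates agreement no better than chance, and negative values indicate agreement worse than chance.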