Table 1 Summary of the quantitative evaluation of liver tumor segmentation. The mean and standard deviation of each segmentation score are given for algorithms \(A_1\) and \(A_2\). For comparison, the mean Dice score among raters is \(\overline{D(R_i,R_j)} = 0.781 \pm 0.121\). The p-values indicate the statistical significance of the differences between the two algorithms; the differences remain significant after correction for multiple testing.
| | \(D(\cdot ,R_1)\) | \(D(\cdot ,R_2)\) | \(D(\cdot ,R_3)\) | \(\overline{D(\cdot ,R_i)}\) | \(\phi (\cdot ,R_1,R_2,R_3)\) |
|---|---|---|---|---|---|
| \(A_1\) | 0.732 ± 0.210 | 0.744 ± 0.200 | 0.738 ± 0.189 | 0.738 ± 0.194 | 0.873 ± 0.082 |
| \(A_2\) | 0.689 ± 0.214 | 0.714 ± 0.195 | 0.697 ± 0.190 | 0.700 ± 0.196 | 0.852 ± 0.082 |
| p-value | 0.0017 | 0.0269 | 0.0029 | 0.0025 | 0.0022 |
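
The caption does not name the multiple-testing correction that was applied. As an illustration only, the sketch below checks the five reported p-values against a Holm step-down correction and against a plain Bonferroni correction. The choice of these two procedures, the significance level of 0.05, and the helper name `holm_reject` are assumptions made for this example and are not taken from the source.

```python
# Reported p-values from Table 1 (A_1 vs. A_2), keyed by score.
p_values = {
    "D(.,R_1)": 0.0017,
    "D(.,R_2)": 0.0269,
    "D(.,R_3)": 0.0029,
    "mean D(.,R_i)": 0.0025,
    "phi(.,R_1,R_2,R_3)": 0.0022,
}

ALPHA = 0.05  # assumed significance level, not stated in the caption


def holm_reject(p_values, alpha=ALPHA):
    """Holm step-down procedure (hypothetical helper for this sketch).

    Returns a dict mapping each score name to True if its null
    hypothesis is rejected at family-wise level alpha.
    """
    ordered = sorted(p_values.items(), key=lambda kv: kv[1])
    m = len(ordered)
    decisions = {}
    still_rejecting = True
    for rank, (name, p) in enumerate(ordered):
        # The p-value at 0-based rank k is compared with alpha / (m - k);
        # once one comparison fails, all larger p-values are retained.
        if still_rejecting and p > alpha / (m - rank):
            still_rejecting = False
        decisions[name] = still_rejecting
    return decisions


holm = holm_reject(p_values)

# Plain Bonferroni: every p-value is compared with alpha / m.
bonferroni = {name: p <= ALPHA / len(p_values) for name, p in p_values.items()}

for name in p_values:
    print(f"{name:22s} Holm: {holm[name]!s:5s} Bonferroni: {bonferroni[name]}")
```

Under these assumptions, all five differences remain significant with the Holm procedure, whereas plain Bonferroni (threshold 0.05 / 5 = 0.01) would retain the null hypothesis for \(D(\cdot ,R_2)\), whose p-value is 0.0269.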