Figure 2

Example cases of three different Likert score (5, 4, and 3) is shown for two different cohorts. Blue arrow highlights the uncertainties in boundaries between the manual and model predicted contours. For score 4 and 3 in the images of \({{\text{C}}}_{{\text{PVE}}}\), the arrow highlights the hole in segment 5–8 due to metal artifacts. In \({{\text{C}}}_{{\text{PVE}}}\), a score of 4 is given when image has a hole, but segments boundaries follow the vessels.