Figure 4

Tukey’s boxplot comparison of interobserver variability (IV), DLS to expert variability (DV), and previous method4 to expert variability (DVprevious), measured in symmetric mean curve distance (mm). Statistical significance measured with Wilcoxon signed-rank test. (a) Comparison of full test dataset (N = 300). (b) Device-wise comparison between the groups (N = 60 per device).