Fig. 5

Effect of lesion size and annotation type on performance for the best performing model (bpMRI). (A) Performance distribution stratified by dataset and lesion size (below or above median). (B) Distribution density for lesion sizes across both datasets. Circles represent the median value while black horizontal lines represent the range between the 1st and 3rd quartiles. (C) Performance distribution stratified by dataset and annotation type (whether the lesion was annotated by a radiologist or by an AI model). (D) Comparison of lesion size with Dice. Each point corresponds to a case, different shapes correspond to different annotation types. Across all plots, golden and blue correspond to PI-CAI and ProstateNet, respectively. p-values in (A,C) correspond to a two-sided Wilcoxon test.