Fig. 4: Qualitative lesion segmentation results obtained in a Turing-like test.
From: DeepISLES: a clinically validated ischemic stroke segmentation model from the ISLES'22 challenge

Neuroradiologists prefer lesions delineated by DeepISLES over manual expert delineations (sample size N = 150). Score values range between 1 and 6 (worst and best quality scenarios, respectively). Boxes show the interquartile range (IQR; 25th-75th percentiles), the center line marks the median, whiskers span values within 1.5 × IQR, and points beyond are displayed as outliers.