Extended Data Fig. 2: Individual algorithms’ agreement with the reference standard for the validation sets.
From: Artificial intelligence for diagnosis and Gleason grading of prostate cancer: the PANDA challenge

Each algorithm's concordance with the reference-standard ISUP grade group (GG) is shown for each validation set, measured as Cohen's quadratically weighted kappa with a 95% CI computed over cases. The dashed line indicates the mean kappa across all teams on that validation set.
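
For readers unfamiliar with the metric, the following is a minimal sketch of how a quadratically weighted kappa and a case-level 95% CI could be computed. It rests on assumptions that the caption does not state: grades are encoded as integers 0 to 5 (0 for benign, 1 to 5 for ISUP GG), and the CI is obtained with a percentile bootstrap that resamples cases. The function names `quadratic_weighted_kappa` and `bootstrap_ci` are hypothetical and are not taken from the challenge code.

```python
import random


def quadratic_weighted_kappa(reference, predicted, n_classes=6):
    """Cohen's kappa with quadratic weights for integer labels 0..n_classes-1.

    Assumption: 6 classes (0 = benign, 1-5 = ISUP grade groups).
    """
    if len(reference) != len(predicted) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    if any(not 0 <= x < n_classes for x in list(reference) + list(predicted)):
        raise ValueError("labels must lie in 0..n_classes-1")
    n = len(reference)

    # Observed confusion matrix (rows: reference, columns: prediction).
    observed = [[0] * n_classes for _ in range(n_classes)]
    for r, p in zip(reference, predicted):
        observed[r][p] += 1
    ref_hist = [sum(row) for row in observed]
    pred_hist = [sum(observed[i][j] for i in range(n_classes))
                 for j in range(n_classes)]

    # Weighted observed disagreement vs. disagreement expected by chance.
    num = 0.0
    den = 0.0
    for i in range(n_classes):
        for j in range(n_classes):
            w = (i - j) ** 2 / (n_classes - 1) ** 2
            num += w * observed[i][j]
            den += w * ref_hist[i] * pred_hist[j] / n
    if den == 0:
        # Undefined when both raters assign one identical class to every case.
        return float("nan")
    return 1.0 - num / den


def bootstrap_ci(reference, predicted, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap CI, resampling cases with replacement.

    Assumption: the caption says only "95% CI over cases"; the bootstrap
    is one common way to obtain such an interval.
    """
    rng = random.Random(seed)
    n = len(reference)
    stats = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        k = quadratic_weighted_kappa([reference[i] for i in idx],
                                     [predicted[i] for i in idx])
        if k == k:  # skip NaN from degenerate resamples
            stats.append(k)
    stats.sort()
    lower = stats[int((alpha / 2) * (len(stats) - 1))]
    upper = stats[int((1 - alpha / 2) * (len(stats) - 1))]
    return lower, upper


# Toy data, invented purely for illustration (not challenge results).
toy_reference = [0, 1, 2, 3, 4, 5, 1, 2, 0, 3, 5, 4]
toy_predicted = [0, 1, 2, 2, 4, 5, 2, 2, 0, 4, 5, 3]
toy_kappa = quadratic_weighted_kappa(toy_reference, toy_predicted)
toy_ci = bootstrap_ci(toy_reference, toy_predicted, n_boot=200)
```

Under the same assumptions, the dashed line would correspond to the arithmetic mean of the per-team kappa values computed in this way for one validation set.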