Extended Data Fig. 10: Cohen’s kappa scores across all ensembles and their individual model components.
From: Benchmarking foundation models as feature extractors for weakly supervised computational pathology

Objective measure of similarity of prediction scores using Cohen’s Kappa and majority vote across the five folds to binarize the predictions. The concatenated versions of CONCH, Virchow2 (V2), Prov-GigaPath (GP), H-optimus-0 (HO0), UNI and DinoSSLPath (Dino) and their single model counterparts are shown.