Extended Data Fig. 5: Model performance with reduced downstream training dataset.
From: Benchmarking foundation models as feature extractors for weakly supervised computational pathology

Mean AUROC across all five folds on 29 tasks for all foundation models, with the downstream training dataset reduced to 75 (A), 150 (B), or 300 (C) patients. Patients were randomly selected from the TCGA cohorts such that the ground truth was defined for every analyzed task. The Lauren tasks in Kiel and Bern were excluded due to insufficient patient numbers.
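
The subsampling and averaging described in this caption can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the function names, the `labels` layout (patient ID mapped to a per-task label dictionary, with `None` marking undefined ground truth), and the seed handling are all assumptions made for the example.

```python
import random
from statistics import mean


def subsample_patients(labels, n_patients, seed=0):
    """Randomly pick n_patients whose ground truth is defined for every task.

    `labels` is a hypothetical mapping: patient ID -> {task name: label or None}.
    """
    eligible = sorted(
        pid for pid, tasks in labels.items()
        if all(value is not None for value in tasks.values())
    )
    if len(eligible) < n_patients:
        raise ValueError("not enough patients with defined ground truth")
    return random.Random(seed).sample(eligible, n_patients)


def auroc(y_true, y_score):
    """AUROC as the fraction of positive/negative pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes are required")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def mean_auroc(fold_results):
    """Average AUROC over folds; each fold is a (y_true, y_score) pair."""
    return mean(auroc(y_true, y_score) for y_true, y_score in fold_results)
```

Under these assumptions, calling `subsample_patients` with `n_patients` set to 75, 150, or 300 would produce the reduced training sets for the three panels, and `mean_auroc` would be applied to the five per-fold results of each task.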