Table 1 Description of evaluation datasets

Evaluation dataset	Study in training data?	Disease in training data?	# Samples	# Slides	# Cells
Stanford-PGC	Yes	Yes	149	1	817,765
Ochsner-CRC	No	No	94	1	635,649
Tuebingen-GEJ	No	No	240	1	365,734
UChicago-DLBCL	Yes	Yes	2	1	3,099,419
Total	-	-	485	8	4,918,567

We describe the four evaluation datasets used in this study. Samples from Stanford-PGC and UChicago-DLBCL are divided into training and test splits, so both the study and the disease type are represented in the model's training data. By contrast, neither the samples nor the disease types from Ochsner-CRC and Tuebingen-GEJ are used in training. UChicago-DLBCL contains two full tissue samples, whereas the other datasets consist of tissue microarray (TMA) core samples.
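The sample and cell counts in Table 1 can be tallied programmatically as a quick consistency check. The following is a minimal sketch and not part of the study's own code. The names `TABLE_1` and `summarize` are hypothetical, the rows are transcribed by hand from the table, and the "# Slides" column is left out of this illustration.

```python
# Rows transcribed by hand from Table 1:
# (dataset, in training data?, number of samples, number of cells).
# TABLE_1 and summarize are hypothetical names used only for illustration.
TABLE_1 = [
    ("Stanford-PGC", True, 149, 817_765),
    ("Ochsner-CRC", False, 94, 635_649),
    ("Tuebingen-GEJ", False, 240, 365_734),
    ("UChicago-DLBCL", True, 2, 3_099_419),
]


def summarize(rows):
    """Return overall and held-out sample and cell counts."""
    # Held-out datasets are those whose study and disease are absent from training.
    held_out = [row for row in rows if not row[1]]
    return {
        "total_samples": sum(row[2] for row in rows),
        "total_cells": sum(row[3] for row in rows),
        "held_out_samples": sum(row[2] for row in held_out),
        "held_out_cells": sum(row[3] for row in held_out),
    }


summary = summarize(TABLE_1)
for key, value in summary.items():
    print(f"{key}: {value:,}")
```

Under these assumptions, the overall sample and cell totals agree with the "Total" row of the table, and the held-out counts cover only Ochsner-CRC and Tuebingen-GEJ.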
