Fig. 7: Performance of tumor type classifier and validation for diagnostic markers.

a Schematic of random-forest classifier. Seventy five percent patients of pan-SCC cohort were used to train the classifier. b Performance of the classifier across 17 SCCs in validation set. True (established) cancer types are displayed horizontally and predicted cancer types are displayed vertically. The number of tumors of each cancer type in the cohort is shown at the top, and sensitivity and specificity of the predictions are indicated at the top and right. c The heatmap of 19 proteins used in the classifier and annotation of cellular component, TF, kinase, phosphatase, and drug target are provided. d Haematoxylin and eosin (H&E) staining, immunohistochemistry (IHC) staining of P63, P16, PRKCE, SLC27A1, and CPXM2, and in situ hybridization (ISH) of EBER in representative samples of partial SCC types (originating organ includes: esophagus, nasopharynx, thymus, lung, pancreas, gallbladder, cervix, and vagina). Scale bar, 100 ÎĽm.