Table 1 Datasets used in this study.

From: A cross-cohort computational framework to trace tumor tissue-of-origin based on RNA sequencing

| Abbreviation | TCGA primary | TCGA metastatic | ICGC | Cancer name |
|---|---|---|---|---|
| ACC | 79 | 0 | 0 | Adrenocortical carcinoma |
| BLCA | 414 | 0 | 0 | Bladder urothelial carcinoma |
| BRCA | 1102 | 7 | 50 | Breast invasive carcinoma |
| CESC | 304 | 2 | 0 | Cervical squamous cell carcinoma and endocervical adenocarcinoma |
| CHOL | 36 | 0 | 0 | Cholangiocarcinoma |
| COAD | 478 | 1 | 0 | Colon adenocarcinoma |
| DLBC | 48 | 0 | 107 | Lymphoid neoplasm diffuse large B-cell lymphoma |
| ESCA | 161 | 1 | 0 | Esophageal carcinoma |
| GBM | 156 | 0 | 0 | Glioblastoma multiforme |
| HNSC | 500 | 2 | 40 | Head and neck squamous cell carcinoma |
| KICH | 65 | 0 | 0 | Kidney chromophobe |
| KIRC | 538 | 0 | 136 | Kidney renal clear cell carcinoma |
| KIRP | 288 | 0 | 0 | Kidney renal papillary cell carcinoma |
| LAML | 151 | 0 | 323 | Acute myeloid leukemia |
| LGG | 511 | 0 | 0 | Brain lower grade glioma |
| LIHC | 371 | 0 | 606 | Liver hepatocellular carcinoma |
| LUAD | 533 | 0 | 0 | Lung adenocarcinoma |
| LUSC | 502 | 0 | 0 | Lung squamous cell carcinoma |
| MESO | 86 | 0 | 0 | Mesothelioma |
| OV | 374 | 0 | 111 | Ovarian serous cystadenocarcinoma |
| PAAD | 177 | 1 | 389 | Pancreatic adenocarcinoma |
| PCPG | 178 | 2 | 0 | Pheochromocytoma and paraganglioma |
| PRAD | 498 | 1 | 169 | Prostate adenocarcinoma |
| READ | 166 | 0 | 0 | Rectum adenocarcinoma |
| SARC | 259 | 1 | 57 | Sarcoma |
| SKCM | 103 | 367 | 0 | Skin cutaneous melanoma |
| STAD | 375 | 0 | 0 | Stomach adenocarcinoma |
| TGCT | 150 | 0 | 0 | Testicular germ cell tumors |
| THCA | 502 | 8 | 0 | Thyroid carcinoma |
| THYM | 119 | 0 | 0 | Thymoma |
| UCEC | 551 | 0 | 0 | Uterine corpus endometrial carcinoma |
| UCS | 56 | 0 | 0 | Uterine carcinosarcoma |
| UVM | 80 | 0 | 0 | Uveal melanoma |
| Sum | 9911 | 393 | 1988 | Sum of all cancers |
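
The totals in the Sum row can be checked directly against the per-cancer counts. The following is a minimal sketch that transcribes the counts from Table 1 and recomputes each column total. The names `ROWS`, `REPORTED`, and `column_totals` are illustrative and do not come from the paper or its code.

```python
# Rows transcribed from Table 1:
# (abbreviation, TCGA primary, TCGA metastatic, ICGC).
ROWS = [
    ("ACC", 79, 0, 0), ("BLCA", 414, 0, 0), ("BRCA", 1102, 7, 50),
    ("CESC", 304, 2, 0), ("CHOL", 36, 0, 0), ("COAD", 478, 1, 0),
    ("DLBC", 48, 0, 107), ("ESCA", 161, 1, 0), ("GBM", 156, 0, 0),
    ("HNSC", 500, 2, 40), ("KICH", 65, 0, 0), ("KIRC", 538, 0, 136),
    ("KIRP", 288, 0, 0), ("LAML", 151, 0, 323), ("LGG", 511, 0, 0),
    ("LIHC", 371, 0, 606), ("LUAD", 533, 0, 0), ("LUSC", 502, 0, 0),
    ("MESO", 86, 0, 0), ("OV", 374, 0, 111), ("PAAD", 177, 1, 389),
    ("PCPG", 178, 2, 0), ("PRAD", 498, 1, 169), ("READ", 166, 0, 0),
    ("SARC", 259, 1, 57), ("SKCM", 103, 367, 0), ("STAD", 375, 0, 0),
    ("TGCT", 150, 0, 0), ("THCA", 502, 8, 0), ("THYM", 119, 0, 0),
    ("UCEC", 551, 0, 0), ("UCS", 56, 0, 0), ("UVM", 80, 0, 0),
]

# Totals as reported in the "Sum" row of Table 1.
REPORTED = (9911, 393, 1988)


def column_totals(rows):
    """Return (primary, metastatic, icgc) totals over all rows."""
    primary = sum(r[1] for r in rows)
    metastatic = sum(r[2] for r in rows)
    icgc = sum(r[3] for r in rows)
    return primary, metastatic, icgc


totals = column_totals(ROWS)

# Cancer types that have at least one ICGC sample.
icgc_cohorts = [r[0] for r in ROWS if r[3] > 0]

print("computed totals:", totals)
print("matches reported Sum row:", totals == REPORTED)
print("cancer types with ICGC samples:", len(icgc_cohorts))
```

The recomputed totals agree with the Sum row. The same list also shows that only 10 of the 33 cancer types have ICGC samples, and that 367 of the 393 TCGA metastatic samples are SKCM.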