Table 1 Sources of data.

From: Consistently processed RNA sequencing data from 50 sources enriched for pediatric data

Source name

n

percent of all datasets

The Cancer Genome Atlas Program (TCGA)15

9806

59.6%

St. Jude34,35,36,37

2142

13.0%

Therapeutically Applicable Research to Generate Effective Treatments (TARGET)16,17,18,19,20

1356

8.2%

Cancer Cell Line Encyclopedia (CCLE)38

893

5.4%

Children’s Brain Tumor Network/Kids First Data Resource Center39,40

377

2.3%

International Cancer Genome Consortium (ICGC)41,42,43,44

191

1.2%

Clinical collaborators

677

4.1%

Other projects via database of Genotypes and Phenotypes (dbGaP) /Short Read Archive (SRA)45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64

801

4.9%

Other projects via European Genome-phenome Archive (EGA)65,66,67,68,69,70,71

203

1.2%