Fig. 2: The 33 gene sets that were enriched according to DEPICT were clustered into ten biological themes.

The reconstituted gene sets were named after the predefined gene sets used in the development of DEPICT, and source databases are reported in brackets. The nominal enrichment p for each reconstituted gene set as reported by DEPICT is reported in brackets, with emphasis added for gene sets with p < 1 × 10−4. Clustering into biological themes was performed using affinity propagation clustering after calculating pairwise Pearson correlation between all enriched gene sets, as depicted in the bubble chart. CEBPB: CCAAT/enhancer binding protein beta, CTNNB1: catenin beta 1, DAG: diacylglycerol, EGFR: epidermal growth factor receptor, ENSG: Ensembl gene, ERBB2: erb-b2 receptor tyrosine kinase 2, GO: gene ontology, IP3: inositol triphosphate, KEGG: Kyoto Encyclopedia of Genes and Genomes, MAP: mitogen-activated protein kinase, MAP2K1: MAP kinase kinase 1, MP: mammalian phenotype ontology, NAT9: N-acetyltransferase 9, NGF: nerve growth factor, NOD1: nucleotide binding oligomerization domain containing 1, PLCG1: phospholipase C, gamma 1, RAC2: ras-related C3 botulinum toxin substrate 2, RAC3: ras-related C3 botulinum toxin substrate 3, RAF1: Raf-1 proto-oncogene, serine/threonine kinase, TCR: T cell receptor; TGOLN2: trans-golgi network protein 2, TRKA: tropomyosin receptor kinase A.