Fig. 2: The different subpopulations and gene pathways identified in copy number high C3Tag cancer cells across the disease states.

a UMAP plot of 2025 copy number high cancer cells with RNAseq data colored by the disease state (Top panel) and by the cell populations identified (Bottom panel). b Heatmap of top 10 significant upregulated genes identified per cancer cell subpopulation using the Wilcoxon rank sum test. c UMAP plots highlighting significant breast cancer gene signatures from Fan et al.26 enriched in specific cancer cell subpopulations: Glycolysis and Hypoxia gene signature for subpopulation 0, Proliferation gene signature for subpopulation 3, and Interferon gene signature for subpopulation 4. d Heatmap of top 10 significant upregulated genes identified in cancer cells per Prepuberty, DCIS, Tumor1 (KRAS amplified) and Tumor2 disease states by Wilcoxon rank sum test. e Violin plots of NFKB pathway, KRAS pathway, and RHOA pathway scores in cancer cells from Prepuberty, DCIS, Tumor1 (KRAS amplified), and Tumor 2 states. Source data are provided as a Source Data file 2.