Extended Data Fig. 2: MMTV-HER2 scRNASeq data distribution of phenotypes across clusters.

(a) Number of cells per cluster analyzed in the single-cell RNAseq of HER2-, HER2+ eL (early lungs) and LL (late lungs) DCCs (see Fig. 2b, c). (b) Number of UMIs per cluster (left) and per sample (right) analyzed in the single-cell RNAseq (see Fig. 2b, c). (c) Scatterplots of single-cell RNAseq datasets (see Fig. 2b, c) using UMAP projections, color coded by per cluster (left) and per sample (right). (d) Distribution of Epithelial (Ep) and Mesenchymal (M) scores (gene lists in Supplementary Table 4, showed in Fig. 2a) in MMTV-HER2 lung DCC clusters. Cell clusters were subgrouped as M-like (1ā4, higher M-like score), Hybrid (5ā8) and Ep-like (9ā15). (e) Distribution of gene modules B and D (M-like) in all DCC clusters. Dots represent single cells color-coded by cluster (left), sample origin (eL or LL, middle) and sub-group (Ep-like, hybrid, M-like, right). Gene module lists in Supplementary Table 4. (f) Distribution of gene modules I (Ep-like) and B (M-like) in all DCC clusters. Dots represent single cells color-coded by cluster (left), sample origin (eL or LL, middle) and sub-group (Ep-like, hybrid, M-like, right). Gene module lists in Supplementary Table 4.