Abstract
Background
Breast cancer is a molecularly heterogeneous disease composed of multiple intrinsic subtypes. Recent studies have highlighted the substantial intratumor heterogeneity of breast cancer, wherein malignant cells of distinct intrinsic subtypes co-exist within the same tumor. However, most existing subtyping methods are designed for bulk transcriptomic data and are therefore limited in their ability to resolve such intratumor heterogeneity at single-cell resolution.
Methods
We develop UBS93, a computational framework that enables robust molecular subtyping of both bulk tumor samples and individual breast cancer cells. We rigorously validate UBS93 and demonstrate its superior performance relative to existing approaches, particularly in identifying the highly aggressive Claudin-low subtype.
Results
Applying UBS93 to single-cell RNA sequencing data from human Basal-like breast cancers, we identify the co-existence of Basal-like and Claudin-low cancer cell populations within the same tumor—a form of intratumor heterogeneity previously observed in mouse models with genetically engineered RAS pathway alterations. Further analyses suggest that Claudin-low cancer cells originate from Basal-like population, with down-regulation of transcription factor ELF3 playing a pivotal role in the Basal-like/Claudin-low transition.
Conclusions
Our findings establish UBS93 as a powerful tool for breast cancer subtyping and uncover the intratumor heterogeneity in human Basal-like breast cancer.
Plain language summary
Breast cancer is not a single disease and different subtypes of cancer cells can exist within the same tumor. Most current methods used to classify breast cancer look at tumors as a whole and cannot detect differences between individual cells. In this study, we developed a computational tool called UBS93 which can classify breast cancer using data from either whole tumor or single cells. Using UBS93, we found that two aggressive subtypes can exist together within the same tumor. We also discovered that one subtype may arise from the other through changes in gene activity. These findings improve our understanding of breast cancer diversity and may help guide more precise diagnosis and treatment in the future.
Data availability
The authors declare that all data supporting the findings of this study are available within the article and its Supplementary information files or from the corresponding author upon reasonable request. The generated HCC70 scRNA-seq data are available at Zenodo57.
Code availability
The code, which can be used to reproduce the figures, is available at GitHub58.
References
Nolan, E., Lindeman, G. J. & Visvader, J. E. Deciphering breast cancer: from biology to the clinic. Cell 186, 1708–1728 (2023).
Parker, J. S. et al. Supervised risk predictor of breast cancer based on intrinsic subtypes. J. Clin. Oncol. 27, 1160–1167 (2009).
Herschkowitz, J. I. et al. Identification of conserved gene expression features between murine mammary carcinoma models and human breast tumors. Genome Biol. 8, R76 (2007).
Prat, A. et al. Phenotypic and molecular characterization of the claudin-low intrinsic subtype of breast cancer. Breast Cancer Res. 12, R68 (2010).
Rädler, P. D. et al. Highly metastatic claudin-low mammary cancers can originate from luminal epithelial cells. Nat. Commun. 12, 3742 (2021).
Fougner, C., Bergholtz, H., Norum, J. H. & Sørlie, T. Re-definition of claudin-low as a breast cancer phenotype. Nat. Commun. 11, 1787 (2020).
Jordan, N. V. et al. HER2 expression identifies dynamic functional states within circulating breast cancer cells. Nature 537, 102–106 (2016).
Chung, W. et al. Single-cell RNA-seq enables comprehensive tumour and immune cell profiling in primary breast cancer. Nat. Commun. 8, 15081 (2017).
Azizi, E. et al. Single-cell map of diverse immune phenotypes in the breast tumor microenvironment. Cell 174, 1293–1308 (2018).
Karaayvaz, M. et al. Unravelling subclonal heterogeneity and aggressive disease states in TNBC through single-cell RNA-seq. Nat. Commun. 9, 3588 (2018).
Kim, C. et al. Chemoresistance evolution in triple-negative breast cancer delineated by single-cell sequencing. Cell 173, 879–893 (2018).
Savas, P. et al. Single-cell profiling of breast cancer T cells reveals a tissue-resident memory subset associated with improved prognosis. Nat. Med. 24, 986–993 (2018).
Gulati, G. S. et al. Single-cell transcriptional diversity is a hallmark of developmental potential. Science 367, 405–411 (2020).
Bassez, A. et al. A single-cell map of intratumoral changes during anti-PD1 treatment of patients with breast cancer. Nat. Med. 27, 820–832 (2021).
Pal, B. et al. A single-cell RNA expression atlas of normal, preneoplastic and tumorigenic states in the human breast. The EMBO journal 40, e107333 (2021).
Wu, S. Z. et al. A single-cell and spatially resolved atlas of human breast cancers. Nat. Genet. 53, 1334–1347 (2021).
Cerami, E. et al. The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2, 401–404 (2012).
Gao, J. et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci. Signal. 6, pl1 (2013).
Liberzon, A. et al. Molecular signatures database (MSigDB) 3.0. Bioinformatics 27, 1739–1740 (2011).
Yu, G., Wang, L. G., Han, Y. & He, Q. Y. clusterProfiler: an R package for comparing biological themes among gene clusters. Omics J. Integr. Biol. 16, 284–287 (2012).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-seq data with or without a reference genome. BMC Bioinform. 12, 323 (2011).
Frankish, A. et al. GENCODE: reference annotation for the human and mouse genomes in 2023. Nucleic Acids Res. 51, D942–d949 (2023).
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Zheng, G. X. et al. Massively parallel digital transcriptional profiling of single cells. Nat. Commun. 8, 14049 (2017).
Satija, R., Farrell, J. A., Gennert, D., Schier, A. F. & Regev, A. Spatial reconstruction of single-cell gene expression data. Nat. Biotechnol. 33, 495–502 (2015).
Stuart, T. et al. Comprehensive integration of single-cell data. Cell 177, 1888–1902 (2019).
Hao, Y. et al. Integrated analysis of multimodal single-cell data. Cell 184, 3573–3587 (2021).
Hao, Y. et al. Dictionary learning for integrative, multimodal and scalable single-cell analysis. Nat. Biotechnol. 42, 293–304 (2024).
The ggplot2 package https://cran.r-project.org/web/packages/ggplot2/index.html.
The pheatmap package https://github.com/raivokolde/pheatmap.
Gu, Z., Eils, R. & Schlesner, M. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics 32, 2847–2849 (2016).
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 102, 15545–15550 (2005).
Hänzelmann, S., Castelo, R. & Guinney, J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinform. 14, 7 (2013).
Barretina, J. et al. The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 483, 603–607 (2012).
Greer, Y. E. et al. Mitochondrial matrix protease ClpP agonists inhibit cancer stem cell function in breast cancer cells by disrupting mitochondrial homeostasis. Cancer Res. Commun. 2, 1144–1161 (2022).
Daemen, A. et al. Modeling precision treatment of breast cancer. Genome Biol. 14, R110 (2013).
Marcotte, R. et al. Functional genomic landscape of human breast cancer drivers, vulnerabilities, and resistance. Cell 164, 293–309 (2016).
Weinstein, J. N. et al. The Cancer Genome Atlas Pan-Cancer Analysis Project. Nat. Genet. 45, 1113–1120 (2013).
Gambardella, G. et al. A single-cell analysis of breast cancer cell lines to study tumour heterogeneity and drug response. Nat. Commun. 13, 1714 (2022).
Jovanović, B. et al. Heterogeneity and transcriptional drivers of triple-negative breast cancer. Cell Rep. 42, 113564 (2023).
Hollern, D. P. et al. B cells and T follicular helper cells mediate response to checkpoint inhibitors in high mutation burden mouse models of breast cancer. Cell 179, 1191–1206 (2019).
Gendoo, D. M. et al. Genefu: an R/bioconductor package for computation of gene expression-based signatures in breast cancer. Bioinformatics 32, 1097–1099 (2016).
Pan, C. et al. Research progress of claudin-low breast cancer. Front. Oncol. 13, 1226118 (2023).
Dias, K. et al. Claudin-low breast cancer; clinical & pathological characteristics. PLoS ONE 12, e0168669 (2017).
Li, W. et al. Unraveling the roles of CD44/CD24 and ALDH1 as cancer stem cell markers in tumorigenesis and metastasis. Sci. Rep. 7, 13856 (2017).
Reddy, T. P. et al. A comprehensive overview of metaplastic breast cancer: clinical features and molecular aberrations. Breast Cancer Res. 22, 121 (2020).
Weigelt, B. et al. Metaplastic breast carcinomas display genomic and transcriptomic heterogeneity [corrected]. Mod. Pathol. 28, 340–351 (2015).
Pal, B. et al. A single-cell RNA expression atlas of normal, preneoplastic and tumorigenic states in the human breast. EMBO J. 40, e107333 (2021).
Creighton, C. J. et al. Residual breast cancers after conventional therapy display mesenchymal as well as tumor-initiating features. Proc. Natl. Acad. Sci. USA 106, 13820–13825 (2009).
Pommier, R. M. et al. Comprehensive characterization of claudin-low breast tumors reflects the impact of the cell-of-origin on cancer evolution. Nat. Commun. 11, 3431 (2020).
Park, S. & Kwon, Y. The role of ELF3 in metastasis of triple negative breast cancer. Ann. Oncol. 28, x34 (2017).
Kuang, L. & Li, L. E74-like factor 3 suppresses microRNA-485-5p transcription to trigger growth and metastasis of ovarian cancer cells with the involvement of CLDN4/Wnt/β-catenin axis. Saudi J. Biol. Sci. 28, 4137–4146 (2021).
Zhang, X. et al. Single-cell transcriptomics reveal metastatic CLDN4+ cancer cells underlying the recurrence of malignant pleural effusion in patients with advanced non-small-cell lung cancer. Clin. Transl. Med. 14, e1649 (2024).
Liu, K. et al. Evaluating cell lines as models for metastatic breast cancer through integrative analysis of genomic data. Nat. Commun. 10, 2138 (2019).
Park, M. et al. Breast cancer metastasis: mechanisms and therapeutic implications. Int. J. Mol. Sci. 23, 6806 (2022).
Liu, K. HCC70 scRNA-seq data. Zenodo https://doi.org/10.5281/zenodo.15482443 (2025).
bioklab. UBS93. GitHub https://github.com/bioklab/UBS93 (2025).
Acknowledgements
We thank Dr. Xiaohui Lin for helping us revise the manuscript. The research was supported by the National Natural Science Foundation of China (Fund 32370715, 82330108), the Science Fund for Distinguished Oversea Young Scholars of Shandong Province (2023HWYQ-015), the Taishan Young Scholar Program of Shandong Province (tsqn202312020), and the Cheeloo Young Scholar Program of Shandong University. The content is solely the responsibility of the authors and does not necessarily represent the official views of sponsors.
Author information
Authors and Affiliations
Contributions
J.L., K.L. conceived the study. J.L. performed the majority of computational analysis, Y.G., W.X.Y. performed all the experimental assays, K.L., S.Z.Z., and W.X.Y. supervised the study. S.R.Z., K.L. revised the manuscript. All authors contributed to writing, reviewing, and editing the manuscript and approved the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Medicine thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Li, J., Gao, Y., Zhang, S. et al. Intratumor heterogeneity in human basal like breast cancer is revealed by single cell molecular subtyping. Commun Med (2026). https://doi.org/10.1038/s43856-026-01548-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s43856-026-01548-z