Abstract
Somatic mutations such as copy number alterations accumulate during cancer progression, driving intratumor heterogeneity that impacts therapy effectiveness. Understanding the characteristics and spatial distribution of genetically distinct subclones is essential for unraveling tumor evolution and improving cancer treatment. Here we present Clonalscope, a subclone detection method using copy number profiles, applicable to spatial transcriptomics and single-cell sequencing data. Clonalscope implements a nested Chinese Restaurant Process to identify de novo tumor subclones, which can incorporate prior information from matched bulk DNA sequencing data for improved subclone detection and malignant cell labeling. On single-cell RNA sequencing and single-cell assay for transposase-accessible chromatin using sequencing data from gastrointestinal tumors, Clonalscope successfully labeled malignant cells and identified genetically different subclones with thorough validations. On spatial transcriptomics data from various primary and metastasized tumors, Clonalscope labeled malignant spots, traced subclones and identified spatially segregated subclones with distinct differentiation levels and expression of genes associated with drug resistance and survival.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$32.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 print issues and online access
$259.00 per year
only $21.58 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to the full article PDF.
USD 39.95
Prices may be subject to local taxes which are calculated during checkout






Similar content being viewed by others
Data availability
The patient scRNA-seq and WGS data generated for this study are available under database of Genotypes and Phenotypes (dbGAP) identifier phs001818. Data are available with dbGaP authorized access for health/medical/biomedical purposes. The other patient scRNA-seq data were obtained from dbGAP under accession phs001818 (refs. 31,51). The patient scDNA-seq data were from dbGAP under accession phs001711 (ref. 9) and accession phs001818 (ref. 51). For the SNU601 sample, scDNA-seq data and scATAC–seq data were retrieved from the Sequence Read Archive under accession PRJNA598203 (ref. 60) and accession PRJNA674903 (ref. 9), respectively. For P9962-23049B and P9962-23050D, the ST data and WGS were uploaded to dbGAP identifier phs001818. For the V11Y04-378-A1 sample, the ST data were uploaded to GSE284061. For the mouse Slide-seq and Slide-DNA-seq30, the data were retrieved from PRJNA768453 and https://singlecell.broadinstitute.org/single_cell/study/SCP1278. The ST and WGS data of the SCC P6 were provided and retrieved from GSE144240 (ref. 23). For ST-colon 1 and ST-liver 1 (ref. 32), the data were retrieved from OEP001756 and http://www.cancerdiversity.asia/scCRLM. The three ST datasets of breast cancer were retrieved from 10x Genomics datasets as in Supplementary Table 3.
Code availability
Clonalscope is available via GitHub at https://github.com/seasoncloud/Clonalscope.
References
Magrangeas, F. et al. Minor clone provides a reservoir for relapse in multiple myeloma. Leukemia 27, 473–481 (2013).
Ding, L. et al. Clonal evolution in relapsed acute myeloid leukaemia revealed by whole-genome sequencing. Nature 481, 506–510 (2012).
Wang, K. et al. Genomic landscape of copy number aberrations enables the identification of oncogenic drivers in hepatocellular carcinoma. Hepatology 58, 706–717 (2013).
Beroukhim, R. et al. The landscape of somatic copy-number alteration across human cancers. Nature 463, 899–905 (2010).
Futreal, P. A. et al. A census of human cancer genes. Nat. Rev. Cancer 4, 177–183 (2004).
Kim, C. et al. Chemoresistance evolution in triple-negative breast cancer delineated by single-cell sequencing. Cell 173, 879–893 (2018).
Laks, E. et al. Clonal decomposition and DNA replication states defined by scaled single-cell genome sequencing. Cell 179, 1207–1221 (2019).
Navin, N. et al. Tumour evolution inferred by single-cell sequencing. Nature 472, 90–94 (2011).
Wu, C.-Y. et al. Integrative single-cell analysis of allele-specific copy number alterations and chromatin accessibility in cancer. Nat. Biotechnol. 39, 1259–1269 (2021).
Niknafs, N., Beleva-Guthrie, V., Naiman, D. Q. & Karchin, R. Subclonal hierarchy inference from somatic mutations: automatic reconstruction of cancer evolutionary trees from multi-region next generation sequencing. PLoS Comput. Biol. 11, e1004416 (2015).
Yuan, K., Sakoparnig, T., Markowetz, F. & Beerenwinkel, N. BitPhylogeny: a probabilistic framework for reconstructing intra-tumor phylogenies. Genome Biol. 16, 36 (2015).
Deshwar, A. G. et al. PhyloWGS: reconstructing subclonal composition and evolution from whole-genome sequencing of tumors. Genome Biol. 16, 35 (2015).
El-Kebir, M., Satas, G., Oesper, L. & Raphael, B. J. Inferring the mutational history of a tumor using multi-state perfect phylogeny mixtures. Cell Syst. 3, 43–53 (2016).
Jiang, Y., Qiu, Y., Minn, A. J. & Zhang, N. R. Assessing intratumor heterogeneity and tracking longitudinal and spatial clonal evolutionary history by next-generation sequencing. Proc. Natl Acad. Sci. USA 113, E5528–E5537 (2016).
Gao, R. et al. Delineating copy number and clonal substructure in human tumors from single-cell transcriptomes. Nat. Biotechnol. 39, 599–608 (2021).
Patel, A. P. et al. Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastoma. Science 344, 1396–1401 (2014).
Serin Harmanci, A., Harmanci, A. O. & Zhou, X. CaSpER identifies and visualizes CNV events by integrative analysis of single-cell or bulk RNA-sequencing data. Nat. Commun. 11, 89 (2020).
Fan, J. et al. Linking transcriptional and genetic tumor heterogeneity through allele analysis of single-cell RNA-seq data. Genome Res. 28, 1217–1227 (2018).
Müller, S., Cho, A., Liu, S. J., Lim, D. A. & Diaz, A. CONICS integrates scRNA-seq with DNA sequencing to map gene expression to tumor sub-clones. Bioinformatics 34, 3217–3219 (2018).
Granja, J. M. et al. Single-cell multiomic analysis identifies regulatory programs in mixed-phenotype acute leukemia. Nat. Biotechnol. 37, 1458–1465 (2019).
Satpathy, A. T. et al. Massively parallel single-cell chromatin landscapes of human immune cell development and intratumoral T cell exhaustion. Nat. Biotechnol. 37, 925–936 (2019).
Ståhl, P. L. et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science 353, 78–82 (2016).
Ji, A. L. et al. Multimodal analysis of composition and spatial architecture in human squamous cell carcinoma. Cell 182, 497–514 (2020).
Erickson, A. et al. Spatially resolved clonal copy number alterations in benign and malignant tissue. Nature 608, 360–367 (2022).
Elyanow, R., Zeira, R., Land, M. & Raphael, B. J. STARCH: copy number and clone inference from spatial transcriptomics data. Phys. Biol. 18, 035001 (2021).
Zafar, H., Navin, N., Chen, K. & Nakhleh, L. SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data. Genome Res. 29, 1847–1859 (2019).
Borgsmüller, N. et al. BnpC: Bayesian non-parametric clustering of single-cell mutation profiles. Bioinformatics 36, 4854–4859 (2020).
Myhre, S. et al. Influence of DNA copy number and mRNA levels on the expression of breast cancer related proteins. Mol. Oncol. 7, 704–718 (2013).
Campbell, K. R. et al. clonealign: statistical integration of independent single-cell RNA and DNA sequencing data from human cancers. Genome Biol. 20, 54 (2019).
Zhao, T. et al. Spatial genomics enables multi-modal study of clonal heterogeneity in tissues. Nature 601, 85–91 (2022).
Sathe, A. et al. Colorectal cancer metastases in the liver establish immunosuppressive spatial networking between tumor-associated SPP1+ macrophages and fibroblasts. Clin. Cancer Res. 29, 244–260 (2023).
Wu, Y. et al. Spatiotemporal immune landscape of colorectal cancer liver metastasis at single-cell level. Cancer Discov. 12, 134–153 (2022).
Cancer Genome Atlas Network. Comprehensive molecular characterization of human colon and rectal cancer. Nature 487, 330–337 (2012).
Carrasco-Garcia, E. et al. SOX9-regulated cell plasticity in colorectal metastasis is attenuated by rapamycin. Sci. Rep. 6, 32350 (2016).
Liu, J. et al. Immune landscape and prognostic immune-related genes in KRAS-mutant colorectal cancer patients. J. Transl. Med. 19, 27 (2021).
Sun, M., Sun, T., He, Z. & Xiong, B. Identification of two novel biomarkers of rectal carcinoma progression and prognosis via co-expression network analysis. Oncotarget 8, 69594–69609 (2017).
Schatoff, E. M., Leach, B. I. & Dow, L. E. WNT signaling and colorectal cancer. Curr. Colorectal Cancer Rep. 13, 101–110 (2017).
Zhu, H. et al. IGFBP2 promotes the EMT of colorectal cancer cells by regulating E-cadherin expression. Int. J. Clin. Exp. Pathol. 12, 2559–2565 (2019).
Liu, Y.-L. et al. Ligustrazine reverts anthracycline chemotherapy resistance of human breast cancer by inhibiting JAK2/STAT3 signaling and decreasing fibrinogen gamma chain (FGG) expression. Am. J. Cancer Res. 10, 939–952 (2020).
Akkiprik, M., Hu, L., Sahin, A., Hao, X. & Zhang, W. The subcellular localization of IGFBP5 affects its cell growth and migration functions in breast cancer. BMC Cancer 9, 103 (2009).
Vieira, A. F. & Schmitt, F. An update on breast cancer multigene prognostic tests-emergent clinical biomarkers. Front. Med. 5, 248 (2018).
Ai, L. et al. The transglutaminase 2 gene (TGM2), a potential molecular marker for chemotherapeutic drug sensitivity, is epigenetically silenced in breast cancer. Carcinogenesis 29, 510–518 (2008).
Wang, Y. et al. Nicotinamide N-methyltransferase enhances chemoresistance in breast cancer through SIRT1 protein stabilization. Breast Cancer Res. 21, 64 (2019).
Bhattacharya, A. et al. A framework for transcriptome-wide association studies in breast cancer in diverse study populations. Genome Biol. 21, 42 (2020).
Zhu, C., Preissl, S. & Ren, B. Single-cell multimodal omics: the power of many. Nat. Methods 17, 11–14 (2020).
Cable, D. M. et al. Robust decomposition of cell type mixtures in spatial transcriptomics. Nat. Biotechnol. 40, 517–526 (2022).
Elosua-Bayes, M., Nieto, P., Mereu, E., Gut, I. & Heyn, H. SPOTlight: seeded NMF regression to deconvolute spatial transcriptomics spots with single-cell transcriptomes. Nucleic Acids Res. 49, e50 (2021).
Dong, R. & Yuan, G.-C. SpatialDWLS: accurate deconvolution of spatial transcriptomic data. Genome Biol. 22, 145 (2021).
Kleshchevnikov, V. et al. Cell2location maps fine-grained cell types in spatial transcriptomics. Nat. Biotechnol. 40, 661–671 (2022).
Bai, X., Lau, B., Grimes, S. M., Sathe, A. & Ji, H. P. Single cell multi-omic mapping of subclonal architecture and pathway phenotype in primary gastric and metastatic colon cancers. Preprint at bioRxiv https://doi.org/10.1101/2022.07.03.498616 (2022).
Sathe, A. et al. Single-cell genomic characterization reveals the cellular reprogramming of the gastric tumor microenvironment. Clin. Cancer Res. 26, 2640–2653 (2020).
Sathe, A. et al. GITR and TIGIT immunotherapy provokes divergent multicellular responses in the tumor microenvironment of gastrointestinal cancers. Genome Med. 15, 100 (2023).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Mason, K. et al. Niche-DE: niche-differential gene expression analysis in spatial transcriptomics data identifies context-dependent cell-cell interactions. Genome Biol. 25, 14 (2024).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Packer, J. S. et al. A lineage-resolved molecular atlas of C. elegans embryogenesis at single-cell resolution. Science 365, eaax1971 (2019).
Hao, Y. et al. Integrated analysis of multimodal single-cell data. Cell 184, 3573–3587 (2021).
Hinrichs, A. S. et al. The UCSC Genome Browser Database: update 2006. Nucleic Acids Res. 34, D590–D598 (2006).
Andor, N. et al. Joint single cell DNA-seq and RNA-seq of gastric cancer cell lines reveals rules of in vitro evolution. NAR Genom. Bioinform. 2, lqaa016 (2020).
Acknowledgements
This work is supported by the National Institutes of Health (P01HG00205ESH to B.T.L., S.M.G. and H.P.J. and 5R01-HG006137-07 and 1U2CCA233285-01 to C.-Y.W. and N.R.Z.). Additional support to H.P.J. came from the Research Scholar Grant (RSG-13-297-01-TBG) from the American Cancer Society, the Clayville Foundation and the Gastric Cancer Foundation.
Author information
Authors and Affiliations
Contributions
C.-Y.W. and N.R.Z. conceived the computational methods and designed the study, with help from H.P.J. C.-Y.W. developed and implemented the computational methods. C.-Y.W. and J.R. conducted the data analyses. A.S. performed all related sample preparation and sequencing. J.R. and S.H. helped with benchmarking and improvement of the analysis pipeline. P.R.H. helped with pathology annotation and data interpretation. B.T.L. helped in data interpretation. S.M.G. performed data preprocessing and coordinated data transfer. H.P.J. advised on all experiments and data collection. C.-Y.W., J.R. and N.R.Z. wrote the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Methods thanks Chenfei Wang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available. Primary Handling Editor: Lin Tang, in collaboration with the Nature Methods team.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Subclone detection and differential gene analysis an invasive ductal carcinoma sample.
a. Subclones detected from Clonalscope on the ST dataset from an invasive ductal carcinoma sample and paired spot-level pathologist annotation. b. The mean copy number profiles across the cells assigned to the three spatially segregated subclones (bottom), and the density plots of four regions with differences in the copy number states for at least one of the three subclones (top). c, d. show the top 6 highly expressed genes for tumor subclones 2 and 3. The color scale indicates log-normalized UMI counts for each gene.
Supplementary information
Supplementary Information (download PDF )
Supplementary Figs. 1–26, Tables 1–5 and Notes.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wu, CY., Rong, J., Sathe, A. et al. Cancer subclone detection based on DNA copy number in single-cell and spatial omic sequencing data. Nat Methods 22, 1846–1856 (2025). https://doi.org/10.1038/s41592-025-02773-5
Received:
Accepted:
Published:
Version of record:
Issue date:
DOI: https://doi.org/10.1038/s41592-025-02773-5
This article is cited by
-
A tale of tumors
Nature Methods (2025)


