Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Cancer subclone detection based on DNA copy number in single-cell and spatial omic sequencing data

Abstract

Somatic mutations such as copy number alterations accumulate during cancer progression, driving intratumor heterogeneity that impacts therapy effectiveness. Understanding the characteristics and spatial distribution of genetically distinct subclones is essential for unraveling tumor evolution and improving cancer treatment. Here we present Clonalscope, a subclone detection method using copy number profiles, applicable to spatial transcriptomics and single-cell sequencing data. Clonalscope implements a nested Chinese Restaurant Process to identify de novo tumor subclones, which can incorporate prior information from matched bulk DNA sequencing data for improved subclone detection and malignant cell labeling. On single-cell RNA sequencing and single-cell assay for transposase-accessible chromatin using sequencing data from gastrointestinal tumors, Clonalscope successfully labeled malignant cells and identified genetically different subclones with thorough validations. On spatial transcriptomics data from various primary and metastasized tumors, Clonalscope labeled malignant spots, traced subclones and identified spatially segregated subclones with distinct differentiation levels and expression of genes associated with drug resistance and survival.

This is a preview of subscription content, access via your institution

Access options

Buy this article

USD 39.95

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Overview of subclone detection with Clonalscope.
The alternative text for this image may have been generated using AI.
Fig. 2: Subclone detection and malignant cell labeling for scRNA-seq data validated by matched scDNA-seq data.
The alternative text for this image may have been generated using AI.
Fig. 3: Subclone detection based on coverage and allelic ratio in the scATAC–seq data from the SNU601 gastric cancer cell line.
The alternative text for this image may have been generated using AI.
Fig. 4: Subclone detection and malignant spot labeling in ST data.
The alternative text for this image may have been generated using AI.
Fig. 5: Subclone tracing on ST data.
The alternative text for this image may have been generated using AI.
Fig. 6: Subclone detection and differential gene analysis on a CRC sample.
The alternative text for this image may have been generated using AI.

Similar content being viewed by others

Data availability

The patient scRNA-seq and WGS data generated for this study are available under database of Genotypes and Phenotypes (dbGAP) identifier phs001818. Data are available with dbGaP authorized access for health/medical/biomedical purposes. The other patient scRNA-seq data were obtained from dbGAP under accession phs001818 (refs. 31,51). The patient scDNA-seq data were from dbGAP under accession phs001711 (ref. 9) and accession phs001818 (ref. 51). For the SNU601 sample, scDNA-seq data and scATAC–seq data were retrieved from the Sequence Read Archive under accession PRJNA598203 (ref. 60) and accession PRJNA674903 (ref. 9), respectively. For P9962-23049B and P9962-23050D, the ST data and WGS were uploaded to dbGAP identifier phs001818. For the V11Y04-378-A1 sample, the ST data were uploaded to GSE284061. For the mouse Slide-seq and Slide-DNA-seq30, the data were retrieved from PRJNA768453 and https://singlecell.broadinstitute.org/single_cell/study/SCP1278. The ST and WGS data of the SCC P6 were provided and retrieved from GSE144240 (ref. 23). For ST-colon 1 and ST-liver 1 (ref. 32), the data were retrieved from OEP001756 and http://www.cancerdiversity.asia/scCRLM. The three ST datasets of breast cancer were retrieved from 10x Genomics datasets as in Supplementary Table 3.

Code availability

Clonalscope is available via GitHub at https://github.com/seasoncloud/Clonalscope.

References

  1. Magrangeas, F. et al. Minor clone provides a reservoir for relapse in multiple myeloma. Leukemia 27, 473–481 (2013).

    Article  CAS  PubMed  Google Scholar 

  2. Ding, L. et al. Clonal evolution in relapsed acute myeloid leukaemia revealed by whole-genome sequencing. Nature 481, 506–510 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Wang, K. et al. Genomic landscape of copy number aberrations enables the identification of oncogenic drivers in hepatocellular carcinoma. Hepatology 58, 706–717 (2013).

    Article  PubMed  Google Scholar 

  4. Beroukhim, R. et al. The landscape of somatic copy-number alteration across human cancers. Nature 463, 899–905 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Futreal, P. A. et al. A census of human cancer genes. Nat. Rev. Cancer 4, 177–183 (2004).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Kim, C. et al. Chemoresistance evolution in triple-negative breast cancer delineated by single-cell sequencing. Cell 173, 879–893 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Laks, E. et al. Clonal decomposition and DNA replication states defined by scaled single-cell genome sequencing. Cell 179, 1207–1221 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Navin, N. et al. Tumour evolution inferred by single-cell sequencing. Nature 472, 90–94 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Wu, C.-Y. et al. Integrative single-cell analysis of allele-specific copy number alterations and chromatin accessibility in cancer. Nat. Biotechnol. 39, 1259–1269 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Niknafs, N., Beleva-Guthrie, V., Naiman, D. Q. & Karchin, R. Subclonal hierarchy inference from somatic mutations: automatic reconstruction of cancer evolutionary trees from multi-region next generation sequencing. PLoS Comput. Biol. 11, e1004416 (2015).

    Article  PubMed  PubMed Central  Google Scholar 

  11. Yuan, K., Sakoparnig, T., Markowetz, F. & Beerenwinkel, N. BitPhylogeny: a probabilistic framework for reconstructing intra-tumor phylogenies. Genome Biol. 16, 36 (2015).

    Article  PubMed  PubMed Central  Google Scholar 

  12. Deshwar, A. G. et al. PhyloWGS: reconstructing subclonal composition and evolution from whole-genome sequencing of tumors. Genome Biol. 16, 35 (2015).

    Article  PubMed  PubMed Central  Google Scholar 

  13. El-Kebir, M., Satas, G., Oesper, L. & Raphael, B. J. Inferring the mutational history of a tumor using multi-state perfect phylogeny mixtures. Cell Syst. 3, 43–53 (2016).

    Article  CAS  PubMed  Google Scholar 

  14. Jiang, Y., Qiu, Y., Minn, A. J. & Zhang, N. R. Assessing intratumor heterogeneity and tracking longitudinal and spatial clonal evolutionary history by next-generation sequencing. Proc. Natl Acad. Sci. USA 113, E5528–E5537 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Gao, R. et al. Delineating copy number and clonal substructure in human tumors from single-cell transcriptomes. Nat. Biotechnol. 39, 599–608 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Patel, A. P. et al. Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastoma. Science 344, 1396–1401 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Serin Harmanci, A., Harmanci, A. O. & Zhou, X. CaSpER identifies and visualizes CNV events by integrative analysis of single-cell or bulk RNA-sequencing data. Nat. Commun. 11, 89 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Fan, J. et al. Linking transcriptional and genetic tumor heterogeneity through allele analysis of single-cell RNA-seq data. Genome Res. 28, 1217–1227 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Müller, S., Cho, A., Liu, S. J., Lim, D. A. & Diaz, A. CONICS integrates scRNA-seq with DNA sequencing to map gene expression to tumor sub-clones. Bioinformatics 34, 3217–3219 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  20. Granja, J. M. et al. Single-cell multiomic analysis identifies regulatory programs in mixed-phenotype acute leukemia. Nat. Biotechnol. 37, 1458–1465 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Satpathy, A. T. et al. Massively parallel single-cell chromatin landscapes of human immune cell development and intratumoral T cell exhaustion. Nat. Biotechnol. 37, 925–936 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Ståhl, P. L. et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science 353, 78–82 (2016).

    Article  PubMed  Google Scholar 

  23. Ji, A. L. et al. Multimodal analysis of composition and spatial architecture in human squamous cell carcinoma. Cell 182, 497–514 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Erickson, A. et al. Spatially resolved clonal copy number alterations in benign and malignant tissue. Nature 608, 360–367 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Elyanow, R., Zeira, R., Land, M. & Raphael, B. J. STARCH: copy number and clone inference from spatial transcriptomics data. Phys. Biol. 18, 035001 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Zafar, H., Navin, N., Chen, K. & Nakhleh, L. SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data. Genome Res. 29, 1847–1859 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Borgsmüller, N. et al. BnpC: Bayesian non-parametric clustering of single-cell mutation profiles. Bioinformatics 36, 4854–4859 (2020).

    Article  PubMed  PubMed Central  Google Scholar 

  28. Myhre, S. et al. Influence of DNA copy number and mRNA levels on the expression of breast cancer related proteins. Mol. Oncol. 7, 704–718 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Campbell, K. R. et al. clonealign: statistical integration of independent single-cell RNA and DNA sequencing data from human cancers. Genome Biol. 20, 54 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  30. Zhao, T. et al. Spatial genomics enables multi-modal study of clonal heterogeneity in tissues. Nature 601, 85–91 (2022).

    Article  CAS  PubMed  Google Scholar 

  31. Sathe, A. et al. Colorectal cancer metastases in the liver establish immunosuppressive spatial networking between tumor-associated SPP1+ macrophages and fibroblasts. Clin. Cancer Res. 29, 244–260 (2023).

    Article  CAS  PubMed  Google Scholar 

  32. Wu, Y. et al. Spatiotemporal immune landscape of colorectal cancer liver metastasis at single-cell level. Cancer Discov. 12, 134–153 (2022).

    Article  CAS  PubMed  Google Scholar 

  33. Cancer Genome Atlas Network. Comprehensive molecular characterization of human colon and rectal cancer. Nature 487, 330–337 (2012).

    Article  Google Scholar 

  34. Carrasco-Garcia, E. et al. SOX9-regulated cell plasticity in colorectal metastasis is attenuated by rapamycin. Sci. Rep. 6, 32350 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Liu, J. et al. Immune landscape and prognostic immune-related genes in KRAS-mutant colorectal cancer patients. J. Transl. Med. 19, 27 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Sun, M., Sun, T., He, Z. & Xiong, B. Identification of two novel biomarkers of rectal carcinoma progression and prognosis via co-expression network analysis. Oncotarget 8, 69594–69609 (2017).

    Article  PubMed  PubMed Central  Google Scholar 

  37. Schatoff, E. M., Leach, B. I. & Dow, L. E. WNT signaling and colorectal cancer. Curr. Colorectal Cancer Rep. 13, 101–110 (2017).

    PubMed  PubMed Central  Google Scholar 

  38. Zhu, H. et al. IGFBP2 promotes the EMT of colorectal cancer cells by regulating E-cadherin expression. Int. J. Clin. Exp. Pathol. 12, 2559–2565 (2019).

    CAS  PubMed  PubMed Central  Google Scholar 

  39. Liu, Y.-L. et al. Ligustrazine reverts anthracycline chemotherapy resistance of human breast cancer by inhibiting JAK2/STAT3 signaling and decreasing fibrinogen gamma chain (FGG) expression. Am. J. Cancer Res. 10, 939–952 (2020).

    CAS  PubMed  PubMed Central  Google Scholar 

  40. Akkiprik, M., Hu, L., Sahin, A., Hao, X. & Zhang, W. The subcellular localization of IGFBP5 affects its cell growth and migration functions in breast cancer. BMC Cancer 9, 103 (2009).

    Article  PubMed  PubMed Central  Google Scholar 

  41. Vieira, A. F. & Schmitt, F. An update on breast cancer multigene prognostic tests-emergent clinical biomarkers. Front. Med. 5, 248 (2018).

    Article  Google Scholar 

  42. Ai, L. et al. The transglutaminase 2 gene (TGM2), a potential molecular marker for chemotherapeutic drug sensitivity, is epigenetically silenced in breast cancer. Carcinogenesis 29, 510–518 (2008).

    Article  CAS  PubMed  Google Scholar 

  43. Wang, Y. et al. Nicotinamide N-methyltransferase enhances chemoresistance in breast cancer through SIRT1 protein stabilization. Breast Cancer Res. 21, 64 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  44. Bhattacharya, A. et al. A framework for transcriptome-wide association studies in breast cancer in diverse study populations. Genome Biol. 21, 42 (2020).

    Article  PubMed  PubMed Central  Google Scholar 

  45. Zhu, C., Preissl, S. & Ren, B. Single-cell multimodal omics: the power of many. Nat. Methods 17, 11–14 (2020).

    Article  CAS  PubMed  Google Scholar 

  46. Cable, D. M. et al. Robust decomposition of cell type mixtures in spatial transcriptomics. Nat. Biotechnol. 40, 517–526 (2022).

    Article  CAS  PubMed  Google Scholar 

  47. Elosua-Bayes, M., Nieto, P., Mereu, E., Gut, I. & Heyn, H. SPOTlight: seeded NMF regression to deconvolute spatial transcriptomics spots with single-cell transcriptomes. Nucleic Acids Res. 49, e50 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Dong, R. & Yuan, G.-C. SpatialDWLS: accurate deconvolution of spatial transcriptomic data. Genome Biol. 22, 145 (2021).

    Article  PubMed  PubMed Central  Google Scholar 

  49. Kleshchevnikov, V. et al. Cell2location maps fine-grained cell types in spatial transcriptomics. Nat. Biotechnol. 40, 661–671 (2022).

    Article  CAS  PubMed  Google Scholar 

  50. Bai, X., Lau, B., Grimes, S. M., Sathe, A. & Ji, H. P. Single cell multi-omic mapping of subclonal architecture and pathway phenotype in primary gastric and metastatic colon cancers. Preprint at bioRxiv https://doi.org/10.1101/2022.07.03.498616 (2022).

  51. Sathe, A. et al. Single-cell genomic characterization reveals the cellular reprogramming of the gastric tumor microenvironment. Clin. Cancer Res. 26, 2640–2653 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. Sathe, A. et al. GITR and TIGIT immunotherapy provokes divergent multicellular responses in the tumor microenvironment of gastrointestinal cancers. Genome Med. 15, 100 (2023).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  53. Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  54. Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).

    Article  PubMed  PubMed Central  Google Scholar 

  55. Mason, K. et al. Niche-DE: niche-differential gene expression analysis in spatial transcriptomics data identifies context-dependent cell-cell interactions. Genome Biol. 25, 14 (2024).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  56. Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).

    Article  PubMed  PubMed Central  Google Scholar 

  57. Packer, J. S. et al. A lineage-resolved molecular atlas of C. elegans embryogenesis at single-cell resolution. Science 365, eaax1971 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. Hao, Y. et al. Integrated analysis of multimodal single-cell data. Cell 184, 3573–3587 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  59. Hinrichs, A. S. et al. The UCSC Genome Browser Database: update 2006. Nucleic Acids Res. 34, D590–D598 (2006).

    Article  CAS  PubMed  Google Scholar 

  60. Andor, N. et al. Joint single cell DNA-seq and RNA-seq of gastric cancer cell lines reveals rules of in vitro evolution. NAR Genom. Bioinform. 2, lqaa016 (2020).

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

This work is supported by the National Institutes of Health (P01HG00205ESH to B.T.L., S.M.G. and H.P.J. and 5R01-HG006137-07 and 1U2CCA233285-01 to C.-Y.W. and N.R.Z.). Additional support to H.P.J. came from the Research Scholar Grant (RSG-13-297-01-TBG) from the American Cancer Society, the Clayville Foundation and the Gastric Cancer Foundation.

Author information

Authors and Affiliations

Authors

Contributions

C.-Y.W. and N.R.Z. conceived the computational methods and designed the study, with help from H.P.J. C.-Y.W. developed and implemented the computational methods. C.-Y.W. and J.R. conducted the data analyses. A.S. performed all related sample preparation and sequencing. J.R. and S.H. helped with benchmarking and improvement of the analysis pipeline. P.R.H. helped with pathology annotation and data interpretation. B.T.L. helped in data interpretation. S.M.G. performed data preprocessing and coordinated data transfer. H.P.J. advised on all experiments and data collection. C.-Y.W., J.R. and N.R.Z. wrote the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Nancy R. Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Methods thanks Chenfei Wang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available. Primary Handling Editor: Lin Tang, in collaboration with the Nature Methods team.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Subclone detection and differential gene analysis an invasive ductal carcinoma sample.

a. Subclones detected from Clonalscope on the ST dataset from an invasive ductal carcinoma sample and paired spot-level pathologist annotation. b. The mean copy number profiles across the cells assigned to the three spatially segregated subclones (bottom), and the density plots of four regions with differences in the copy number states for at least one of the three subclones (top). c, d. show the top 6 highly expressed genes for tumor subclones 2 and 3. The color scale indicates log-normalized UMI counts for each gene.

Supplementary information

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wu, CY., Rong, J., Sathe, A. et al. Cancer subclone detection based on DNA copy number in single-cell and spatial omic sequencing data. Nat Methods 22, 1846–1856 (2025). https://doi.org/10.1038/s41592-025-02773-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • Issue date:

  • DOI: https://doi.org/10.1038/s41592-025-02773-5

This article is cited by

Search

Quick links

Nature Briefing: Cancer

Sign up for the Nature Briefing: Cancer newsletter — what matters in cancer research, free to your inbox weekly.

Get what matters in cancer research, free to your inbox weekly. Sign up for Nature Briefing: Cancer