Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Polyclonal-to-monoclonal transition in colorectal precancerous evolution

Abstract

Unravelling the origin and evolution of precancerous lesions is crucial for effectively preventing malignant transformation, yet our current knowledge remains limited1,2,3. Here we used a base editor-enabled DNA barcoding system4 to comprehensively map single-cell phylogenies in mouse models of intestinal tumorigenesis induced by inflammation or loss of the Apc gene. Through quantitative analysis of high-resolution phylogenies including 260,922 single cells from normal, inflamed and neoplastic intestinal tissues, we identified tens of independent cell lineages undergoing parallel clonal expansions within each lesion. We also found polyclonal origins of human sporadic colorectal polyps through bulk whole-exome sequencing and single-gland whole-genome sequencing. Genomic and clinical data support a model of polyclonal-to-monoclonal transition, with monoclonal lesions representing a more advanced stage. Single-cell RNA sequencing revealed extensive intercellular interactions in early polyclonal lesions, but there was significant loss of interactions during monoclonal transition. Therefore, our data suggest that colorectal precancer is often founded by many different lineages and highlight their cooperative interactions in the earliest stages of cancer formation. These findings provide insights into opportunities for earlier intervention in colorectal cancer.

This is a preview of subscription content, access via your institution

Access options

Buy this article

USD 39.95

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: SMALT lineage tracing of mouse intestinal tumorigenesis.
The alternative text for this image may have been generated using AI.
Fig. 2: Single-cell phylogenies reveal the origin of inflammation-driven neoplasms.
The alternative text for this image may have been generated using AI.
Fig. 3: Polyclonal-to-monoclonal transition in human sporadic polyps.
The alternative text for this image may have been generated using AI.
Fig. 4: Intercellular interactions and polyclonal-to-monoclonal evolution model.
The alternative text for this image may have been generated using AI.

Similar content being viewed by others

Data availability

All processed data generated in this study have been deposited and are available at Zenodo (https://zenodo.org/records/11647317 (ref. 82)). The single-cell phylogenies generated from this study are displayed at https://smalt-phylogeny.org/. Raw data are publicly available from the National Genomics Data Center (NGDC) under the accession numbers PRJCA024217 and PRJCA023981. Public scRNA-seq data for two wild-type mouse colon samples can be accessed with GSE134255 (https://www.ncbi.nlm.nih.gov/).

Code availability

All computer code used in this study is available from the GitHub repositories https://github.com/zhaolianlu/SMALT-mouse and https://github.com/zhaolianlu/Homo-preCRC.

References

  1. Curtius, K., Wright, N. A. & Graham, T. A. Evolution of premalignant disease. Cold Spring Harb. Perspect. Med. 7, a026542 (2017).

    PubMed  PubMed Central  Google Scholar 

  2. Rane, J. K., Frankell, A. M., Weeden, C. E. & Swanton, C. Clonal evolution in healthy and premalignant tissues: implications for early cancer interception strategies. Cancer Prev. Res. 16, 369–378 (2023).

    CAS  Google Scholar 

  3. Zhang, S. et al. Tumor initiation and early tumorigenesis: molecular mechanisms and interventional targets. Signal Transduct. Target. Ther. 9, 149 (2024).

    PubMed  PubMed Central  Google Scholar 

  4. Liu, K. et al. Mapping single-cell-resolution cell phylogeny reveals cell population dynamics during organ development. Nat. Methods 18, 1506–1514 (2021).

    CAS  PubMed  Google Scholar 

  5. Curtius, K., Wright, N. A. & Graham, T. A. An evolutionary perspective on field cancerization. Nat. Rev. Cancer 18, 19–32 (2018).

    CAS  PubMed  Google Scholar 

  6. Dekker, E., Tanis, P. J., Vleugels, J. L. A., Kasi, P. M. & Wallace, M. B. Colorectal cancer. Lancet 394, 1467–1480 (2019).

    PubMed  Google Scholar 

  7. Fearon, E. R. & Vogelstein, B. A genetic model for colorectal tumorigenesis. Cell 61, 759–767 (1990).

    CAS  PubMed  Google Scholar 

  8. Gerstung, M. et al. The evolutionary history of 2,658 cancers. Nature 578, 122–128 (2020).

    CAS  PubMed  PubMed Central  ADS  Google Scholar 

  9. Parsons, B. L. Multiclonal tumor origin: evidence and implications. Mutat. Res. Rev. Mutat. Res. 777, 1–18 (2018).

    CAS  PubMed  Google Scholar 

  10. Chen, B. et al. Very large hidden genetic diversity in one single tumor: evidence for tumors-in-tumor. Natl Sci. Rev. 9, nwac250 (2022).

    PubMed  PubMed Central  Google Scholar 

  11. McKenna, A. et al. Whole-organism lineage tracing by combinatorial and cumulative genome editing. Science 353, aaf7907 (2016).

    PubMed  PubMed Central  Google Scholar 

  12. Chan, M. M. et al. Molecular recording of mammalian embryogenesis. Nature 570, 77–82 (2019).

    CAS  PubMed  PubMed Central  ADS  Google Scholar 

  13. Wagner, D. E. & Klein, A. M. Lineage tracing meets single-cell omics: opportunities and challenges. Nat. Rev. Genet. 21, 410–427 (2020).

    CAS  PubMed  PubMed Central  Google Scholar 

  14. Yang, D. et al. Lineage tracing reveals the phylodynamics, plasticity, and paths of tumor evolution. Cell 185, 1905–1923 e1925 (2022).

    CAS  PubMed  PubMed Central  Google Scholar 

  15. Wirtz, S. et al. Chemically induced mouse models of acute and chronic intestinal inflammation. Nat. Protoc. 12, 1295–1309 (2017).

    CAS  PubMed  Google Scholar 

  16. Pan, Q. et al. Genomic variants in mouse model induced by azoxymethane and dextran sodium sulfate improperly mimic human colorectal cancer. Sci Rep. 7, 25 (2017).

    PubMed  PubMed Central  ADS  Google Scholar 

  17. Roerink, S. F. et al. Intra-tumour diversification in colorectal cancer at the single-cell level. Nature 556, 457–462 (2018).

    CAS  PubMed  ADS  Google Scholar 

  18. Leystra, A. A. et al. Multi-ancestral origin of intestinal tumors: Impact on growth, progression, and drug efficacy. Cancer Rep. 5, e1459 (2022).

    CAS  Google Scholar 

  19. Novelli, M. R. et al. Polyclonal origin of colonic adenomas in an XO/XY patient with FAP. Science 272, 1187–1190 (1996).

    CAS  PubMed  ADS  Google Scholar 

  20. Merritt, A. J., Gould, K. A. & Dove, W. F. Polyclonal structure of intestinal adenomas in ApcMin/+ mice with concomitant loss of Apc+ from all tumor lineages. Proc. Natl Acad. Sci. USA 94, 13927–13931 (1997).

    CAS  PubMed  PubMed Central  ADS  Google Scholar 

  21. Halberg, R. B. & Dove, W. F. Polyclonal tumors in the mammalian intestine: are interactions among multiple initiated clones necessary for tumor initiation, growth, and progression? Cell Cycle 6, 44–51 (2007).

    CAS  PubMed  Google Scholar 

  22. Deng, S., Gong, H., Zhang, D., Zhang, M. & He, X. A statistical method for quantifying progenitor cells reveals incipient cell fate commitments. Nat. Methods 21, 597–608 (2024).

    CAS  PubMed  Google Scholar 

  23. Olafsson, S. et al. Somatic evolution in non-neoplastic IBD-affected colon. Cell 182, 672–684.e611 (2020).

    CAS  PubMed  PubMed Central  Google Scholar 

  24. Martincorena, I. et al. Universal patterns of selection in cancer and somatic tissues. Cell 171, 1029–1041.e1021 (2017).

    CAS  PubMed  PubMed Central  Google Scholar 

  25. Hu, Z. et al. Quantitative evidence for early metastatic seeding in colorectal cancer. Nat. Genet. 51, 1113–1122 (2019).

    CAS  PubMed  PubMed Central  Google Scholar 

  26. Potten, C. S. Stem cells in gastrointestinal epithelium: numbers, characteristics and death. Philos. Trans. R. Soc. B 353, 821–830 (1998).

    CAS  Google Scholar 

  27. van der Flier, L. G. & Clevers, H. Stem cells, self-renewal, and differentiation in the intestinal epithelium. Annu. Rev. Physiol. 71, 241–260 (2009).

    PubMed  Google Scholar 

  28. Hu, Z., Li, Z., Ma, Z. & Curtis, C. Multi-cancer analysis of clonality and the timing of systemic spread in paired primary tumors and metastases. Nat. Genet. 52, 701–708 (2020).

    CAS  PubMed  PubMed Central  Google Scholar 

  29. Huang, K. K. et al. Spatiotemporal genomic profiling of intestinal metaplasia reveals clonal dynamics of gastric cancer progression. Cancer Cell 41, 2019–2037.e8 (2023).

    CAS  PubMed  PubMed Central  Google Scholar 

  30. Gay, D. M. et al. Loss of BCL9/9l suppresses Wnt driven tumourigenesis in models that recapitulate human cancer. Nat. Commun. 10, 723 (2019).

    CAS  PubMed  PubMed Central  ADS  Google Scholar 

  31. Lee-Six, H. et al. The landscape of somatic mutation in normal colorectal epithelial cells. Nature 574, 532–537 (2019).

    CAS  PubMed  ADS  Google Scholar 

  32. Heide, T. et al. The co-evolution of the genome and epigenome in colorectal cancer. Nature 611, 733–743 (2022).

    CAS  PubMed  PubMed Central  ADS  Google Scholar 

  33. Vega, P. N. et al. Cancer-associated fibroblasts and squamous epithelial cells constitute a unique microenvironment in a mouse model of inflammation-induced colon cancer. Front. Oncol. 12, 878920 (2022).

    CAS  PubMed  PubMed Central  Google Scholar 

  34. Ma, R. Y., Black, A. & Qian, B. Z. Macrophage diversity in cancer revisited in the era of single-cell omics. Trends Immunol. 43, 546–563 (2022).

    CAS  PubMed  Google Scholar 

  35. Katzenelenbogen, Y. et al. Coupled scRNA-seq and intracellular protein activity reveal an immunosuppressive role of TREM2 in cancer. Cell 182, 872–885.e819 (2020).

    CAS  PubMed  Google Scholar 

  36. Jin, S. et al. Inference and analysis of cell–cell communication using CellChat. Nat. Commun. 12, 1088 (2021).

    CAS  PubMed  PubMed Central  ADS  Google Scholar 

  37. Teller, I. C. & Beaulieu, J. F. Interactions between laminin and epithelial cells in intestinal health and disease. Expert Rev. Mol. Med. https://doi.org/10.1017/S1462399401003623 (2001).

  38. Thliveris, A. T. et al. Polyclonality of familial murine adenomas: analyses of mouse chimeras with low tumor multiplicity suggest short-range interactions. Proc. Natl Acad. Sci. USA 102, 6960–6965 (2005).

    CAS  PubMed  PubMed Central  ADS  Google Scholar 

  39. Tabassum, D. P. & Polyak, K. Tumorigenesis: it takes a village. Nat. Rev. Cancer 15, 473–483 (2015).

    CAS  PubMed  Google Scholar 

  40. Lee, N. D., Kaveh, K. & Bozic, I. Clonal interactions in cancer: Integrating quantitative models with experimental and clinical data. Semin. Cancer Biol. 92, 61–73 (2023).

    PubMed  Google Scholar 

  41. Reeves, M. Q., Kandyba, E., Harris, S., Del Rosario, R. & Balmain, A. Multicolour lineage tracing reveals clonal dynamics of squamous carcinoma evolution from initiation to metastasis. Nat. Cell Biol. 20, 699–709 (2018).

    CAS  PubMed  PubMed Central  Google Scholar 

  42. Johnson, K. E. et al. Cancer cell population growth kinetics at low densities deviate from the exponential growth model and suggest an Allee effect. PLoS Biol. 17, e3000399 (2019).

    CAS  PubMed  PubMed Central  Google Scholar 

  43. Martinez, P. et al. Dynamic clonal equilibrium and predetermined cancer risk in Barrett’s oesophagus. Nat. Commun. 7, 12158 (2016).

    CAS  PubMed  PubMed Central  ADS  Google Scholar 

  44. Zhang, D., Zhang, A., He, X. & Deng, S. Variation in cancer risk between organs can not be explained by the degree of somatic clonal expansion. Adv. Biotechnol. 2, 18 (2024).

    Google Scholar 

  45. Li, H. New strategies to improve minimap2 alignment accuracy. Bioinformatics 37, 4572–4574 (2021).

    CAS  PubMed  PubMed Central  Google Scholar 

  46. Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience 10, giab008 (2021).

    PubMed  PubMed Central  Google Scholar 

  47. Edgar, R. C. Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26, 2460–2461 (2010).

    CAS  PubMed  Google Scholar 

  48. Minh, B. Q. et al. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37, 1530–1534 (2020).

    CAS  PubMed  PubMed Central  Google Scholar 

  49. Gschwantler, M. et al. High-grade dysplasia and invasive carcinoma in colorectal adenomas: a multivariate analysis of the impact of adenoma and patient characteristics. Eur. J. Gastroenterol. Hepatol. 14, 183–188 (2002).

    PubMed  Google Scholar 

  50. Sawhney, M. S., Dickstein, J., LeClair, J., Lembo, C. & Yee, E. Adenomas with high-grade dysplasia and early adenocarcinoma are more likely to be sessile in the proximal colon. Colorectal Dis. 17, 682–688 (2015).

    CAS  PubMed  Google Scholar 

  51. Gibson, J. A. & Odze, R. D. Pathology of premalignant colorectal neoplasia. Dig. Endosc. 28, 312–323 (2016).

    PubMed  Google Scholar 

  52. Kuo, E., Wang, K. & Liu, X. A focused review on advances in risk stratification of malignant polyps. Gastroenterology Res. 13, 163–183 (2020).

    CAS  PubMed  PubMed Central  Google Scholar 

  53. Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).

    PubMed  PubMed Central  Google Scholar 

  54. Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).

    CAS  PubMed  PubMed Central  Google Scholar 

  55. McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).

    CAS  PubMed  PubMed Central  Google Scholar 

  56. DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498 (2011).

    CAS  PubMed  PubMed Central  Google Scholar 

  57. Van der Auwera, G. A. et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr. Protoc. Bioinformatics 43, 11 10 11–11 10 33 (2013).

    Google Scholar 

  58. Benjamin, D. et al. Calling somatic SNVs and indels with Mutect2. Preprint at bioRxiv https://doi.org/10.1101/861054v1 (2019).

  59. Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164 (2010).

    PubMed  PubMed Central  Google Scholar 

  60. Ha, G. et al. TITAN: inference of copy number architectures in clonal cell populations from tumor whole-genome sequence data. Genome Res. 24, 1881–1893 (2014).

    CAS  PubMed  PubMed Central  Google Scholar 

  61. Tarabichi, M. et al. A practical guide to cancer subclonal reconstruction from DNA sequencing. Nat. Methods 18, 144–155 (2021).

    CAS  PubMed  PubMed Central  Google Scholar 

  62. Robinson, J. T. et al. Integrative genomics viewer. Nat. Biotechnol. 29, 24–26 (2011).

    CAS  PubMed  PubMed Central  Google Scholar 

  63. Lange, S. et al. Analysis pipelines for cancer genome sequencing in mice. Nat. Protoc. 15, 266–315 (2020).

    CAS  PubMed  Google Scholar 

  64. Ellis, P. et al. Reliable detection of somatic mutations in solid tissues by laser-capture microdissection and low-input DNA sequencing. Nat. Protoc. 16, 841–871 (2021).

    CAS  PubMed  Google Scholar 

  65. Kim, S. et al. Strelka2: fast and accurate calling of germline and somatic variants. Nat. Methods 15, 591–594 (2018).

    CAS  PubMed  Google Scholar 

  66. Saitou, N. & Nei, M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4, 406–425 (1987).

    CAS  PubMed  Google Scholar 

  67. Cock, P. J. et al. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics 25, 1422–1423 (2009).

    CAS  PubMed  PubMed Central  Google Scholar 

  68. Favero, F. et al. Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data. Ann. Oncol. 26, 64–70 (2015).

    CAS  PubMed  Google Scholar 

  69. Lips, E. H. et al. Genomic analysis defines clonal relationships of ductal carcinoma in situ and recurrent invasive breast cancer. Nat. Genet. 54, 850–860 (2022).

    CAS  PubMed  PubMed Central  Google Scholar 

  70. Li, J. et al. Genomic and transcriptomic profiling of carcinogenesis in patients with familial adenomatous polyposis. Gut 69, 1283–1293 (2020).

    CAS  PubMed  Google Scholar 

  71. Bailey, M. H. et al. Comprehensive characterization of cancer driver genes and mutations. Cell 173, 371–385.e318 (2018).

    CAS  PubMed  PubMed Central  Google Scholar 

  72. Martin, F. J. et al. Ensembl 2023. Nucleic Acids Res. 51, D933–D941 (2023).

    CAS  PubMed  Google Scholar 

  73. Hao, Y. et al. Dictionary learning for integrative, multimodal and scalable single-cell analysis. Nat. Biotechnol. 42, 293–304 (2024).

    CAS  PubMed  Google Scholar 

  74. McGinnis, C. S., Murrow, L. M. & Gartner, Z. J. DoubletFinder: doublet detection in single-cell RNA sequencing data using artificial nearest neighbors. Cell Syst. 8, 329–337 e324 (2019).

    CAS  PubMed  PubMed Central  Google Scholar 

  75. Hafemeister, C. & Satija, R. Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression. Genome Biol. 20, 296 (2019).

    CAS  PubMed  PubMed Central  Google Scholar 

  76. Choudhary, S. & Satija, R. Comparison and evaluation of statistical error models for scRNA-seq. Genome Biol. 23, 27 (2022).

    CAS  PubMed  PubMed Central  Google Scholar 

  77. Korsunsky, I. et al. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat. Methods 16, 1289–1296 (2019).

    CAS  PubMed  PubMed Central  Google Scholar 

  78. Wu, T. et al. clusterProfiler 4.0: a universal enrichment tool for interpreting omics data. Innovation 2, 100141 (2021).

    CAS  PubMed  PubMed Central  Google Scholar 

  79. Hanzelmann, S., Castelo, R. & Guinney, J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics 14, 7 (2013).

    PubMed  PubMed Central  Google Scholar 

  80. Dann, E., Henderson, N. C., Teichmann, S. A., Morgan, M. D. & Marioni, J. C. Differential abundance testing on single-cell data using k-nearest neighbor graphs. Nat. Biotechnol. 40, 245–253 (2022).

    CAS  PubMed  Google Scholar 

  81. Browaeys, R. et al. MultiNicheNet: a flexible framework for differential cell-cell communication analysis from multi-sample multi-condition single-cell transcriptomics data. Preprint at bioRxiv https://doi.org/10.1101/2023.06.13.544751v1 (2023).

  82. Lu. Z. et al. Systematic lineage tracing unveils polyclonal origin and evolution in colorectal precancer. Zenodo https://doi.org/10.5281/zenodo.11647317 (2024).

Download references

Acknowledgements

The authors thank C. Curtis, Q. Nie, H. Ji, W. Zhai and Hu laboratory members for constructive discussions, and B. Chen for the guidance of gland isolations. This work was supported by National Key R&D Program of China (2021YFA1302500 to Z. Hu and X.H.), National Natural Sciences Foundation of China (82241236 and 32270693 to Z. Hu, 32293190 and 32293191 to X.H., 82273346 to Z. He and 32100486 to Z.L.), Guangdong Basic and Applied Basic Research Foundation (2021B1515020042 to Z. Hu) and Shenzhen Science and Technology Program (RCBS20210706092346032 to Z.L.).

Author information

Authors and Affiliations

Authors

Contributions

Z. Hu, X.H. and Z.L. conceived and designed the study. Z.L., S.D., K.Z. and J.T. performed mouse experiments. Z.L. analysed the targeted long-read sequencing data. S.M. and H.Z. conducted single gland-related experiments. S.M. and D.X. analysed the WGS and WES data. X.Z. analysed the scRNA-seq data. Z. Hu and D.X. performed computational modelling and inferences. X.K. performed organoid cultures. J.W. provided guidance on data analysis. L.T.O.L. provided guidance on mouse experiments. Z. He, P.L. and J.Y. identified patients with synchronous polyps and CRC, performed colonoscopic polypectomy and collected clinical specimens. Y.Z. conducted pathological diagnosis and analysis. Z. Hu, Z.L., S.M., D.X. and X.Z. wrote the manuscript with contributions from all co-authors. Z. Hu, X.H. and Z. He supervised the project.

Corresponding authors

Correspondence to Zhen He, Xionglei He or Zheng Hu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature thanks James DeGregori, Richard Halberg and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Establishment of AOM/DSS and ApcMin/+ mouse models carrying the SMALT lineage tracing system.

a, The 3 kb DNA barcode consists of 16 different tandem targets, each with an 18 bp iSceI binding site and a 156 bp editing region. The HsAID protein mainly induces C-to-T (or G-to-A) mutations in the editing region, which can be used to reconstruct single-cell phylogenies. b, The SMALT system was knocked into the H11 locus of C57BL/6 J mouse. The insert contains hsAID, iSceI and a 3 kb DNA barcode. c, Schematic of our mouse experiments following standard dose and course of drug treatment. d, Mouse absolute weight (top) and relative weight to initial timepoint (bottom) were recorded over time in days. Error bars presented as mean ± SEM. e, At the endpoint of experiment, normal colons, AOM/DSS colons or whole small intestines in ApcMin/+ mice were dissected.

Extended Data Fig. 2 Barcode mutation burden among sorted immune (CD45+) and epithelial (EpCAM+) cells.

a, FACS sorting of immune and epithelial neoplastic cells using the cell-surface markers, CD45 and EpCAM, respectively. Vertical and horizontal lines indicate the threshold for positive cells. b, The mutation burden for sorted immune cells and neoplastic cells, where a filter step for the tumor cells was applied using the cutoff of 75th percentile of total mutation counts among adjacent normal cells. P values, two-sided Wilcoxon rank-sum test. c, Representative single-cell phylogenies including real normal cells, plausible normal cells within neoplastic samples and post-filtering neoplastic cells. Red, post-filtering neoplastic cells; blue, normal colon cells from adjacent colon tissues; purple, plausible normal cells within neoplastic tissues (discarded in downstream analysis). Each phylogenetic tree are reconstructed using approximately 500 cells sampled from each cell subpopulation.

Extended Data Fig. 3 Phylogenetic trees of all 30 AOM/DSS neoplasms.

Each dot at the external node of phylogenetic trees represents an individual cell. Neoplastic cells are colored in red while normal colon cells from corresponding adjacent colons are colored in blue. Total number of cells from paired normal/neoplastic tissues and the number of founding progeniors are shown.

Extended Data Fig. 4 Polyclonal origins of ApcMin/+ mouse polyps.

a, Single-cell phylogenies including neoplastic cells from an individual polyp and normal intestinal cells from the same mouse. b, Image of the polyp Apc68_P5 which was dissected into five regions and sequenced separately. c, Single-cell phylogeny of Apc68_P5 showing strong clonal expansions in regions R1 and R5. d, Mutational cellular frequencies (MCF) on barcodes support that all polyps from the two mice are polyclonal, whereas certain regions of Apc68_P5 (R1 and R5) are under strong clonal expansions. e, Cells from clonally expanding regions of R1 (n = 924 cells) and R5 (n = 1,139 cells) have higher fitness scores as compared to regions R2 (n = 196 cells), R3 (n = 197 cells) and R4 (n = 454 cells). Box plots: bar, median; box, 25th to 75th interquartile range (IQR); vertical/horizontal line across box, data within 1.5 times the IQR.

Extended Data Fig. 5 The number and timing of founding progenitors in ApcMin/+ mouse polyps.

a, Schematic of the timing inference for the founding of ApcMin/+ polyclonal polyps. The average barcode mutation burden in normal cells can be expressed as m0 = µ0T, where T is the time from fertilized egg to time of polyp sampling. The average mutation burden for neoplastic cells within each monophyletic clade in the phylogenetic tree is m1. The ratio of the neoplastic cell mutation rate to normal cell mutation rate (denoted by r) is estimated from mutation accumulation data by in vitro organoid cultures. b-c, Organoids were generated from normal small intestine tissues and neoplastic polyps from a SMALT -carrying ApcMin/+ mouse. Organoids were cultured for 30 days with Dox (2 µg/ml)-containing medium. The SMALT barcode in organoid samples were sequenced by PacBio long-read sequencer at day 0, day 15 and day 30, respectively. d-e, Linear regression for the number of barcode mutations over the time of culture (days) for ApcMin/+ normal organoids (d) and neoplastic organoids (e), respectively. f, The estimated number of founding progenitors (Np) for each polyp from Apc68 (left) or Apc72 (right). Cells from each polyp were randomly downsampled 20 times for tree reconstruction and Np calculation (n = 20 downsamplings) (Methods). g, The estimated timing (in postnatal days) of progenitors founding each polyp in Apc68 (left) or Apc72 (right) (n = 20 downsamplings). Dot and error bars represent the median and interquartile range (IQR), respectively. Red and grey dashed lines denote the time of mouse birth and tissue sampling, respectively. Box plots: bar, median; box, 25th to 75th interquartile range (IQR); vertical/horizontal line across box, data within 1.5 times the IQR.

Extended Data Fig. 6 The landscape of somatic copy number alterations (SCNAs) in polyps and CRCs.

Each column in the figure represents a sample and each row defines a regional window in the chromosome. P_poly, polyclonal polyp; P_mono, monoclonal polyp; T_poly, polyclonal tumor; T_mono, monoclonal tumor. AMP, amplification; DEL, deletion. wGII, weighted genome instability index. CIN, chromosomal instability.

Extended Data Fig. 7 Whole-genome sequencing (WGS) of 32 single glands from normal and neoplastic tissues of a sporadic patient B139.

a, H&E images of the normal and five different neoplastic tissues. Scale bar: 200 um. b, Heatmap showing the occurrence of all SSNVs among individual glands. c, Heatmap showing the occurrence of putative driver mutations among individual glands. fs_Del, frameshift deletion; fs_Ins, frameshift insertion. SSNVs, somatic single nucleotide variants; SCNA, somatic copy number alterations; LOH, loss of heterozygosity. AMP, amplification.

Extended Data Fig. 8 Single-cell RNA-seq profiles of AOM/DSS-induced colon neoplasms.

a, Diagram of uniform manifold approximation and projection (UMAP) showing the integration of single-cell RNA-seq data from wild-type (WT) normal samples33 (n  =  2) and AOM/DSS neoplasms in this study (n  =  9). b, UMAP plot showing the single cells colored by individual samples. c, UMAP plot showing the single cells colored by annotated cell types. d, Proportion of cell types in each sample ordered by increase of lesion clonality (1/Np), where Np represents the number of founding progenitors. e, Spearman’s correlation between the proportion of macrophages, neutrophils, endothelial or epithelial cells and the lesion clonality (1/Np). The shaded region represents 95% CI of regression. Spearman’s ρ and P value are shown. d, Proportion of cell subtypes in each sample ordered by increase of lesion clonality (1/Np). g, Hallmark pathway analysis of differentially expressed genes between high clonality versus low clonality AOM/DSS neoplasms. Late and early lesions are defined as Np ≤ 3 and >3, respectively. NES, normalized enrichment score.

Extended Data Fig. 9 Circular network plots showing the intensity of 14 L/R interactions between epithelial cell types in early versus late AOM/DSS neoplasms.

The nodes represent major epithelial cell subtypes. The thickness of edges represents the average number of ligand/receptor interactions (LRIs) between two cell subtypes after 50 times of downsamplings.

Extended Data Fig. 10 Inter-cellular interactions and models of polyclonal orgins.

a-b, The number of LRIs among the 14 L/R pairs that are contributed by each of the 10 epithelial cell types. Each epithelial cell type can act as a sender or receiver. c, Pathways enriched and activated in Krt20+ neoplastic cells (Epi-Krt20) compared to other epithelial cell subclusters. Early and late represents the lesions with Np > 3 and Np \(\le \) 3, respectively. d, Two models of polyclonal origins in precancer. In the recruitment model, a neoplastic clone facilitates the growth of its neighboring normal cells. Subsequently, multiple lineages arise and contribute to a polyclonal neoplastic lesion. In the collision model, multiple neoplastic lineages, such as those independently acquiring a driver mutation, form a single lesion through random collisions in the same spatial proximity.

Supplementary information

Supplementary Information (download PDF )

Supplementary Figs. 1–28, Supplementary Notes 1–5 and descriptions for Supplementary Tables 1–17.

Reporting Summary (download PDF )

Supplementary Tables (download XLSX )

Supplementary Tables 1–17.

Peer Review File (download PDF )

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lu, Z., Mo, S., Xie, D. et al. Polyclonal-to-monoclonal transition in colorectal precancerous evolution. Nature 636, 233–240 (2024). https://doi.org/10.1038/s41586-024-08133-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • Issue date:

  • DOI: https://doi.org/10.1038/s41586-024-08133-1

This article is cited by

Search

Quick links

Nature Briefing: Cancer

Sign up for the Nature Briefing: Cancer newsletter — what matters in cancer research, free to your inbox weekly.

Get what matters in cancer research, free to your inbox weekly. Sign up for Nature Briefing: Cancer