Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Mapping the zygote-to-adult developmental cell phylogeny in Arabidopsis thaliana reveals a three-cell rule of branching

Abstract

Similar to the evolutionary history of all species on Earth, the developmental lineage history of all cells in a multicellular organism is stored in a phylogenetic tree. Mapping the zygote-to-adult developmental cell phylogeny of a complex organism is of tremendous value but technically challenging. We here developed e-SMALT, a powerful cell lineage tracing system integrated with single-cell RNA sequencing, in Arabidopsis thaliana to record the zygote-to-adult developmental lineages of two plant individuals. The system performed efficiently in A. thaliana, with an average of ~50 barcoding mutations accumulated on the 1-kb barcode sequence of each cell in 3-month-old plants. Using the barcoding mutations, we reconstructed the phylogenetic tree for thousands of cells sampled from various shoot branches of each plant, with high statistical confidence and at single-cell resolution. The cell phylogenies show that cells of every shoot branch are derived from exactly three founder cells, each belonging to one of three early-determined lineages. The three-cell pattern holds for primary, secondary and tertiary branches, and even for single flowers/siliques. Incorporating single-cell RNA sequencing data revealed the three founder cells responsible for establishing the three germ layers of each branch/organ, which in turn updates our understanding of plant germ layers to single-cell resolution. We further showed that the three-cell rule reflects an adaptive strategy for an indeterminate plant to manage its stem cell pool, suggesting an analytical framework to unify the distinct strategies between plants and animals in organogenesis.

This is a preview of subscription content, access via your institution

Access options

Buy this article

USD 39.95

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Adapting the evolving barcoding system SMALT to Arabidopsis.
Fig. 2: Reconstruction of zygote-to-adult developmental cell phylogeny.
Fig. 3: The cell phylogeny reveals three founder cells for each branching process.
Fig. 4: Integrating SMALT with scRNA-seq.
Fig. 5: The three founder cells are responsible for establishing three germ layers.
Fig. 6: In-depth characterization of three germ layers at single-cell resolution.
Fig. 7: Reconciling the founder cell number during organogenesis.

Similar content being viewed by others

Data availability

The raw sequence data of lineage barcode amplicons and single-cell transcriptomics have been deposited in the Genome Sequence Archive in the National Genomics Data Center (https://ngdc.cncb.ac.cn/gsa) with accession number CRA016061.

Code availability

The software sources and core parameters are provided in Methods. The custom data analysis scripts are available via GitHub at https://github.com/XFannv/eSMALT.git.

References

  1. Frumkin, D., Wasserstrom, A., Kaplan, S., Feige, U. & Shapiro, E. Genomic variability within an organism exposes its cell lineage tree. PLoS Comput. Biol. 1, e50 (2005).

    Article  PubMed  PubMed Central  Google Scholar 

  2. Salipante, S. J. & Horwitz, M. S. Phylogenetic fate mapping. Proc. Natl Acad. Sci. USA 103, 5448–5453 (2006).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Behjati, S. et al. Genome sequencing of normal cells reveals developmental lineages and mutational processes. Nature 513, 422–425 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Park, S. et al. Clonal dynamics in early human embryogenesis inferred from somatic mutation. Nature 597, 393–397 (2021).

    Article  CAS  PubMed  Google Scholar 

  5. Spencer Chapman, M. et al. Lineage tracing of human development through somatic mutations. Nature 595, 85–90 (2021).

    Article  CAS  PubMed  Google Scholar 

  6. Mitchell, E. et al. Clonal dynamics of haematopoiesis across the human lifespan. Nature 606, 343–350 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Weng, C. et al. Deciphering cell states and genealogies of human haematopoiesis. Nature 627, 389–398 (2024).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Stadler, T., Pybus, O. G. & Stumpf, M. P. H. Phylodynamics for cell biologists. Science 371, eaah6266 (2021).

    Article  CAS  PubMed  Google Scholar 

  9. Domcke, S. & Shendure, J. A reference cell tree will serve science better than a reference cell atlas. Cell 186, 1103–1114 (2023).

    Article  CAS  PubMed  Google Scholar 

  10. Sulston, J. E., Schierenberg, E., White, J. G. & Thomson, J. N. The embryonic cell lineage of the nematode Caenorhabditis elegans. Dev. Biol. 100, 64–119 (1983).

    Article  CAS  PubMed  Google Scholar 

  11. Satina, S., Blakeslee, A. F. & Avery, A. G. Demonstration of the three germ layers in the shoot apex of Datura by means of induced polyploidy in periclinal chimeras. Am. J. Bot. 27, 895–905 (1940).

    Article  Google Scholar 

  12. Burk, L. G., Stewart, R. N. & Dermen, H. Histogenesis and genetics of a plastid-controlled chlorophyll variegation in tobacco. Am. J. Bot. 51, 713–724 (1964).

    Google Scholar 

  13. Stewart, R. N. & Burk, L. G. Independence of tissues derived from apical layers in ontogeny of the tobacco leaf and ovary. Am. J. Bot. 57, 1010–1016 (1970).

    Article  Google Scholar 

  14. Stewart, R. N., Semeniuk, P. & Dermen, H. Competition and accommodation between apical layers and their derivatives in the ontogeny of chimeral shoots of Pelargonium × hortorum. Am. J. Bot. 61, 54–67 (1974).

    Article  Google Scholar 

  15. Stewart, R. N. & Dermen, H. Ontogeny in monocotyledons as revealed by studies of the developmental anatomy of periclinal chloroplast chimeras. Am. J. Bot. 66, 47–58 (1979).

    Article  Google Scholar 

  16. Poethig, R. S. Clonal analysis of cell lineage patterns in plant development. Am. J. Bot. 74, 581–594 (1987).

    Article  Google Scholar 

  17. Furner, I. J., Ainscough, J. F. X., Pumfrey, J. A. & Petty, L. M. Clonal analysis of the late flowering fca mutant of Arabidopsis thaliana: cell fate and cell autonomy. Development 122, 1041–1050 (1996).

    Article  CAS  PubMed  Google Scholar 

  18. Furner, I. J. & Pumfrey, J. E. Cell fate in the shoot apical meristem of Arabidopsis thaliana. Development 115, 755–764 (1992).

    Article  Google Scholar 

  19. Irish, V. F. & Sussex, I. M. A fate map of the Arabidopsis embryonic shoot apical meristem. Development 115, 745–753 (1992).

    Article  Google Scholar 

  20. Stewart, R. N. & Dermen, H. Determination of number and mitotic activity of shoot apical initial cells by analysis of mericlinal chimeras. Am. J. Bot. 57, 816–826 (1970).

    Article  Google Scholar 

  21. Poethig, R. S. & Sussex, I. M. The cellular parameters of leaf development in tobacco: a clonal analysis. Planta 165, 170–184 (1985).

    Article  CAS  PubMed  Google Scholar 

  22. McDaniel, C. N. & Poethig, R. S. Cell-lineage patterns in the shoot apical meristem of the germinating maize embryo. Planta 175, 13–22 (1988).

    Article  CAS  PubMed  Google Scholar 

  23. Bossinger, G. & Smyth, D. R. Initiation patterns of flower and floral organ development in Arabidopsis thaliana. Development 122, 1093–1102 (1996).

    Article  CAS  PubMed  Google Scholar 

  24. Zhai, N. & Xu, L. CRE/LOX-based analysis of cell lineage during root formation and regeneration in Arabidopsis. aBIOTECH 1, 153–156 (2020).

    Article  PubMed  PubMed Central  Google Scholar 

  25. Saulsberry, A., Martin, P. R., O’Brien, T., Sieburth, L. E. & Pickett, F. B. The induced sector Arabidopsis apical embryonic fate map. Development 129, 3403–3410 (2002).

    Article  CAS  PubMed  Google Scholar 

  26. Chan, M. M. et al. Molecular recording of mammalian embryogenesis. Nature 570, 77–82 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Bowling, S. et al. An engineered CRISPR–Cas9 mouse line for simultaneous readout of lineage histories and gene expression profiles in single cells. Cell 181, 1410–1422.e27 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. Quinn, J. J. et al. Single-cell lineages reveal the rates, routes, and drivers of metastasis in cancer xenografts. Science 371, eabc1944 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Li, L. et al. A mouse model with high clonal barcode diversity for joint lineage, transcriptomic, and epigenomic profiling in single cells. Cell 186, 5183–5199.e22 (2023).

    Article  CAS  PubMed  Google Scholar 

  30. Baron, C. S. & van Oudenaarden, A. Unravelling cellular relationships during development and regeneration using genetic lineage tracing. Nat. Rev. Mol. Cell Biol. 20, 753–765 (2019).

    Article  CAS  PubMed  Google Scholar 

  31. Donà, M. et al. A versatile CRISPR-based system for lineage tracing in living plants. Plant J. 115, 1169–1184 (2023).

    Article  PubMed  Google Scholar 

  32. McKenna, A. et al. Whole-organism lineage tracing by combinatorial and cumulative genome editing. Science 353, aaf7907 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  33. Kalhor, R. et al. Developmental barcoding of whole mouse via homing CRISPR. Science 361, eaat9804 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  34. Salvador-Martínez, I., Grillo, M., Averof, M. & Telford, M. J. Is it possible to reconstruct an accurate cell lineage using CRISPR recorders?. eLife 8, e40292 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  35. Lu, X. et al. Development of an inducible DNA barcoding system to understand lineage changes in Arabidopsis regeneration. Dev. Cell 60, 305–319.e5 (2025).

    Article  CAS  PubMed  Google Scholar 

  36. Watson, J. M. et al. Germline replications and somatic mutation accumulation are independent of vegetative life span in Arabidopsis. Proc. Natl Acad. Sci. USA 113, 12226–12231 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Liu, K. et al. Mapping single-cell-resolution cell phylogeny reveals cell population dynamics during organ development. Nat. Methods 18, 1506–1514 (2021).

    Article  CAS  PubMed  Google Scholar 

  38. Deng, S., Gong, H., Zhang, D., Zhang, M. & He, X. A statistical method for quantifying progenitor cells reveals incipient cell fate commitments. Nat. Methods 21, 597–608 (2024).

    Article  CAS  PubMed  Google Scholar 

  39. Lu, Z. et al. Polyclonal-to-monoclonal transition in colorectal precancerous evolution. Nature 636, 233–240 (2024).

    Article  CAS  PubMed  Google Scholar 

  40. Frank, M. H. & Chitwood, D. H. Plant chimeras: the good, the bad, and the ‘Bizzaria’. Dev. Biol. 419, 41–53 (2016).

    Article  CAS  PubMed  Google Scholar 

  41. Grandjean, O. et al. In vivo analysis of cell division, cell growth, and differentiation at the shoot apical meristem in Arabidopsis. Plant Cell 16, 74–87 (2004).

    Article  CAS  PubMed  Google Scholar 

  42. Chandler, J. W., Jacobs, B., Cole, M., Comelli, P. & Werr, W. DORNRÖSCHEN-LIKE expression marks Arabidopsis floral organ founder cells and precedes auxin response maxima. Plant Mol. Biol. 76, 171–185 (2011).

    Article  CAS  PubMed  Google Scholar 

  43. Kwiatkowska, D. Flower primordium formation at the Arabidopsis shoot apex: quantitative analysis of surface geometry and growth. J. Exp. Bot. 57, 571–580 (2006).

    Article  CAS  PubMed  Google Scholar 

  44. Long, J. & Barton, M. K. Initiation of axillary and floral meristems in Arabidopsis. Dev. Biol. 218, 341–353 (2000).

    Article  CAS  PubMed  Google Scholar 

  45. Jenik, P. D. & Irish, V. F. Regulation of cell proliferation patterns by homeotic genes during Arabidopsis floral development. Development 127, 1267–1276 (2000).

    Article  CAS  PubMed  Google Scholar 

  46. Deng, S. & He, X. Tree of life at two levels: from species to cell. Sci. Bull. 68, 2515–2518 (2023).

    Article  Google Scholar 

  47. Xiong, X. et al. Split complementation of base editors to minimize off-target edits. Nat. Plants 9, 1832–1847 (2023).

    Article  CAS  PubMed  Google Scholar 

  48. Fonfara, I., Curth, U., Pingoud, A. & Wende, W. Creating highly specific nucleases by fusion of active restriction endonucleases and catalytically inactive homing endonucleases. Nucleic Acids Res. 40, 847–860 (2012).

    Article  CAS  PubMed  Google Scholar 

  49. Mol, C. D. et al. Crystal structure of human uracil-DNA glycosylase in complex with a protein inhibitor: protein mimicry of DNA. Cell 82, 701–708 (1995).

    Article  CAS  PubMed  Google Scholar 

  50. Duan, C. et al. Reduced intrinsic DNA curvature leads to increased mutation rate. Genome Biol. 19, 132 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  51. Wang, Y. et al. Mesoscale DNA feature in antibody-coding sequence facilitates somatic hypermutation. Cell 186, 2193–2207.e19 (2023).

    Article  CAS  PubMed  Google Scholar 

  52. Niu, Q. et al. Efficient A·T to G·C base conversions in dicots using adenine base editors expressed under the tomato EF1α promoter. Plant Biotechnol. J. 21, 5–7 (2023).

    Article  PubMed  Google Scholar 

  53. Nguyen, L. T., Schmidt, H. A., Von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).

    Article  CAS  PubMed  Google Scholar 

  54. Spanjaard, B. et al. Simultaneous lineage tracing and cell-type identification using CRISPR–Cas9-induced genetic scars. Nat. Biotechnol. 36, 469–473 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Liu, Z., Zeng, H., Xiang, H., Deng, S. & He, X. Achieving single-cell-resolution lineage tracing in zebrafish by continuous barcoding mutations during embryogenesis. J. Genet. Genomics 51, 947–956 (2024).

    Article  CAS  PubMed  Google Scholar 

  56. Minh, B. Q., Nguyen, M. A. T. & von Haeseler, A. Ultrafast approximation for phylogenetic bootstrap. Mol. Biol. Evol. 30, 1188–1195 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  57. Dolan, L. & Poethig, R. S. The Okra leaf shape mutation in cotton is active in all cell layers of the leaf. Am. J. Bot. 85, 322–327 (1998).

    Article  CAS  PubMed  Google Scholar 

  58. Zeng, M. et al. Stem cell lineage in body layer specialization and vascular patterning of rice root and leaf. Sci. Bull. 61, 847–858 (2016).

    Article  CAS  Google Scholar 

  59. Marcotrigiano, M. & Bernatzky, R. Arrangement of cell layers in the shoot apical meristems of periclinal chimeras influences cell fate. Plant J. 7, 193–202 (1995).

    Article  Google Scholar 

  60. Burian, A. Does shoot apical meristem function as the germline in safeguarding against excess of mutations? Front. Plant Sci. 12, 707740 (2021).

    Article  PubMed  PubMed Central  Google Scholar 

  61. Chen, Y., Burian, A. & Johannes, F. Somatic epigenetic drift during shoot branching: a cell lineage-based model. Genetics 227, iyae091 (2024).

    Article  CAS  PubMed  Google Scholar 

  62. Iwasa, Y., Tomimoto, S. & Satake, A. The genetic structure within a single tree is determined by the behavior of the stem cells in the meristem. Genetics 223, iyad020 (2023).

    Article  CAS  PubMed  Google Scholar 

  63. Su, Y. H. et al. Integration of pluripotency pathways regulates stem cell maintenance in the Arabidopsis shoot meristem. Proc. Natl Acad. Sci. USA 117, 22561–22571 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  64. Cao, X. et al. A self-activation loop maintains meristematic cell fate for branching. Curr. Biol. 30, 1893–1904.e4 (2020).

    Article  CAS  PubMed  Google Scholar 

  65. Shi, B. et al. Two-step regulation of a meristematic cell population acting in shoot branching in Arabidopsis. PLoS Genet. 12, e1006168 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  66. Landrein, B. et al. Mechanical stress contributes to the expression of the STM homeobox gene in Arabidopsis shoot meristems. eLife 4, e07811 (2015).

    Article  PubMed  PubMed Central  Google Scholar 

  67. Xia, K. et al. The single-cell stereo-seq reveals region-specific cell subtypes and transcriptome profiling in Arabidopsis leaves. Dev. Cell 57, 1299–1310.e4 (2022).

    Article  CAS  PubMed  Google Scholar 

  68. Szymkowiak, E. J. & Irish, E. E. Interactions between jointless and wild-type tomato tissues during development of the pedicel abscission zone and the inflorescence meristem. Plant Cell 11, 159–175 (1999).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  69. Szymkowiak, E. J. & Sussex, I. M. The internal meristem layer (L3) determines floral meristem size and carpel number in tomato periclinal chimeras. Plant Cell 4, 1089–1100 (1992).

    CAS  PubMed  PubMed Central  Google Scholar 

  70. Helizon, H. et al. Class I KNOX transcription factors promote differentiation of cambial derivatives into xylem fibers in the Arabidopsis hypocotyl. Development 141, 4311–4319 (2014).

    Article  PubMed  Google Scholar 

  71. Stanger, B. Z., Tanaka, A. J. & Melton, D. A. Organ size is limited by the number of embryonic progenitor cells in the pancreas but not the liver. Nature 445, 886–891 (2007).

    Article  CAS  PubMed  Google Scholar 

  72. Pineda-Krch, M. & Fagerström, T. On the potential for evolutionary change in meristematic cell lineages through intraorganismal selection. J. Evol. Biol. 12, 681–688 (1999).

    Article  Google Scholar 

  73. Tomimoto, S. & Satake, A. Modelling somatic mutation accumulation and expansion in a long-lived tree with hierarchical modular architecture. J. Theor. Biol. 565, 111465 (2023).

    Article  CAS  PubMed  Google Scholar 

  74. Prusinkiewicz, P., Erasmus, Y., Lane, B., Harder, L. D. & Coen, E. Evolution and development of inflorescence architectures. Science 316, 1452–1456 (2007).

    Article  CAS  PubMed  Google Scholar 

  75. Qin, T. et al. A population of stem cells with strong regenerative potential discovered in deer antlers. Science 379, 840–847 (2023).

    Article  CAS  PubMed  Google Scholar 

  76. Li, J.-F. et al. Comprehensive protein-based artificial microRNA screens for effective gene silencing in plants. Plant Cell 25, 1507–1522 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  77. Wang, Z. P. et al. Egg cell-specific promoter-controlled CRISPR/Cas9 efficiently generates homozygous mutants for multiple target genes in Arabidopsis in a single generation. Genome Biol. 16, 144 (2015).

    Article  PubMed  PubMed Central  Google Scholar 

  78. Ingham, D. J., Beer, S., Money, S. & Hansen, G. Quantitative real-time PCR assay for determining transgene copy number in transformed plants. Biotechniques 31, 132–140 (2001).

    Article  CAS  PubMed  Google Scholar 

  79. Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  80. Hashimshony, T. et al. CEL-Seq2: sensitive highly-multiplexed single-cell RNA-seq. Genome Biol. 17, 77 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  81. Zheng, G. X. Y. et al. Massively parallel digital transcriptional profiling of single cells. Nat. Commun. 8, 14049 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  82. Hao, Y. et al. Integrated analysis of multimodal single-cell data. Cell 184, 3573–3587.e29 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  83. Zhang, T. Q., Chen, Y. & Wang, J. W. A single-cell analysis of the Arabidopsis vegetative shoot apex. Dev. Cell 56, 1056–1074 (2021).

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

This study was supported by the National Natural Science Foundation of China (grant nos 32293191 to X.H., 32530019 to X.H., 32100485 to F.-N.X. and 32200494 to K.L.) and Guangdong Basic and Applied Basic Research Foundation (grant nos 2025A1515010629 to K.L. and 2022A1515010983 to F.-N.X.).

Author information

Authors and Affiliations

Authors

Contributions

F.-N.X., K.L., A.L. and Z.L. performed the experiments. F.-N.X., K.L., X.H. and J.W. analysed the data. X.H., J.-F.L. and Z.H. supervised the study. F.-N.X., K.L. and X.H. wrote the paper with input from all authors.

Corresponding author

Correspondence to Xionglei He.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Plants thanks Qikun Liu, Ning Zhai and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Design of SMALT in Arabidopsis.

(a) Genotyping of ung msh2 double mutant. (b) Anatomical structure of the shoot apical meristem (SAM) in ung msh2 double mutant. 10 plants were analyzed with one representative image shown. Vibratome sections of the SAM were stained with Calcofluor White to visualize cell walls. Images were acquired using a ZEISS LSM880 confocal microscopy. The boxed region in the left panel is magnified in the middle panel, with cell layers color-coded in the right panel. Bar = 50 um. (c) mRNA expression level of the base editors driven by different promoters. RNA was extracted from young flowers. For each design 3 plants were analyzed. (d) Design of T-DNA containing base editor or lineage barcode. BlpR, phosphinothricin acetyltransferase; HygR, hygromycin B phosphotransferase; LB, left border of T-DNA; RB, right border of T-DNA. (e) Identification of T-DNA insertions of three SMALT lines through Nanopore sequencing. Blue bar indicates T-DNA containing lineage barcode, red bar indicates T-DNA of base editor, grey bar represents genome. Junction reads mapping to insertion sites were displayed below. Arrows indicate primers used for genotyping, with detailed sequence provided in Supplementary Table 8. (f) Sanger sequencing identifying two linked feature sites to decouple Barcode A and Barcode B of Plant-2. (g) An example demonstrating how to decouple barcode sequences of Plant-2 into Barcode A and Barcode B.

Extended Data Fig. 2 Time-course profiling of base editor activity and mutation accumulation.

(a) Expression of the base editor in different developmental stages and organs. The base editor and GFP are linked by 2A peptide and under the driven of SlRPS5a promoter. 12 plants were analyzed with representative images of each organ shown. Images were captured by ZEIZZ LSM880 confocal microscopy. Bar = 100 µm. (b) mRNA expression level of the base editor in different stages and organs. 12 independent SMALT lines were analyzed. House-keeping gene ACTIN serves as an internal control. (c) Time-course mutation analysis. 6 rosette leaves (RL1 to RL6) were collected at the second, forth, and tenth week as indicated. The inflorescence (INF) and siliques (SQ) were also collected as representatives of reproductive organs. White arrowheads indicate collected samples. Bar = 1 cm. 4 individuals (Plant-A, -B, -C, -D) were sampled. For each sample, all sequenced barcodes were counted. Violin plots show mutation number per barcode in each sample. Blue curves indicate LOESS-smoothed fitted values of mutation number in vegetative and reproductive organs, respectively, with shaded regions representing the 99% confidence interval of the fit.

Extended Data Fig. 3 Interpretation of phylogeny under different resolution.

(a) Impact of various mutation rates on reconstructed phylogeny. During the developmental process form a zygote to 64 cells, if mutations accumulated in every cell cycle, a fully-resolved phylogeny can be reconstructed with serial hierarchical bifurcations; if no edits in some cell generations, the reconstructed phylogeny would be partially-resolved with polytomies; variation of mutation accumulation rate per generation affects branch length of the reconstructed phylogeny. (b) Definition and calculation of bifurcation resolving rate (BRR). Upper panel showcases a fully resolved phylogenetic tree with 16 terminal tips and 15 internal nodes. Bottom panel shows a partial resolved tree with 16 terminal tips and 9 internal nodes. The red nodes indicate polytomies caused by lack of resolution. (c) Down sampling of the 1-kb barcode of Plant-1 to reconstruct phylogenies. Down-sampling trees are presented with the full-length tree face-to-face, with the same terminal nodes linked. Cells from each shoot branch were distinguished by color. (d) One-founder and multi-founder models under complete or compromised resolution. Assuming the stem cell pool (only one germ layer is considered here) contains 16 founder cells with known lineage relationships. In the one-founder model (upper-left panel), one cell per germ layer is drawn to initiate a shoot branch. With complete resolution (upper-middle panel), cells of each shoot branch form a single branch-specific clade. If barcoding mutations fail to occur in certain cell generations (red window of the upper-right panel), the phylogeny would comprise polytomies in which branch-specific clades disappear and cells of different shoot branches mix together. In the multi-founder model (bottom-left panel), multiple neighboring founder cells with varying lineage distances initiate a shoot branch (bottom-middle panel). With complete barcoding resolution, multiple branch-specific clades per shoot branch are expected. If barcoding mutations fail to occur in certain cell generations (red window of the bottom-right panel), again, polytomies would be observed and cells of different branches would mix.

Extended Data Fig. 4 Phylogeny analysis of Plant-2.

(a) Schematic drawing showing sample collection of Plant-2. (b) Reconstructed phylogenies of Plant-2. Barcodes from different samples are color-highlighted by shoot branch in separate panels. Barcode A and Barcode B are separately considered. Details of barcoding mutations were presented on the right. The numbers in grey circles show the bootstrap value of each branch-specific clade. (c) Phylogenic topology of the shoot branches in Plant-2. The bootstrap value of each internal node is labeled in grey circles.

Extended Data Fig. 5 The three-cell pattern holds for the secondary and tertiary branches and siliques.

(a) Sample collection and reconstructed phylogeny of the rosette branch 1 (R1) of Plant-1. A total of 888 cell barcodes from the secondary and tertiary branches are combined with the original 1,337 cell barcodes to build a new cell phylogeny. Cell barcodes from R1 and its all secondary and tertiary branches were highlighted in color respectively. Numbers in grey circle show the bootstrap value of each branch-specific clade. (b) Sample collection and reconstructed phylogeny of the cauline branch 2 (C2) of Plant-1. A total of 789 cell barcodes from the secondary and tertiary branches are combined with the original 1,337 cell barcodes to build a new cell phylogeny. Cell barcodes from C2 and its all secondary and tertiary branches were highlighted in color respectively. Numbers in grey circle show the bootstrap value of each branch-specific clade. (c) Cell phylogeny of Plant-1 consisting of inflorescence (INF) barcodes and silique (SQ) barcodes from the indicated six shoot branches. Each panel highlights samples form one shoot branch, with black tips and colored tips indicate INF barcodes and SQ barcodes, respectively. Numbers in grey circle show the bootstrap value of each shoot branch-specific clade.

Extended Data Fig. 6 Single cell library construction and cell atlas.

(a) Plant-1 RFP signal examination in planta and protoplasts. (b) Experimental design of four single cell libraries. Each library contains flowers or leaf from four SMALT lines. Each SMALT line contributes two unopen flowers or one rosette leaf to single library. (c) Quality control of four single cell libraries. Number of genes per cell and number of UMIs per cell were displayed in histograms. Shadows indicated the filter windows. (d) Comparison between different data integration methods. Gene-cell matrices of four single cell libraries generated from CellRanger were simply merged into one Seurat object and followed by data normalization and scaling under three different functions. (e) Sankey diagram showing the library/tissue origin of each cell. (f) Global differentiation trajectory based on pseudotime estimation by Monocle 3.

Extended Data Fig. 7 cDNA barcode amplification and recovery.

(a) Schematic diagram demonstrating principle of three amplification strategies. (b) Step-by-step statistics of cDNA barcode recovery. Data were separately counted in TT, TN, or NN amplicons. (c) Number of CCS reads assigned to an original barcode sequence. Each dot represents a UMI-consensus 1-kb barcode sequence. Data were separately counted in TT, TN, or NN amplicons. All quality barcode sequences were counted. (d) Number of UMI-consensus 1-kb barcode sequence assigned to a single cell. Each dot represents a single cell. Data were separately counted in TT, TN, or NN amplicons. All quality cells were counted. (e) Number of UMI-consensus 1-kb barcode sequence recovered per cell. All quality cells were counted. UMI generated from TT, TN, or NN amplicons were integrated by cell. Box plots in (c,d,e) show the median (centre line), the 25th and 75th percentiles (box bounds), and whiskers extending to the most extreme values within 1.5× the interquartile range. Notches indicate the approximate 95% confidence interval of the median. Outliers are not shown. (f) Correlation between barcode expression level and barcode recovery rate. Barcode expression is counted by the 1-kb barcode sequence detected in scRNA-seq. Barcode recovery rate indicates the number of full-length cDNA barcode recovered within a cell. (g) UMAP showing the number and percentage of cells with quality lineage barcode recovered in each library. (h) Number of cells with quality lineage barcode recovered in each plant and library. (i) Mutation spectrum of cDNA barcode. (j) Base editing window of cDNA barcode.

Extended Data Fig. 8 Cell type annotation of three lineages in Plant-2 phylogeny.

Left panel shows the reconstructed phylogenies consisting of both cDNA and DNA barcodes of Plant-2. Black tips indicate cDNA barcodes from single cell libraries. Grey tips represent DNA barcodes from 7 tissue samples in Extended Data Fig. 4. Cell cluster annotation of cDNA barcodes is displayed on the right panel.

Extended Data Fig. 9 Extensive lineage analysis of embryonic and post-embryonic tissues.

(a) A larger phylogeny of Plant-1. 1000 barcode reads were sampled from each shoot branch and subject to phylogeny reconstruction. For samples with less than 1000 barcodes, all available cell barcodes were included. Each panel highlights cells from a single shoot branch. (b and c) Phylogeny analysis of embryonic tissues of Plant-1 (B) and Plant-3 (C). Barcode reads were sampled from shoot branches (post-embryonic tissues), cotyledon, primary root, and lateral root, respectively, for phylogeny reconstruction. The Ln-I, Ln-II, and Ln-III of post-embryonic tissues are highlighted based on mutation profiles. (b) In Plant-1, barcodes from cotyledons and roots are broadly distributed across the entire phylogeny, whereas barcodes from post-embryonic tissues form three major clusters, corresponding to Ln-I, Ln-II, and Ln-III, respectively. Polytomy near the root of the phylogeny indicates insufficient barcoding efficiency during the very early embryonic lineages, preventing interpretation of the relationships among cotyledons, roots, and post-embryonic tissues. (c) In Plant-3, SMALT system was active only at early developmental stages. The reconstructed phylogeny forms three major clades (embryonic clones) with large polytomies, enabling a coarse assignment of embryonic lineage relationships. All barcodes from post-embryonic tissues fall exclusively into Clade-1. Cotyledon barcodes occur both within Clade-1—mixed with post-embryonic barcodes—and in a distinct cluster in Clade-3. Root barcodes occupy the remainder of Clade-3 as well as the entirety of Clade-2. Within Clade-1, the three post-embryonic lineages (Ln-I, Ln-II, Ln-III) can already be distinguished based on mutation signatures learned from single-cell and germline data, suggesting that the three lineages/germ layers of the shoot apical meristem should be established very early during embryogenesis.

Supplementary information

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xia, FN., Liu, K., Wang, J. et al. Mapping the zygote-to-adult developmental cell phylogeny in Arabidopsis thaliana reveals a three-cell rule of branching. Nat. Plants (2026). https://doi.org/10.1038/s41477-026-02264-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • DOI: https://doi.org/10.1038/s41477-026-02264-1

Search

Quick links

Nature Briefing: Translational Research

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

Get what matters in translational research, free to your inbox weekly. Sign up for Nature Briefing: Translational Research