Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Protocol
  • Published:

Comparative genomics approach to infer ancestral cell karyotypes and reconstruct the evolutionary trajectories of plant chromosomes

Abstract

Chromosomes are key structural and functional components of heredity. Reconstruction of ancestral cell karyotypes (ACKs) and evolutionary trajectories of chromosomes (CETs) can reveal how structural and functional changes in chromosomes have occurred during evolution. The whole-genome duplication integrated analysis toolkit implements a telomere-centric model on the basis of the comparative analysis of gene collinearity within and between plant genomes to reconstruct the ACKs and CETs of many angiosperm plants whose genomes have been complicated by repeated polyploidization and subsequent repatterning. Here we summarize the steps for using whole-genome duplication integrated analysis to infer the gene collinearity within a genome or between genomes and to infer the repeated polyploidization specific to a plant or common to multiple plants or plant families. In addition, we provide an example with three grass genomes. We also describe procedures to infer ancestral chromosomes at key evolutionary nodes, to reconstruct CETs from the deep past to extant plants and to generate event-related hierarchical alignment of multiple genomes, which is realized by deciphering collinear genes produced by different polyploidization or speciation events. The Protocol guides users to infer ACKs and CETs in a plant taxon and between selected plants from different taxa, which is crucial to understand important sources of genetic innovation including chromosome evolution, genome complexity and origination and evolution of duplicated genes. This Protocol requires minimal bioinformatics knowledge, for example, retrieving data from public databases and running Python programs. Completing the protocol with the example data takes around 8 h.

Key points

  • Whole-genome duplication integrated analysis implements a telomere-centric model of chromosome evolution, whereby telomeres are assumed to have a central role in chromosome structural rearrangements during plant evolution. The tool infers gene collinearity within and between plant genomes and uses it to obtain ancestral cell karyotypes and reconstruct evolutionary trajectories of chromosomes.

  • Unlike related tools that rely on gene synteny, whole-genome duplication integrated analysis infers collinear genes that are related to specific polyploidization and speciation events.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Chromosome rearrangements and corresponding dot plot patterns.
Fig. 2: Steps to infer ACKs and CETs of plants.
Fig. 3: Homologous gene dot plots and inferred gene collinearity.
Fig. 4: Homologous correspondence between chromosomes subjected to DNA rearrangement.
Fig. 5: Reconstructed ACKs and CETs of grasses.
Fig. 6: Hierarchical alignment of chromosomes from rice, sorghum and maize based on gene collinearity.

Similar content being viewed by others

Data availability

Genome sequences and annotations used in this protocol can be retrieved from the NCBI genome (https://www.ncbi.nlm.nih.gov/datasets/genome). The data for this protocol are wrapped as a zip file named ‘WGDI_protocol.zip’, which is available at http://47.92.161.27:8000/biaoge/backend/download.html. Source data are provided with this paper.

Code availability

WGDI is available via GitHub at https://github.com/SunPengChuan/wgdi-example. Programs used in this protocol and example datasets are included within the ‘WGDI_protocol.zip’ file, which is available at http://47.92.161.27:8000/biaoge/backend/download.html.

References

  1. Jaillon, O. et al. The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 449, 463–467 (2007).

    Article  CAS  PubMed  Google Scholar 

  2. Soltis, D. E., Soltis, P. S. & Tate, J. A. Advances in the study of polyploidy since Plant speciation. N. Phytol. 161, 173–191 (2004).

    Article  CAS  Google Scholar 

  3. Husband, B. C., Baldwin, S. J. & Suda, J. in Plant Genome Diversity Volume 2: Physical Structure, Behaviour and Evolution of Plant Genomes (eds Greilhuber, J. et al.) 255–276 (Springer, 2013).

  4. Wang, J. et al. Recursive paleohexaploidization shaped the durian genome. Plant Physiol. 179, 209–219 (2019).

    Article  CAS  PubMed  Google Scholar 

  5. Chalhoub, B. et al. Plant genetics. Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome. Science 345, 950–953 (2014).

    Article  CAS  PubMed  Google Scholar 

  6. Sato, S. et al. The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485, 635–641 (2012).

    Article  CAS  Google Scholar 

  7. Wang, X. et al. The genome of the mesopolyploid crop species Brassica rapa. Nat. Genet. 43, 1035–1039 (2011).

    Article  CAS  PubMed  Google Scholar 

  8. Liu, S. et al. The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes. Nat. Commun. 5, 3930 (2014).

    Article  CAS  PubMed  Google Scholar 

  9. Shi, T. et al. The slow-evolving Acorus tatarinowii genome sheds light on ancestral monocot evolution. Nat. Plants 8, 764–777 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Pont, C. et al. Paleogenomics: reconstruction of plant evolutionary trajectories from modern and ancient DNA. Genome Biol. 20, 29 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  11. Salse, J. et al. Reconstruction of monocotelydoneous proto-chromosomes reveals faster evolution in plants than in animals. Proc. Natl Acad. Sci. USA 106, 14908–14913 (2009).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Murat, F. et al. Shared subgenome dominance following polyploidization explains grass genome evolutionary plasticity from a seven protochromosome ancestor with 16K protogenes. Genome Biol. Evol. 6, 12–33 (2014).

    Article  PubMed  Google Scholar 

  13. Sun, Y., Shang, L., Zhu, Q. H., Fan, L. & Guo, L. Twenty years of plant genome sequencing: achievements and challenges. Trends Plant Sci. 27, 391–401 (2022).

    Article  CAS  PubMed  Google Scholar 

  14. Wang, X. et al. Telomere-centric genome repatterning determines recurring chromosome number reductions during the evolution of eukaryotes. N. Phytol. 205, 378–389 (2015).

    Article  CAS  Google Scholar 

  15. Zhuang, W. et al. Reply to: evaluating two different models of peanut’s origin. Nat. Genet. 52, 560–563 (2020).

    Article  CAS  PubMed  Google Scholar 

  16. Wang, S. et al. High-quality reference genome sequences of two coconut cultivars provide insights into evolution of monocot chromosomes and differentiation of fiber content and plant height. Genome Biol. 22, 304 (2021).

    Article  PubMed  PubMed Central  Google Scholar 

  17. Zhang, Z. et al. An updated explanation of ancestral karyotype changes and reconstruction of evolutionary trajectories to form Camelina sativa chromosomes. BMC Genom. 21, 705 (2020).

    Article  CAS  Google Scholar 

  18. Song, X. et al. The celery genome sequence reveals sequential paleo-polyploidizations, karyotype evolution and resistance gene reduction in apiales. Plant Biotechnol. J. 19, 731–744 (2021).

    Article  CAS  PubMed  Google Scholar 

  19. Xie, D. et al. The wax gourd genomes offer insights into the genetic diversity and ancestral cucurbit karyotype. Nat. Commun. 10, 5158 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  20. Wang, X. & Wang, Z. Evolutionary genomics model of chromosome number reduction and B chromosome production. SCIENTIA SINICA Vitae 50, 524–537 (2020).

    Article  Google Scholar 

  21. Wang, J. et al. Comparative genomics analysis of rice and pineapple contributes to understand the chromosome number reduction and genomic changes in grasses. Front Genet. 7, 174 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  22. Wang, J. et al. Deciphering recursive polyploidization in Lamiales and reconstructing their chromosome evolutionary trajectories. Plant Physiol. 195, 2143–2157 (2024).

    Article  CAS  PubMed  Google Scholar 

  23. Sun, P. et al. WGDI: a user-friendly toolkit for evolutionary analyses of whole-genome duplications and ancestral karyotypes. Mol. Plant 15, 1841–1851 (2022).

    Article  CAS  PubMed  Google Scholar 

  24. Murat, F. et al. Karyotype and gene order evolution from reconstructed extinct ancestors highlight contrasts in genome plasticity of modern rosid crops. Genome Biol. Evol. 7, 735–749 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Cai, J. J. & Osorio, D. in Computational Advances in Bio and Medical Sciences Lecture Notes in Computer Science (eds Bansal, M. S. et al.) Ch. 8, 88–94 (Springer, 2021).

  26. Gao, S. et al. IAGS: inferring ancestor genome structure under a wide range of evolutionary scenarios. Mol. Biol. Evol. 39, msac041 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Mascher, M. et al. A chromosome conformation capture ordered sequence of the barley genome. Nature 544, 427–433 (2017).

    Article  CAS  PubMed  Google Scholar 

  28. Costanzo, K. B. V. Centromere structure and function. Prog. Mol. Subcell Biol. 56, 515–539 (2017).

    Article  PubMed  PubMed Central  Google Scholar 

  29. Wang, Y. et al. Detection of colinear blocks and synteny and evolutionary analyses based on utilization of MCScanX. Nat. Protoc. 19, 2206–2229 (2024).

    Article  CAS  PubMed  Google Scholar 

  30. Wang, Y. et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 40, e49 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Wang, X. et al. Statistical inference of chromosomal homology based on gene colinearity and applications to Arabidopsis and rice. BMC Bioinform. 7, 447 (2006).

    Article  Google Scholar 

  32. Tang, H. et al. tanghaibao/jcvi: JCVI v0.7.5 (v0.7.5). Zenodo https://doi.org/10.5281/zenodo.846919 (2017).

  33. Zhuang, W. et al. The genome of cultivated peanut provides insight into legume karyotypes, polyploid evolution and crop domestication. Nat. Genet. 51, 865–876 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Wang, J. et al. An overlooked paleotetraploidization in Cucurbitaceae. Mol. Biol. Evol. 35, 16–26 (2018).

    Article  CAS  PubMed  Google Scholar 

  35. Wang, X. et al. Genome alignment spanning major poaceae lineages reveals heterogeneous evolutionary rates and alters inferred dates for key evolutionary events. Mol. Plant 8, 885–898 (2015).

    Article  CAS  PubMed  Google Scholar 

  36. Song, X. et al. Deciphering the high-quality genome sequence of coriander that causes controversial feelings. Plant Biotechnol. J. 18, 1444–1456 (2020).

    Article  CAS  PubMed  Google Scholar 

  37. Wang, J. et al. Paleo-polyploidization in Lycophytes. Genom. Proteom. Bioinform. 18, 333–340 (2020).

    Article  Google Scholar 

  38. Huang, H. -R. et al. Telomere-to-telomere haplotype-resolved reference genome reveals subgenome divergence and disease resistance in triploid Cavendish banana. Hortic. Res. 10, uhad153 (2023).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Jiang, B. et al. Characterization and heterologous reconstitution of Taxus biosynthetic enzymes leading to baccatin III. Science 383, 622–629 (2024).

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

This work was supported by research grants from the National Natural Science Foundation of China to X.W. (grant no. 32070669) and to Z.W. (grant no. 32000405), the Talented Scientist program from the Tangshan Municipal Bureau Human Resources and Social Security to X.W. (grant no. 16013601) and the Tangshan Science and Technology Program Project (grant no. 21130228) to Z.W.

Author information

Authors and Affiliations

Authors

Contributions

X.W. conceived and led the research and prepared the manuscript. B.S., W.L., Y.J., L.Y., T.W., M.Y., J.W., F.H., H.Q., H.Z., Z.W. and Y.J. analyzed the data and prepared the figures; B.S. and X.W. provided the description of WGDI; F.H. and J.W. analyzed the data related to partial monocots and eudicots, respectively, and W.L., Y.J., T.W. and Y.L. improved the programs inferring ACKs and CETs to ensure user-friendly output.

Corresponding author

Correspondence to Xiyin Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Protocols thanks Kai Ye and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Key references

Wang, J. et al. Mol. Biol. Evol. 35, 16–26 (2018): https://doi.org/10.1093/molbev/msx242

Sun, P. et al. Mol. Plant 15, 1841–1851 (2022): https://doi.org/10.1016/j.molp.2022.10.018

Wang, S. et al. Genome Biol. 22, 304 (2021): https://doi.org/10.1186/s13059-021-02522-9

Supplementary information

Supplementary Information

Supplementary Figs. 1–4 and text.

Supplementary Data 1

Original data sets for readers to repeat the analysis described in the manuscript and the Supplementary text.

Supplementary Data 2

A demo for the protocol to run WGDI to infer ancestral chromosomes and evolutionary trajectories to form the chromosomes in extant plants.

Supplementary Data 3

Source data for Supplementary Figs. 1–3.

Source data

Source data of Figs. 3c–g and 6

Statistical source data.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, X., Song, B., Liu, W. et al. Comparative genomics approach to infer ancestral cell karyotypes and reconstruct the evolutionary trajectories of plant chromosomes. Nat Protoc (2025). https://doi.org/10.1038/s41596-025-01173-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1038/s41596-025-01173-5

Search

Quick links

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics