Abstract
The centromere paradox, in which functionally conserved centromeres exhibit rapid evolution, has long intrigued geneticists and evolutionary biologists. Despite its importance, the centromeric landscape remains poorly understood due to the lack of complete assemblies. Here we dissect the dynamic evolution of Brassica centromeres by generating telomere-to-telomere genome assemblies from seven morphotypes of B. rapa (AA) and the two tetraploids B. juncea (AABB) and B. napus (AACC). Pan-centromere analysis reveals that Brassica centromeres are extensively invaded by retrotransposons and show remarkable diversity in size and structure. While A- and C-genome centromeres feature distinct patterns of satellites, B-genome centromeres are devoid of satellites. The centromeric satellite expansion in the C-genome is reminiscent of the layered expansions observed in human centromeres. Accordingly, we propose a working model of centromere evolution reconstructing the key evolutionary events leading to current Brassica centromere structures. These insights will illuminate plant centromere evolution and guide the design of crop synthetic chromosomes.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$32.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 digital issues and online access to articles
$119.00 per year
only $9.92 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to the full article PDF.
USD 39.95
Prices may be subject to local taxes which are calculated during checkout






Similar content being viewed by others
Data availability
The raw sequencing data and the genome assembly have been deposited at the National Center for Biotechnology Information under project number PRJNA1116321.
References
Naish, M. & Henderson, I. R. The structure, function, and evolution of plant centromeres. Genome Res. 34, 161–178 (2024).
Talbert, P. B. & Henikoff, S. The genetics and epigenetics of satellite centromeres. Genome Res. 32, 608–615 (2022).
Malik, H. S. & Henikoff, S. Major evolutionary transitions in centromere complexity. Cell 138, 1067–1082 (2009).
Presting, G. G. Centromeric retrotransposons and centromere function. Curr. Opin. Genet. Dev. 49, 79–84 (2018).
Wlodzimierz, P. et al. Cycles of satellite and transposon evolution in Arabidopsis centromeres. Nature 618, 557–565 (2023).
Nagaharu, U. Genome analysis in Brassica with special reference to the experimental formation of B. napus and peculiar mode of fertilization. Jpn. J. Bot. 7, 389–452 (1935).
Cai, X. et al. Impacts of allopolyploidization and structural variation on intraspecific diversification in Brassica rapa. Genome Biol. 22, 166 (2021).
Wang, X. et al. The genome of the mesopolyploid crop species Brassica rapa. Nat. Genet. 43, 1035–1039 (2011).
Zhang, L. et al. A near-complete genome assembly of Brassica rapa provides new insights into the evolution of centromeres. Plant Biotechnol. J. 21, 1022–1032 (2023).
Zhou, Y. et al. The complexity of structural variations in Brassica rapa revealed by assembly of two complete T2T genomes. Sci. Bull. 69, 2346–2351 (2024).
Lim, K. B. et al. Characterization of the centromere and peri-centromere retrotransposons in Brassica rapa and their distribution in related Brassica species. Plant J. 49, 173–183 (2007).
Lim, K. B. et al. Characterization of rDNAs and tandem repeats in the heterochromatin of Brassica rapa. Mol. Cells 19, 436–444 (2005).
Wang, G. X. et al. ChIP-cloning analysis uncovers centromere-specific retrotransposons in Brassica nigra and reveals their rapid diversification in Brassica allotetraploids. Chromosoma 128, 119–131 (2019).
Zhao, J. et al. Centromere repositioning and shifts in wheat evolution. Plant Commun. 4, 100556 (2023).
Chang, X. et al. High-quality Gossypium hirsutum and Gossypium barbadense genome assemblies reveal the landscape and evolution of centromeres. Plant Commun. 5, 100722 (2023).
Cheng, H., Concepcion, G. T., Feng, X., Zhang, H. & Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat. Methods 18, 170–175 (2021).
Naish, M. et al. The genetic and epigenetic landscape of the Arabidopsis centromeres. Science 374, eabi7489 (2021).
Altemose, N. et al. Complete genomic and epigenetic maps of human centromeres. Science 376, eabl4178 (2022).
Xie, L. et al. Genetic diversity and evolution of rice centromeres. Preprint at bioRxiv https://doi.org/10.1101/2024.07.28.605524 (2024).
Vollger, M. R., Kerpedjiev, P., Phillippy, A. M. & Eichler, E. E. StainedGlass: interactive visualization of massive tandem repeat structures with identity heatmaps. Bioinformatics 38, 2049–2051 (2022).
Yang, T. et al. Sinapis genomes provide insights into whole-genome triplication and divergence patterns within tribe Brassiceae. Plant. J. 113, 246–261 (2023).
Xu, L. et al. A chromosome-level genome assembly of radish (Raphanus sativus L.) reveals insights into genome adaptation and differential bolting regulation. Plant Biotechnol. J. 21, 990–1004 (2023).
Koch, M. A. & Lemmel, C. Zahora, a new monotypic genus from tribe Brassiceae (Brassicaceae) endemic to the Moroccan Sahara. PhytoKeys 135, 119–135 (2019).
Liu, S. et al. The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes. Nat. Commun. 5, 3930 (2014).
Perumal, S. et al. A high-contiguity Brassica nigra genome localizes active centromeres and defines the ancestral Brassica genome. Nat. Plants 6, 929–941 (2020).
Wang, G. et al. Characterization of CENH3 proteins and centromere-associated DNA sequences in diploid and allotetraploid Brassica species. Chromosoma 120, 353–365 (2011).
Guo, N. et al. A graph-based pan-genome of Brassica oleracea provides new insights into its domestication and morphotype diversification. Plant Commun. 5, 100791 (2024).
Tsukahara, S. et al. Centrophilic retrotransposon integration via CENH3 chromatin in Arabidopsis. Nature 637, 744–748 (2025).
Town, C. D. et al. Comparative genomics of Brassica oleracea and Arabidopsis thaliana reveal gene loss, fragmentation, and dispersal after polyploidy. Plant Cell 18, 1348–1359 (2006).
Hoang, N. V. et al. Expanding the triangle of U: comparative analysis of the Hirschfeldia incana genome provides insights into chromosomal evolution, phylogenomics and high photosynthesis-related traits. Ann. Bot. 135, 1243–1247 (2025).
Chen, J. et al. A complete telomere-to-telomere assembly of the maize genome. Nat. Genet. 55, 1221–1231 (2023).
Li, B. et al. Wheat centromeric retrotransposons: the new ones take a major role in centromeric structure. Plant J. 73, 952–965 (2013).
Gao, X., Hou, Y., Ebina, H., Levin, H. L. & Voytas, D. F. Chromodomains direct integration of retrotransposons to heterochromatin. Genome Res. 18, 359–369 (2008).
Shen, F., Xu, S., Shen, Q., Bi, C. & Lysak, M. A. The allotetraploid horseradish genome provides insights into subgenome diversification and formation of critical traits. Nat. Commun. 14, 4102 (2023).
Chen, C. et al. Three near-complete genome assemblies reveal substantial centromere dynamics from diploid to tetraploid in Brachypodium genus. Genome Biol. 25, 63 (2024).
Gao, S. et al. The centromere landscapes of four karyotypically diverse Papaver species provide insights into chromosome evolution and speciation. Cell Genom. 4, 100626 (2024).
Su, H. et al. Centromere satellite repeats have undergone rapid changes in polyploid wheat subgenomes. Plant Cell 31, 2035–2051 (2019).
Li, B. et al. A gap-free reference genome reveals structural variations associated with flowering time in rapeseed (Brassica napus). Hortic. Res. 10, uhad171 (2023).
Belton, J. M. et al. Hi-C: a comprehensive technique to capture the conformation of genomes. Methods 58, 268–276 (2012).
Hu, J. et al. NextDenovo: an efficient error correction and accurate assembly tool for noisy long reads. Genome Biol. 25, 107 (2024).
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Cabanettes, F. & Klopp, C. D-GENIES: dot plot large genomes in an interactive, efficient and simple way. PeerJ 6, e4958 (2018).
Hu, J., Fan, J., Sun, Z. & Liu, S. NextPolish: a fast and efficient genome polishing tool for long-read assembly. Bioinformatics 36, 2253–2255 (2020).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Durand, N. C. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 3, 95–98 (2016).
Dudchenko, O. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95 (2017).
Durand, N. C. et al. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst. 3, 99–101 (2016).
Goel, M., Sun, H., Jiao, W. B. & Schneeberger, K. SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies. Genome Biol. 20, 277 (2019).
Manni, M., Berkeley, M. R., Seppey, M., Simao, F. A. & Zdobnov, E. M. BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol. Biol. Evol. 38, 4647–4654 (2021).
Rhie, A., Walenz, B. P., Koren, S. & Phillippy, A. M. Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies. Genome Biol. 21, 245 (2020).
Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinform. 4, 4–10 (2009).
Xu, Z. & Wang, H. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res. 35, W265–W268 (2007).
Ellinghaus, D., Kurtz, S. & Willhoeft, U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinform. 9, 18 (2008).
Ou, S. & Jiang, N. LTR_retriever: a highly accurate and sensitive program for identification of long terminal repeat retrotransposons. Plant Physiol. 176, 1410–1422 (2018).
Zhang, R. G. et al. TEsorter: an accurate and fast method to classify LTR-retrotransposons in plant genomes. Hortic. Res. 9, uhac017 (2022).
Cantarel, B. L. et al. MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res. 18, 188–196 (2008).
Li, W. & Godzik, A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658–1659 (2006).
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).
Lomsadze, A., Ter-Hovhannisyan, V., Chernoff, Y. O. & Borodovsky, M. Gene identification in novel eukaryotic genomes by self-training algorithm. Nucleic Acids Res. 33, 6494–6506 (2005).
Stanke, M., Tzvetkova, A. & Morgenstern, B. AUGUSTUS at EGASP: using EST, protein and genomic alignments for improved gene prediction in the human genome. Genome Biol. 7, S11 (2006).
Nevers, Y. et al. Quality assessment of gene repertoire annotations with OMArk. Nat. Biotechnol. 43, 124–133 (2025).
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie2. Nat. Methods 9, 357–359 (2012).
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Ramirez, F. et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 44, W160–W165 (2016).
Ni, P. et al. Genome-wide detection of cytosine methylations in plant from Nanopore data using deep learning. Nat. Commun. 12, 5976 (2021).
Simpson, J. T. et al. Detecting DNA cytosine methylation using nanopore sequencing. Nat. Methods 14, 407–410 (2017).
Danecek, P. et al. Twelve years of SAMtools and BCFtools. GigaScience 10, giab008 (2021).
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
Emms, D. M. & Kelly, S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 20, 238 (2019).
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Nguyen, L. T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Wang, Y. et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 40, e49 (2012).
Tang, H. et al. Synteny and collinearity in plant genomes. Science 320, 486–488 (2008).
Lovell, J. T. et al. GENESPACE tracks regions of interest and gene copy number variation across multiple genomes. eLife 11, e78526 (2022).
Acknowledgements
This work was supported by the Shandong Provincial Natural Science Foundation (grant no. SYS202206 to L.G.), the Taishan Scholars Program (L.G.) and the Natural Science Foundation for Distinguished Young Scholars of Shandong Province (grant no. ZR2023JQ010 to L.G.).
Author information
Authors and Affiliations
Contributions
L.G. conceived and supervised the project. W.C. performed the genome assemblies and pan-centromere analysis. J.W. performed the phylogenetic analysis. S.C. performed the DNA methylation analysis. D.M. conducted the CENH3 ChIP–seq experiment. H.F. provided the Chinese cabbage doubled haploid line ‘Futian’. Y.M. maintained the plant materials. L.Z. participated in the discussion of the results. W.C. and L.G. wrote the paper. All authors read and approved the final version of the paper.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Plants thanks Alexandros Bousios, Marcus Koch and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Genome assembly statistics and validation.
a, Assembly statistics of the ten Brassica genome used in this study. The dashed lines represent the contig N50 index. b, c, Whole-genome coverage of HiFi reads across A1 Futian (b) and A5 turnip (c) genome assembly. The regions of telomere, centromeric CEN176 and CEN238 arrays, 45S rDNA and 5S rDNA arrays and organelle insertions were marked on the bottom track.
Extended Data Fig. 2 Genome assembly and centromere structure of the allotetraploid B. juncea (AABB).
a, Circos plot of the AB genome assembly. Quantitative tracks were calculated in 100-kb bins. Full descriptions for the tracks are also available in the main Fig. 1 legend. b, The structure of B03 centromere and statistics of CENH3 log2(ChIP/input), CpG and CHG methylation, Copia density, and Gypsy density. c, StainedGlass heatmap of the pairwise sequence identity (%) between 5-kb bins. A CEN176 satellite relict was identified at the boundary of B08 centromere.
Extended Data Fig. 3 Genome assembly and centromere structure of the allotetraploid B. napus (AACC).
a, Circos plot of the AC genome assembly. Quantitative tracks were calculated in 100-kb bins. Track from outer to inner: chromosomes (red: centromere), GC content, gene density, Gypsy density, Copia density, CEN176 density, rDNA density, CpG methylation level estimated by ONT reads and colour ribbons representing syntenic blocks. b, The structure of C02 centromere and statistics of satellite density (red, forward; blue, reverse orientation), satellite edit distance relative to CentBr1 and CentBr2, Copia density, and Gypsy density. c, StainedGlass heatmaps of the pairwise sequence identity (%) between 5-kb bins. Three representative centromeres of C subgenome were exhibited.
Extended Data Fig. 4 Dynamics of CEN176 satellites in Brassica crops.
a, A maximum-likelihood phylogenetic tree of 6,487 representative (>50 occurrences in the pan-centromere collection) CEN176 satellites. b, Pairwise edit distance of top one (the highest occurrences) CEN176 monomer in each family. c, Consensus sequences of the CEN176 satellites in each family. The asterisk indicates the distinct bases between two major groups (I and II). d, Copy number of the CEN176 satellites across each chromosome and each accession. Seventeen families are coloured according to the taxonomic groups.
Extended Data Fig. 5 Presence of rDNA-derived CEN238/CEN208 repeat array in B. rapa/S. alba.
a, A circos plot of tandem repeats identified in the S. alba draft genome assembly. Shading is coloured according to repeat length (bp). Histogram corresponds to repeat families with different lengths. b, Alignment of two 45S rDNA units in B. rapa and S. alba, respectively. The BrCEN238 and SalCEN208 were probably originated from subrepeat array of corresponding 45S rDNA. c, Genome collinearity between B. rapa and S. alba. The blue labels mark the chromosomes with rDNA-derived repeat array. d, Dot plots comparing the representative centromeres from B. rapa and S. alba assemblies. The black and blue boxes represent the BrCEN176/SalCEN178 and BrCEN238/SalCEN208 repeat arrays, respectively.
Extended Data Fig. 6 Sequence alignment of CENH3 proteins and centromeric satellites.
a, b, Sequence alignment of centromeric satellites (a) and CENH3 proteins (b) from A. thaliana, R. sativus, S. alba, S. arvensis, B. rapa B. oleracea, and B. nigra. The B. nigra and S. arvensis CENH3 proteins has a deletion (red box) corresponding to the fourth and fifth exons of BrCENH3.
Extended Data Fig. 7 Edit distance of centromeric satellites relative to CentBr1 and CentBr2.
The edit distances of centromeric satellites in C subgenome were calculated. The previous X5A assembly showed similar results except for the C04 centromere. The box in the lower right corner shows the proposed centromere evolution model in C sub/genome.
Extended Data Fig. 8 Phylogenetic analysis of intact Ale retrotransposons in Brassica crops.
a-c, Ale phylogeny across Brassica crops and R. sativus. Tree labels are coloured according to sample accession (a), position inside or outside the centromeres (b) and the LTR insertion time (c). Scale bar, 0.5 substitutions per site. d, The insertion time of Ale retrotransposons among different sample accessions. The edges and centerlines of the boxes represent the interquartile range and medians. A two-sided Wilcoxon test was used to determine the significant levels. The insertion time of centrophilic Ale in A genome was significantly younger than these in the B (P = 6.6 × 10−7) and C genomes (P = 9.21 × 10−37).
Extended Data Fig. 9 Genome rearrangements and centromere evolution of B. rapa and B. oleracea.
a, Karyotype evolution from previous reported ancestral genome to current B. rapa and B. oleracea genomes. Syntenic blocks are ‘painted’ with colours corresponding to ancestral chromosomes. Black bars represent ancestral centromeric regions. b, Chromosome alignments illustrate the emergence of centromeric CEN176 and CEN238 arrays in A and C sub/genome.
Supplementary information
Supplementary Information (download PDF )
Supplementary Figs. 1–24, Notes 1 and 2 and references.
Supplementary Tables (download XLSX )
Supplementary Tables 1–10.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Chen, W., Wang, J., Chen, S. et al. Pan-centromere landscape and dynamic evolution in Brassica plants. Nat. Plants 11, 2240–2253 (2025). https://doi.org/10.1038/s41477-025-02131-5
Received:
Accepted:
Published:
Version of record:
Issue date:
DOI: https://doi.org/10.1038/s41477-025-02131-5


