Fig. 2: Genomic organization of baobab. | Nature Communications

Fig. 2: Genomic organization of baobab.

From: Chromosome-level baobab genome illuminates its evolutionary trajectory and environmental adaptation

Fig. 2

a Predominant and large centromere array with a base unit of 158 bp with higher order repeats (HORs) of 314 and 468 bp. b Comparative box plots showing telomere length distribution between baobab (Ad77271a and Ad77271b) and eight other randomly selected plant species. Baobabs have long telomeres (AAACCCT), reaching a maximum length of 30 kb, which aligns with the renowned longevity of baobabs81. The box plots display the median (line inside the box), the 25th and 75th percentiles (bounds of the box), and the minimum and maximum values (whiskers). Sample sizes (n) represent the number of biological replicates (reads) for each species: Thlaspi arvense (n = 562), Prunus armeniaca (n = 590), Arabidopsis thaliana (n = 16), Zea mays (n = 62), Gossypium (n = 1366), Salix dunnii (n = 1917), Artocarpus nanchuanensis (n = 605), Acer rubrum (n = 1650), A. digitata AdPI77271a (n = 344), A digitata AdPI77271b (n = 578). Data represent biological replicates, and no technical replicates were used. c Plot showing elevated DNA methylation (5-methylcytosine) levels in centromeric regions of chromosome 1. d Increased occurrence of tandem repeats within the centromeric region. e The meta-centromeric region of chromosome 1 is shown on a pairwise sequence identity heatmap. The scale below shows the percentages of sequence similarity. The blue dotted lines show regions along the chromosome. f Global distribution of genome features in A. digitata. Centromere arrays are plotted as dark green bars at y = 0.8. Genes are represented in blue, Mutator transposons in green, CACTA transposons in red, Gypsy retrotransposons in purple, Copia retrotransposons in orange, and CpG methylation in black. Transposable elements, such as Mutator transposons, tend to be located opposite to genes, and in the centromeric regions, Methylation is generally present throughout the genome. Plot window positions are divided by the chromosome total length to create a relative scale between 0 and 1. Source data are provided as a Source Data file.

Back to article page