Fig. 5: Young TE‑derived TSSs preserve intrinsic promoter architecture to drive precise transcription initiation.

A Different subfamilies of TEs showed distinct distributions of TSS positional patterns (y-axis, standard deviation of relative TSS positions) and TSS potential (dot size, median SuRE enrichment) with respective to evolutionary age (x-axis, median sequence divergence). Each dot represents one TE subfamily, with statistics calculated across all TE-derived TSSs within that subfamily. B TSS positions (left top), sequence divergence (left bottom), and SuRE enrichment profiles (right) of TE-derived TSSs arising from different Alu (AluY, n = 587; AluS, n = 1636; AluJ, n = 771; top panel) and L1 (L1H, n = 68; L1P, n = 1184; L1M, n = 1736; bottom panel) subfamilies. In each boxplot, the center line indicates the median, the box represents the interquartile range, and the whiskers extend to the most extreme values within 1.5× the interquartile range. One-sided Wilcoxon rank-sum test P-values are shown.